[PPT] - 3D Photography: Stereo Vision Kalin Kolev, Marc Pollefeys Spring PowerPoint Presentation

SLIDE 1

3D Photography: Stereo Vision

Kalin Kolev, Marc Pollefeys Spring 2013

http://cvg.ethz.ch/teaching/2013spring/3dphoto/

SLIDE 2

Feb 18 Introduction Feb 25 Lecture: Geometry, Camera Model, Calibration Mar 4 Lecture: Features, Tracking/Matching Mar 11

Project Proposals by Students

Mar 18 Lecture: Epipolar Geometry Mar 25

Lecture: Stereo Vision

Apr 1 Easter Apr 8 Short lecture “SfM / SLAM” + 2 papers Apr 15

Project Updates

Apr 22 Short lecture “Active Ranging, Structured Light” + 2 papers Apr 29 Short lecture “Volumetric Modeling” + 2 papers May 6 Short lecture “Mesh-based Modeling” + 2 papers May 13 Short lecture “Shape-from-X” + 2 papers May 20 Pentecost / White Monday May 27

Final Demos

Schedule (tentative)

SLIDE 3

Stereo & Multi-View Stereo

http://cat.middlebury.edu/stereo/ Tsukuba dataset

SLIDE 4

Stereo

Standard stereo geometry
Stereo matching
Correlation
Optimization (DP, GC)
General camera configuration
Rectifications
Plane-sweep
Multi-view stereo

SLIDE 5

Stereo

SLIDE 6

Occlusions

(S lide from Pascal Fua)

SLIDE 7

Exploiting scene constraints

SLIDE 8

Ordering constraint

1 2 3 4,5 6 1 2,3 4 5 6 2 1 3 4,5 6 1 2,3 4 5 6

surface slice surface as a path

cclusion right
cclusion left

SLIDE 9

Uniqueness constraint

In an image pair each pixel has at most
ne corresponding pixel
In general one corresponding pixel
In case of occlusion there is none

SLIDE 10

Disparity constraint

surface slice surface as a path

bounding box

use reconstructed features to determine bounding box

constant disparity surfaces

SLIDE 11

Stereo matching

Optimal path (dynamic programming ) Similarity measure (SSD or NCC) Constraints

epipolar
ordering
uniqueness
disparity limit

Trade-off

Matching cost (data)
Discontinuities (prior)

Consider all paths that satisfy the constraints pick best using dynamic programming

SLIDE 12

Hierarchical stereo matching

Downsampling

(Gaussian pyramid)

Disparity propagation

Allows faster computation Deals with large disparity ranges

SLIDE 13

Disparity map

image I(x,y) image I´(x´,y´) Disparity map D(x,y)

(x´,y´)=(x+D(x,y),y)

SLIDE 14

SLIDE 15

Energy minimization

(S lide from Pascal Fua)

SLIDE 16

Graph Cut

(S lide from Pascal Fua)

(general formulation requires multi-way cut!)

SLIDE 17

(Boykov et al ICCV‘ 99) (Roy and Cox ICCV‘ 98)

Simplified graph cut

SLIDE 18

SLIDE 19

Stereo matching with general camera configuration

SLIDE 20

Image pair rectification

SLIDE 21

Planar rectification

Bring two views to standard stereo setup (moves epipole to ∞) (not possible when in/close to image)

~ image size

(calibrated)

Distortion minimization

(uncalibrated)

SLIDE 22

SLIDE 23

Polar re-paramet erizat ion around epipoles Requires only (orient ed) epipolar geomet ry Preserve lengt h of epipolar lines Choose ∆θ so t hat no pixels are compressed

riginal image

rectified image

Polar rectification

(Pollefeys et al. ICCV’99)

Works for all relative motions Guarantees minimal image size

SLIDE 24

polar rectification planar rectification

riginal

image pair

SLIDE 25

Example: Béguinage of Leuven

Does not work with standard Homography-based approaches

SLIDE 26

Stereo camera configurations

(S lide from Pascal Fua)

SLIDE 27

Multi-baseline, multi-resolution
At each depth, baseline and resolution

selected proportional to that depth

Allows to keep depth accuracy constant!

Variable Baseline/Resolution Stereo

(Gallup et al., CVPR08)

SLIDE 28

Variable Baseline/Resolution Stereo: comparison

SLIDE 29

Multi-view depth fusion

Compute depth for every

pixel of reference image

Triangulation
Use multiple views
Up- and down sequence
Use Kalman filter

(Koch, Pollefeys and Van Gool. ECCV‘ 98)

Allows to compute robust texture

SLIDE 30

Plane-sweep multi-view matching

Simple algorithm for multiple cameras
no rectification necessary
doesn’t deal with occlusions

Collins’96; Roy and Cox’98 (GC); Yang et al.’02/’03 (GPU)

SLIDE 31

Space Carving

SLIDE 32

3D Reconstruction from Calibrated Images

Scene Volume

V

Input Images (Calibrated)

Goal: Determine transparency, radiance of points in V

SLIDE 33

Discrete Formulation: Voxel Coloring

Discretized Scene Volume Input Images (Calibrated)

Goal: Assign RGBA values to voxels in V

photo-consistent with images

SLIDE 34

Complexity and Computability

Discretized Scene Volume

N voxels C colors

3

All Scenes (CN3) Photo-Consistent Scenes True Scene

SLIDE 35

Issues

Theoretical Questions

Identify class of all photo-consistent scenes

Practical Questions

How do we compute photo-consistent models?

SLIDE 36

1. C= 2 (silhouettes)

Volume intersection [Martin 81, Szeliski 93]

2. C unconstrained, viewpoint constraints

Voxel coloring algorithm [Seitz & Dyer 97]

3. General Case

Space carving [Kutulakos & Seitz 98]

Voxel Coloring Solutions

SLIDE 37

Reconstruction from Silhouettes (C = 2)

Binary Images

Approach:

Backproject each silhouette Intersect backprojected volumes

SLIDE 38

Voxel Algorithm for Volume Intersection

Color voxel black if on silhouette in every image

O(MN3), for M images, N3 voxels Don’t have to search 2N3 possible scenes!

SLIDE 39

Properties of Volume Intersection

Pros

Easy to implement, fast
Accelerated via octrees [Szeliski 1993]

Cons

No concavities
Reconstruction is not photo-consistent
Requires identification of silhouettes

SLIDE 40

1. C= 2 (silhouettes)

Volume intersection [Martin 81, Szeliski 93]

2. C unconstrained, viewpoint constraints

Voxel coloring algorithm [Seitz & Dyer 97]

3. General Case

Space carving [Kutulakos & Seitz 98]

Voxel Coloring Solutions

SLIDE 41

1. Choose voxel
2. Project and correlate
3. Color if consistent

Voxel Coloring Approach

Visibility Problem: in which images is each voxel visible?

SLIDE 42

Layers

Depth Ordering: visit occluders first!

Scene Traversal

Condition: depth order is view-independent

SLIDE 43

Compatible Camera Configurations

Depth-Order Constraint

Scene outside convex hull of camera centers

Outward-Looking

cameras inside scene

Inward-Looking

cameras above scene

SLIDE 44

Calibrated Image Acquisition

Calibrated Turntable

360° rotation (21 images)

Selected Dinosaur Images Selected Flower Images

SLIDE 45

Voxel Coloring Results

Dinosaur Reconstruction

72 K voxels colored 7.6 M voxels tested 7 min. to compute

n a 250MHz SGI

Flower Reconstruction

70 K voxels colored 7.6 M voxels tested 7 min. to compute

n a 250MHz SGI

SLIDE 46

Limitations of Depth Ordering

A view-independent depth order may not exist

p q

Need more powerful general-case algorithms

Unconstrained camera positions Unconstrained scene geometry/topology

SLIDE 47

1. C= 2 (silhouettes)

Volume intersection [Martin 81, Szeliski 93]

2. C unconstrained, viewpoint

constraints

Voxel coloring algorithm [Seitz & Dyer 97]

3. General Case

Space carving [Kutulakos & Seitz 98]

Voxel Coloring Solutions

SLIDE 48

Space Carving Algorithm

Image 1 Image N

…...

Initialize to a volume V containing the true scene Repeat until convergence Choose a voxel on the current surface Carve if not photo-consistent Project to visible input images

SLIDE 49

Convergence

Consistency Property

The resulting shape is photo-consistent

all inconsistent points are removed

Convergence Property

Carving converges to a non-empty shape

a point on the true scene is never removed

V’ V

p

SLIDE 50

What is Computable?

The Photo Hull is the UNION of all photo-consistent scenes in V

It is a photo-consistent scene reconstruction
Tightest possible bound on the true scene
Computable via provable Space Carving Algorithm

True Scene V Photo Hull V

SLIDE 51

Space Carving Algorithm

The Basic Algorithm is Unwieldy

Complex update procedure

Alternative: Multi-Pass Plane Sweep

Efficient, can use texture-mapping

hardware

Converges quickly in practice
Easy to implement

SLIDE 52

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

True Scene Reconstruction

SLIDE 53

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 54

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 55

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 56

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 57

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 58

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 59

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 60

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 61

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 62

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 63

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 64

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 65

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 66

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 67

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 68

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 69

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 70

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 71

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 72

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 73

Multi-Pass Plane Sweep

Sweep plane in each of 6 principle directions
Consider cameras on only one side of plane
Repeat until convergence

SLIDE 74

Space Carving Results: African Violet

I nput I mage (1 of 45) Reconstruction Reconstruction Reconstruction

SLIDE 75

Space Carving Results: Hand

I nput I mage (1 of 100) Views of Reconstruction

SLIDE 76

Other Features

Coarse-to-fine Reconstruction

Represent scene as octree
Reconstruct low-res model first, then refine

Hardware-Acceleration

Use texture-mapping to compute voxel

projections

Process voxels an entire plane at a time

Limitations

Need to acquire calibrated images
Restriction to simple radiance models
Bias toward maximal (fat) reconstructions
Transparency not supported

SLIDE 77

SLIDE 78

voxel occluded

Probal robalistic S Space pace C Carvi arving

Broadhurst et al. ICCV’01

SLIDE 79

I

Light Intensity Object Color

N

Normal vector

L

Lighting vector

V

View Vector

R

Reflection vector

color of the light Diffuse color Saturation point 1 1 1

Reflected Light in RGB color space Dielectric Materials (such as plastic and glass)

C 

Space-carving for specular surfaces

(Yang, Pollefeys & Welch 2003)

Extended photoconsistency:

SLIDE 80

Experiment

SLIDE 81

Animated Views

Our result

SLIDE 82

Volumetric Graph cuts

ρ(x)

1. Outer surface
2. Inner surface (at

constant offset)

3. Discretize

middle volume

4. Assign

photoconsistency cost to voxels

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 83

Volumetric Graph cuts

Source Sink

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 84

Volumetric Graph cuts

Source Sink Cost of a cut ≈ ∫∫ ρ(x) dS S

S

cut ⇔ 3D Surface S

[Boykov and Kolmogorov ICCV 2001]

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 85

Volumetric Graph cuts

Source Sink Minimum cut ⇔ Minimal 3D Surface under photo-consistency metric

[Boykov and Kolmogorov ICCV 2001]

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 86

Photo-consistency

Occlusion
1. Get nearest point
n outer surface
2. Use outer

surface for

cclusions

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 87

Photo-consistency

Occlusion

Self occlusion

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 88

Photo-consistency

Occlusion

Self occlusion

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 89

Photo-consistency

Occlusion

N threshold on angle between normal and viewing direction threshold= ~60°

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 90

Photo-consistency

Score

Normalised cross correlation Use all remaining cameras pair wise

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 91

Photo-consistency

Score

Average NCC = C Voxel score ρ = 1 - exp( -tan2[π(C-1)/4] / σ2 )

0 ≤ ρ ≤ 1 σ = 0.05 in all experiments

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 92

Example

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 93

Example - Visual Hull

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 94

Example - Slice

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 95

Example - Slice with graphcut

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 96

Example – 3D

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 97

Shrinking Bias

‘Balooning’ force
favouring bigger volumes that fill the visual hull

L.D. Cohen and I. Cohen. Finite-element methods for active contour models and balloons for 2-d and 3-d

images. PAMI, 15(11):1131–1147, November 1993.

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 98

Shrinking Bias

‘Balooning’ force
favouring bigger volumes that fill the visual hull

L.D. Cohen and I. Cohen. Finite-element methods for active contour models and balloons for 2-d and 3-d images. PAMI, 15(11):1131– 1147, November 1993.

∫∫

ρ(x) dS - λ ∫∫∫ dV S V

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 99

Shrinking Bias

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 100

Shrinking Bias

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 101

wij SOURCE

wb wb

Graph

h j i

wb = λh3 wij = 4/3πh2 * (ρi+ρj)/2

[Boykov and Kolmogorov ICCV 2001]

Slides from [Vogiatzis et al. CVPR2005]

SLIDE 102

102

Address Memory and Computational Overhead

(Sinha et. al. 2007)

– Compute Photo-consistency only where it is needed – Detect Interior Pockets using Visibility

Graph-cut on Dual of Adaptive Tetrahedral Mesh

SLIDE 103

Interesting comparison of multi- view stereo approaches

http://vision.middlebury.edu/mview/