Instance-level recognition I. - Camera geometry and image alignment - - PowerPoint PPT Presentation

instance level recognition i camera geometry and image
SMART_READER_LITE
LIVE PREVIEW

Instance-level recognition I. - Camera geometry and image alignment - - PowerPoint PPT Presentation

Reconnaissance dobjets et vision artificielle 2011 Instance-level recognition I. - Camera geometry and image alignment Josef Sivic http://www.di.ens.fr/~josef INRIA, WILLOW, ENS/INRIA/CNRS UMR 8548 Laboratoire dInformatique, Ecole


slide-1
SLIDE 1

Instance-level recognition I. - Camera geometry and image alignment

Josef Sivic

http://www.di.ens.fr/~josef INRIA, WILLOW, ENS/INRIA/CNRS UMR 8548 Laboratoire d’Informatique, Ecole Normale Supérieure, Paris

With slides from: S. Lazebnik, J. Ponce, and A. Zisserman

Reconnaissance d’objets et vision artificielle 2011

slide-2
SLIDE 2

Class webpage: http://www.di.ens.fr/willow/teaching/recvis11/

http://www.di.ens.fr/willow/teaching/recvis11/

slide-3
SLIDE 3

Object recognition and computer vision 2011

Class webpage: http://www.di.ens.fr/willow/teaching/recvis11/ Grading:

  • 3 programming assignments (60%)
  • Panorama stitching
  • Image classification
  • Basic face detector
  • Final project (40%)

More independent work, resulting in the report and a class presentation.

slide-4
SLIDE 4

Matlab tutorial

Friday 30/09/2011 at 10:30-12:00. The tutorial will be at 23 avenue d'Italie - Salle Rose. Come if you have no/limited experience with Matlab.

slide-5
SLIDE 5

Research

Both WILLOW (J. Ponce, I. Laptev, J. Sivic) and LEAR (C. Schmid) groups are active in computer vision and visual recognition research. http://www.di.ens.fr/willow/ http://lear.inrialpes.fr/ with close links to SIERRA – machine learning (F. Bach) http://www.di.ens.fr/sierra/ There will be master internships available. Talk to us if you are interested.

slide-6
SLIDE 6

Outline

Part I - Camera geometry – image formation

  • Perspective projection
  • Affine projection
  • Projection of planes

Part II - Image matching and recognition with local features

  • Correspondence
  • Semi-local and global geometric relations
  • Robust estimation – RANSAC and Hough Transform
slide-7
SLIDE 7

Reading: Part I. Camera geometry

Forsyth&Ponce – Chapters 1 and 2 Hartley&Zisserman – Chapter 6: “Camera models”

slide-8
SLIDE 8

Motivation: Stitching panoramas

slide-9
SLIDE 9

Feature-based alignment outline

slide-10
SLIDE 10

Feature-based alignment outline

Extract features

slide-11
SLIDE 11

Feature-based alignment outline

Extract features Compute putative matches

slide-12
SLIDE 12

Feature-based alignment outline

Extract features Compute putative matches Loop:

  • Hypothesize transformation T (small group of putative

matches that are related by T)

slide-13
SLIDE 13

Feature-based alignment outline

Extract features Compute putative matches Loop:

  • Hypothesize transformation T (small group of putative

matches that are related by T)

  • Verify transformation (search for other matches

consistent with T)

slide-14
SLIDE 14

Feature-based alignment outline

Extract features Compute putative matches Loop:

  • Hypothesize transformation T (small group of putative

matches that are related by T)

  • Verify transformation (search for other matches

consistent with T)

slide-15
SLIDE 15

2D transformation models

Similarity (translation, scale, rotation) Affine Projective (homography)

Why these transformations ???

slide-16
SLIDE 16

Camera geometry

slide-17
SLIDE 17

Images are two-dimensional patterns of brightness values. They are formed by the projection of 3D objects.

slide-18
SLIDE 18

Animal eye: a looonnng time ago. Pinhole perspective projection: Brunelleschi, XVth Century. Camera obscura: XVIth Century. Photographic camera: Niepce, 1816.

slide-19
SLIDE 19
slide-20
SLIDE 20

Massaccio’s Trinity, 1425

Pompei painting, 2000 years ago. Van Eyk, XIVth Century Brunelleschi, 1415

slide-21
SLIDE 21

Pinhole Perspective Equation NOTE: z is always negative..

Camera center Image plane (retina) Principal axis Camera co-

  • rdinate system

P = (x,y,z)T P'= (x',y')T y' f ' z y

O

World point Imaged point

slide-22
SLIDE 22

Affine projection models: Weak perspective projection is the magnification. When the scene relief is small compared its distance from the Camera, m can be taken constant: weak perspective projection.

slide-23
SLIDE 23

Affine projection models: Orthographic projection When the camera is at a (roughly constant) distance from the scene, take m=1.

slide-24
SLIDE 24

Strong perspective: Angles are not preserved The projections of parallel lines intersect at one point

slide-25
SLIDE 25

From Zisserman & Hartley

slide-26
SLIDE 26

Strong perspective: Angles are not preserved The projections of parallel lines intersect at one point Weak perspective: Angles are better preserved The projections of parallel lines are (almost) parallel

slide-27
SLIDE 27

Beyond pinhole camera model: Geometric Distortion

slide-28
SLIDE 28

Rectification

slide-29
SLIDE 29

Radial Distortion Model

slide-30
SLIDE 30

Perspective Projection x,y: World coordinates x’,y’: Image coordinates f: pinhole-to-retina distance Weak-Perspective Projection (Affine) x,y: World coordinates x’,y’: Image coordinates m: magnification Orthographic Projection (Affine) x,y: World coordinates x’,y’: Image coordinates Common distortion model x’,y’: Ideal image coordinates x’’,y’’: Actual image coordinates

slide-31
SLIDE 31

Cameras and their parameters

Images from M. Pollefeys

slide-32
SLIDE 32

The Intrinsic Parameters of a Camera Normalized Image Coordinates Physical Image Coordinates Units: k,l : pixel/m f : m α,β : pixel

slide-33
SLIDE 33

The Intrinsic Parameters of a Camera Calibration Matrix The Perspective Projection Equation

slide-34
SLIDE 34

Notation Euclidean Geometry

slide-35
SLIDE 35

Recall: Coordinate Changes and Rigid Transformations

slide-36
SLIDE 36

The Extrinsic Parameters of a Camera

WP = W x W y W z

         

u v 1           = 1 z m12 m12 m13 m14 m21 m22 m23 m24 m31 m32 m33 m34          

W x W y W z

1            

slide-37
SLIDE 37

Explicit Form of the Projection Matrix Note: M is only defined up to scale in this setting!!

slide-38
SLIDE 38

Weak perspective (affine) camera

zr = m3

TP = const.

u v 1           = 1 zr m1

T

m2

T

m3

T

          P = m1

TP /m3 TP

m2

TP /m3 TP

m3

TP /m3 TP

         

u v       = a11 a12 a13 b

1

a21 a22 a23 b2      

W x W y W z

1             =AWP + b

b2×1 A2×3

u v       = 1 zr m1

T

m2

T

      P

slide-39
SLIDE 39

Observations: is the equation of a plane of normal direction a1

  • From the projection equation, it is also

the plane defined by: u = 0

  • Similarly:
  • (a2,b2) describes the plane defined by: v = 0
  • (a3,b3) describes the plane defined by:

 That is the plane parallel to image plane passing through the pinhole (z = 0) – principal plane

Geometric Interpretation

a1

T

X Y Z           + b

1 = 0

Projection equation:

slide-40
SLIDE 40

u v a3 C

Geometric Interpretation: The rows of the projection matrix describe the three planes defined by the image coordinate system

a1 a2

slide-41
SLIDE 41

Other useful geometric properties Principal axis of the camera: The ray passing through the camera centre with direction vector a3 a3

slide-42
SLIDE 42

Other useful geometric properties Depth of points: How far a point lies from the principal plane of a camera? a3 P

d(P,Π3) = a3

T wx wy wz

          + b3 a3 = m3

TP

a3 = m3

TP

If ||a3||=1

=

wx wy wz

1            

But for general camera matrices:

  • need to be careful about the sign.
  • need to normalize matrix to have

||a3||=1

slide-43
SLIDE 43

p P Other useful geometric properties Q: Can we compute the position of the camera center Ω? A: Q: Given an image point p, what is the direction of the corresponding ray in space? A:

Hint: Start from the projection equation. Show that the right null-space of camera matrix M is the camera center. Hint: Start from a projection equation and write all points along direction w, that project to point p.

slide-44
SLIDE 44

Re-cap: imaging and camera geometry (with a slight change of notation)

  • perspective projection
  • camera centre, image point and

scene point are collinear

  • an image point back projects to a

ray in 3-space

  • depth of the scene point is

unknown camera centre image plane image point scene point

C X

x

Slide credit: A. Zisserman

slide-45
SLIDE 45

Slide credit: A. Zisserman

slide-46
SLIDE 46

How a “scene plane” projects into an image?

slide-47
SLIDE 47

Plane projective transformations

Slide credit: A. Zisserman

slide-48
SLIDE 48

Projective transformations continued

  • This is the most general transformation between the world

and image plane under imaging by a perspective camera.

  • It is often only the 3 x 3 form of the matrix that is important in

establishing properties of this transformation.

  • A projective transformation is also called a ``homography''

and a ``collineation''.

  • H has 8 degrees of freedom.

Slide credit: A. Zisserman

slide-49
SLIDE 49

Planes under affine projection

x1 x2       = a11 a12 a13 b

1

a21 a22 a23 b2       x y 1             = a11 a12 a21 a22       x y       + b

1

b2       = A2×2P + b2×1

Points on a world plane map with a 2D affine geometric transformation (6 parameters)

slide-50
SLIDE 50
  • Affine projections induce affine

transformations from planes

  • nto their images.
  • Perspective projections

induce projective transformations from planes

  • nto their images.

Summary

slide-51
SLIDE 51

2D transformation models

Similarity (translation, scale, rotation) Affine Projective (homography)

slide-52
SLIDE 52

When is homography a valid transformation model?

slide-53
SLIDE 53

Case I: Plane projective transformations

Slide credit: A. Zisserman

slide-54
SLIDE 54

Case I: Projective transformations continued

  • This is the most general transformation between the world

and image plane under imaging by a perspective camera.

  • It is often only the 3 x 3 form of the matrix that is important in

establishing properties of this transformation.

  • A projective transformation is also called a ``homography''

and a ``collineation''.

  • H has 8 degrees of freedom.

Slide credit: A. Zisserman

slide-55
SLIDE 55

Case II: Cameras rotating about their centre

image plane 1 image plane 2

  • The two image planes are related by a homography H
  • H depends only on the relation between the image

planes and camera centre, C, not on the 3D structure

P = K [ I | 0 ] P’ = K’ [ R | 0 ] H = K’ R K^(-1)

Slide credit: A. Zisserman

slide-56
SLIDE 56

Case II: Cameras rotating about their centre

image plane 1 image plane 2

P = K[I |0] P'= K'[R |0] x = K[I |0] X 1       = KX ⇒ X = K−1x x'= K'[R |0] X 1       = K'RX x'= K'RK−1

H

   x

Slide credit: A. Zisserman