[PPT] - EECS 504: Foundations of Computer Vision Andrew Owens Course staff PowerPoint Presentation

SLIDE 1

EECS 504: Foundations of Computer Vision Andrew Owens

SLIDE 2

Haozhu Wang Graduate student (GSI) Anthony Liang Instructional aide (IA) Bingqi Sun Instructional aide (IA)

Course staff

SLIDE 3

Interacting with us

Ask questions on Piazza.
Sign up: https://bit.ly/36mfYeP
Submit written work to Gradescope
Office hours on website:

SLIDE 4

Course website

https://web.eecs.umich.edu/~ahowens/eecs504/w20/

SLIDE 5

Grading

Assignments (70%)
Final project (30%)
No exams!

SLIDE 6

Assignments

Weekly homework assignments (≈10 total)
Due each Tuesday at midnight
Late submissions penalized 30% per day
You have 5 "late days”
Assignments should be done independently.
Encouraged to discuss them
Programming and writing should all be yours

SLIDE 7

Assignments

Mix of programming and written problems
Python + numerical computing libraries (numpy, scipy, etc.)
PyTorch for deep learning
Linear algebra and multivariable calculus
Jupyter notebooks and Google Colab for problem sets

SLIDE 8

Assignments

SLIDE 9

Project

Open-ended! Example projects:
Implement and extend a recent computer vision paper
Use computer vision in your research
We’ll also provide a list of project ideas
Work in small groups (up to 4 people)
Complete in last month of class.
Project proposal (after spring break)
Short presentation (finals period)
Writeup (finals period)

SLIDE 10

Readings

http://szeliski.org/Book

https://www.deeplearningbook.org

Manuscript chapters by Torralba, Freeman, and Isola (on course website). Class based

n this coursework.

And also occasional paper readings

SLIDE 11

Class topics

SLIDE 12

}

Signal processing Apple + orange = ?

Homework problem:

…

SLIDE 13

…

Intro to deep learning

}

SLIDE 14

Learning for vision

}

Spring break

Beach

…

SLIDE 15

…

Cameras,

ptics, motion

}

Homework problem:

SLIDE 16

Advanced topics and applications

}

SLIDE 17

Today

1. A bit of vision history
2. Why vision is hard
3. A simple visual system

SLIDE 18

Medical applications Robotics Driving Mobile devices 3D modeling

Exciting times for computer vision

Accessibility

Slide credit: Torralba, Freeman, Isola

SLIDE 19

To see

“What does it mean, to see? The plain man's answer (and Aristotle's, too) would be, to know what is where by looking.” To discover from images what is present in the world, where things are, what actions are taking place, to predict and anticipate events in the world.

Slides from MIT 6.869 class by Torralba, Freeman, and Isola

SLIDE 20

Slide credit: Torralba, Freeman, Isola

SLIDE 21

[“HOGgles”, Vondrick et al. , ICCV 2013]

Just a few years ago…

SLIDE 22

[“Mask RCNN”, He et al., ICCV 2017]

Slide credit: Torralba, Freeman, Isola

SLIDE 23

[“GauGAN”, Park et al., CVPR 2019]

SLIDE 24

Different signals, same methods

Time Amplitude

Sound WiFi

(Zhao et al. 2019)

Touch

(Calandra et al. 2018)

SLIDE 25

Input video

(Owens and Efros 2018)

SLIDE 26

On-screen audio

(Owens and Efros 2018)

SLIDE 27

(Owens and Efros 2018)

Off-screen audio

SLIDE 28

What makes vision hard?

SLIDE 29

To see: perception vs. measurement

Slide credit: Torralba, Freeman, Isola

SLIDE 30

Slide credit: Torralba, Freeman, Isola

To see: perception vs. measurement

SLIDE 31

Sinha & Adelson 93

Other ambiguities

Slide credit: Antonio Torralba

SLIDE 32

Sinha & Adelson 93

Other ambiguities

Slide credit: Antonio Torralba

SLIDE 33

A simple visual system

A simple world
A simple image formation model
A simple goal

SLIDE 34

Slide credit: Antonio Torralba

A simple world

SLIDE 35

A simple world

Slide credit: Antonio Torralba

SLIDE 36

A simple image formation model

Simple world rules:

Surfaces can be horizontal or vertical.
Objects will be resting on a white horizontal ground plane

Slide credit: Antonio Torralba

SLIDE 37

A simple image formation model

Camera plane World reference system

Slide credit: Antonio Torralba

SLIDE 38

A simple image formation model

X + x0 cos(θ) Y – sin(θ) Z + y0 x = y = image coordinates World coordinates Image and projection of the world  coordinate axes into the image plane World coordinates image   coordinates

Slide credit: Antonio Torralba

SLIDE 39

A simple goal

Recover the 3D structure of the world

We want to recover X(x,y), Y(x,y), Z(x,y) using as input I(x,y)

Slide credit: Antonio Torralba

SLIDE 40

Edges

Occlusion Change of Surface orientation Contact edge Shadow boundary Vertical 3D edge Horizontal 3D edge

Slide credit: Antonio Torralba

SLIDE 41

Treating the image as a function

y x I(x,y)

255

Slide credit: Antonio Torralba

SLIDE 42

Finding edges in the image

Image gradient: Approximation image derivative: Edge strength Edge orientation: Edge normal: I(x,y)

Slide credit: Antonio Torralba

SLIDE 43

Finding edges in the image

E(x,y) n(x,y) and I(x,y)

Slide credit: Antonio Torralba

SLIDE 44

Edge classification

Figure/ground segmentation

– Using the fact that objects have  color

Occlusion edges

– Occlusion edges are

wned by

the foreground

Contact edges

Slide credit: Antonio Torralba

SLIDE 45

From edges to surface constraints

Y(x,y) Z(x,y) X(x,y)

?

Slide credit: Antonio Torralba

SLIDE 46

From edges to surface constraints

Ground
Contact edge

… now things get a bit more complicated. Y(x,y) = 0 if (x,y) belongs to a ground pixel Y(x,y) = 0 if (x,y) belongs to foreground and is a contact edge

What happens inside the objects?

Slide credit: Antonio Torralba

SLIDE 47

From edges to surface constraints

Vertical edges

Z = constant along the edge X + x0 cos(θ) Y – sin(θ) Z + y0 x = y = image coordinates World coordinates How can we relate the information in the pixels with 3D surfaces in the world? Given the image, what can we say about X, Y and Z in the pixels that belong  to a vertical edge?

Slide credit: Antonio Torralba

SLIDE 48

From edges to surface constraints

Horizontal edges

Y = constant along the edge Where t is the vector parallel to the edge X + x0 cos(θ) Y – sin(θ) Z + y0 x = y = image coordinates World coordinates Given the image, what can we say about X, Y and Z in the pixels that belong  to an horizontal 3D edge?

Slide credit: Antonio Torralba

SLIDE 49

From edges to surface constraints

What happens where there are no edges?

? Assumption of planar faces: Information has to be propagated from the edges

Slide credit: Antonio Torralba

SLIDE 50

A simple inference scheme

All the constraints are linear!

Y(x,y) = 0 if (x,y) belongs to a ground pixel if (x,y) belongs to a vertical edge if (x,y) belongs to an horizontal edge if (x,y) is not on an edge

A similar set of constraints could be derived for Z

Slide credit: Antonio Torralba

SLIDE 51

Discrete approximation

We can transform every differential constraint into a linear constraint on Y(x,y)

Y(x,y)

111 115 113 111 112 111 112 111 135 138 137 139 145 146 149 147 163 168 188 196 206 202 206 207 180 184 206 219 202 200 195 193 189 193 214 216 104 79 83 77 191 201 217 220 103 59 60 68 195 205 216 222 113 68 69 83 199 203 223 228 108 68 71 77

dY dx ≈ Y(x,y) – Y(x-1,y)

Slide credit: Antonio Torralba

SLIDE 52

Discrete approximation

Y(x,y) Transform the “image” Y(x,y) into a column vector:

1

1 x=0  y=0

dY dx ≈ Y(x,y) – Y(x-1,y) = Y(2,2) – Y(1,2)= x=2, y=2

Slide credit: Antonio Torralba

SLIDE 53

A simple inference scheme: solve for Y

=

A Y = b

Constraint weights Y b

Slide credit: Antonio Torralba

SLIDE 54

Results

X Y Z Edge normals Edge strength 3D orientation Depth discontinuities Contact edges

Slide credit: Antonio Torralba

SLIDE 55

Changing view point

Input New view points:

Slide credit: Antonio Torralba

SLIDE 56

Failure cases… even in a simple world!

SLIDE 57

Failure cases… even in a simple world!

SLIDE 58

Failure cases… even in a simple world!

Edges Input image

Missing edges Extra edges

More on this next week!

SLIDE 59

EECS 504: Foundations of Computer Vision Andrew Owens

Course staff

Interacting with us

Course website

Grading

Assignments

Assignments

Assignments

Project

Readings

Class topics

}

Signal processing Apple + orange = ?

Homework problem:

Intro to deep learning

}

Learning for vision

}

Beach

Cameras,

}

Homework problem:

Advanced topics and applications

}

Today

Exciting times for computer vision

To see

Just a few years ago…

Different signals, same methods

Input video

On-screen audio

Off-screen audio

What makes vision hard?

To see: perception vs. measurement

To see: perception vs. measurement

Other ambiguities

Other ambiguities

A simple visual system

A simple world

A simple world

A simple image formation model

A simple image formation model

A simple image formation model

A simple goal

Edges

Treating the image as a function

Finding edges in the image

Finding edges in the image

Edge classification

From edges to surface constraints

From edges to surface constraints

From edges to surface constraints

From edges to surface constraints

From edges to surface constraints

A simple inference scheme

Discrete approximation

Discrete approximation

A simple inference scheme: solve for Y

Results

Changing view point

Failure cases… even in a simple world!

Failure cases… even in a simple world!

Failure cases… even in a simple world!

More on this next week!

Questions about the course?