Learning and Using Image Manifolds Robert Pless Associate - - PowerPoint PPT Presentation

learning and using image manifolds
SMART_READER_LITE
LIVE PREVIEW

Learning and Using Image Manifolds Robert Pless Associate - - PowerPoint PPT Presentation

Learning and Using Image Manifolds Robert Pless Associate Professor of Computer Science What are image manifolds? Sets of images that locally have only a few degrees of freedom. For example, 8 s from the NIST character


slide-1
SLIDE 1

Learning and Using Image Manifolds

Robert Pless Associate Professor of Computer Science

slide-2
SLIDE 2
slide-3
SLIDE 3

What are image manifolds?

  • Sets of images that

locally have only a few degrees of freedom.

  • For example, “8’s”

from the NIST character database

slide-4
SLIDE 4

Example 2

slide-5
SLIDE 5

What is manifold learning?

  • Given unorganized images from low-dimensional manifold,

assign each image a low-dimensional coordinate.

slide-6
SLIDE 6

Overloading of “Manifold Learning”

  • 1. Given data that comes from a (very) low

dimensional manifold, give each data point parameters that reflect relative positions on that manifold.

  • 2. Algorithms for (statistical) learning, when you

know your data lies on some (perhaps not low dimensional) manifold in the underlying space.

my talk is about 1, but both are represented in later talks tpday.

slide-7
SLIDE 7

[ ] ≈ [ ]

[ ]

M = B C M ≈ B C

PCA: Learning linear manifolds (on video data)

slide-8
SLIDE 8

[ ] ≈ [ ]

[ ]

M ≈ B C

coefficient 1 coefficient 2 “time”

slide-9
SLIDE 9

Principal Component Analysis

Basis images

Coordinates for each image

slide-10
SLIDE 10

Instead of projection, start with similarity measure between images. Use many images rather than many features in each image.

Similarity Based Image Analysis

slide-11
SLIDE 11

Image distances to low-dimensional locations

Suppose we had a distance between every pair of images. Tool: Multi-dimensional scaling Input: all pairwise distances D. Output: set of point positions X whose pairwise distances match D.

0 5 3 1 3 5 0 8 2 1 3 8 0 6 4 1 2 6 0 7 3 1 4 7 0 D = 3 1 3 5

Distance Matrix Points with that set of pairwise distances.

slide-12
SLIDE 12

| | | l1 0 0 X1 Y1 Z1 . . . | | | 0 l2 0 X2 Y2 Z2 . . . V1 V2 V3 … 0 0 l3 … = X3 Y3 Z3 . . . | | | 0 0 0 . . . | | | 0 0 0 . . .

MDS algorithm:

Squared distance matrix S: S(i,j) = D(i,j)2. Centering matrix H: H = I – 1/N. (identity – uniform matrix of 1/N) Dot-product matrix: t(D) = -HSH/2 defined so: X’X = t(D) if for all i,j (Xi – Xj)’ (Xi – Xj) = S(i,j),

where X is a matrix whose columns are position vectors in parameter space.

Consider eigenvalue problem: X’X = t(D) Let lp , vp,be the p-th eigenpair of t(D) Each row is optimal embedding in k-dimensional space, if you use k eigenpairs.

slide-13
SLIDE 13

Images which are very similar should be embedded as points which are close to each other. For images which are not similar, we don’t know how close their embedded points should be.

… but image similarity is only meaningful for small image distances.

slide-14
SLIDE 14

But, we don’t believe image similarities for anything but very similar images. In 2000, two papers presented methods of extending local similarities to give global constraints: Isomap (Tenenbaum, et al, 2000) LLE (Roweis and Saul, 2000) followed by Semi-Definite Embedding, Maximum Variance Unfolding, Diffusion Maps, Laplacian Eigenmaps, Hessian Eigenmaps, Locality Preserving Projections, and others, all of which are techniques for non-linear dimension reduction.

slide-15
SLIDE 15

Isomap:

Define G(V,E):

  • V is the set of points (in our case, images)
  • E is the set of comparable points, (images

with differences that are very small)

  • w(e) is the image difference.

Algorithm:

  • Run all pairs shortest path algorithm on G,
  • Define D, the pairwise distance matrix to be

shortest path distance in G. Run MDS, using D as given pairwise distances.

slide-16
SLIDE 16

Classic example: Swiss Roll.

slide-17
SLIDE 17

From Isomap paper by Joshua Tenenbaum, Science, December 2000

slide-18
SLIDE 18
slide-19
SLIDE 19

(Isomap – ShortestPath) == PCA

The key to Isomap is the shortest-path distances. If you run MDS on points with original distances (from high dimensional space), it gives the SAME embedding as PCA, up to Euclidean transformation.

slide-20
SLIDE 20

Isomap for video analysis Example: bird flight

slide-21
SLIDE 21

Temporal Super-Resolution

4 x Framerate 20 x Framerate (different input)

slide-22
SLIDE 22

Woman on a treadmill

slide-23
SLIDE 23

Example: human behaviors

slide-24
SLIDE 24

Last example: gait

slide-25
SLIDE 25

Applications to medical imagery

  • Medical imagery (MRI/CT) is an ideal

problem domain for manifold learning.

  • Often just a few degrees of freedom:
  • Contrast agent perfusion
  • Viewpoint change
  • Breathing/heartbeat
  • Which may be difficult to measure
slide-26
SLIDE 26

Cardiac MRI imagery courtesy of Nikos Tsekos, Department of Radiology, Washington University Medical School.

Isomap Visualization

slide-27
SLIDE 27

Consider distance between Gabor Filter Responses at each pixel. Complex Gabor response separates motion (phase change) from contrast change (magnitude)

slide-28
SLIDE 28

1-D embedding from Gabor response magnitude difference. 1-D embedding by phase shift Motion axis

slide-29
SLIDE 29
slide-30
SLIDE 30

Isomap Visualization

slide-31
SLIDE 31

4D CT – alignment of ungated images

Acquire data in 16 slice sections (chunks), in cine mode (25 frames). Reconstruct 3D lung volume for each breathing phase

with Andrew Hope, now Asst Prof. of Radiation Oncology, Univ. of Toronto

slide-32
SLIDE 32

4D CT Data Acquisition

Patient lung Data acquisition Images essentially unordered

slide-33
SLIDE 33

External breath surrogate Time

Sort by external breathing surrogate

slide-34
SLIDE 34

34

1-D manifold

  • Sort breath using Isomap of images
slide-35
SLIDE 35

Color coded by breath surrogate (belt) measurement.

slide-36
SLIDE 36 36

Solve for affine parameters of one couch position that maximizes smoothness over volume segment boundary to next affine parameter.

slide-37
SLIDE 37 37
slide-38
SLIDE 38 38
slide-39
SLIDE 39 39
slide-40
SLIDE 40 40
slide-41
SLIDE 41 41
slide-42
SLIDE 42 42
slide-43
SLIDE 43 43
slide-44
SLIDE 44
slide-45
SLIDE 45

45

slide-46
SLIDE 46

46

slide-47
SLIDE 47

47

slide-48
SLIDE 48

48

slide-49
SLIDE 49

49

slide-50
SLIDE 50

50

slide-51
SLIDE 51

51

slide-52
SLIDE 52

52

Limitations

Each chunk needs data in each part of breath Breath must be present in all images Top of lung difficult to order Lots of data needed for secondary variations (heart beat, hysteresis)

slide-53
SLIDE 53

“is there more to manifold learning than re-sampling, re-ordering, and de- noising?”

slide-54
SLIDE 54

Data from Sandor Kovacs, Dept. of Radiology, Washington University

slide-55
SLIDE 55

breathing heartbeat

slide-56
SLIDE 56

Snake C(s) C(s,t) C(s,f,q) Single image Time sequence

slide-57
SLIDE 57
  • Use a cubic B-spline surface to specify how

each control point varies with f and q.

C(s,f,q)

slide-58
SLIDE 58
  • Heartbeat phase f, breathing phase q.
  • contour C(s, f, q)
slide-59
SLIDE 59

Term to penalize non- translational motion Term to penalize deformation other than expansion/contraction

slide-60
SLIDE 60

Level set function f(x,y) f(x,y,t) f(x,y,f,q) Single image Time sequence

4D Level set function

slide-61
SLIDE 61 Cardio-Pulminary Level Sets, Q. Zhang, R. Souvenir,
  • R. Pless, CVBIA 2005
Manifold Learning for Segmentation, Q. Zhang, R. Souvenir,
  • R. Pless, EMMCVPR,
2005

Worst single image results from best parameters

slide-62
SLIDE 62

Cine-MR segmentation summary

  • Manifold learning re-arranges original data

frames in order to provide additional constraints which improve segmentation

  • Resulting optimization problem remains very

similar to standard Snakes or Level-Sets.

slide-63
SLIDE 63

Conclusions

  • Manifold learning is an important tool for

the analysis of images that vary due to motion and deformation.

  • Especially useful in medical --- images of
  • ne patient often form very clean

manifolds of just a few dimensions.

slide-64
SLIDE 64

Looking forward?

slide-65
SLIDE 65 Richard Souvenir UNC-Charlotte Qilong Zhang, Nomura Michael Dixon, Willow Garage Manfred Georg, Google/ Youtube Nathan Jacobs, Kentucky

Support from the National Science Foundation

Acknowledgements / Other Interests

Omni-directional Vision Surveillance and Tracking Remote Sensing