Image Masking Schemes for Local Manifold Learning Methods Marco F. - - PowerPoint PPT Presentation

image masking schemes for local manifold learning methods
SMART_READER_LITE
LIVE PREVIEW

Image Masking Schemes for Local Manifold Learning Methods Marco F. - - PowerPoint PPT Presentation

Image Masking Schemes for Local Manifold Learning Methods Marco F. Duarte Joint work with Hamid Dadkhahi IEEE ICASSP - April 23 2015 Manifold Learning Given training points in , learn the mapping to the underlying K -dimensional


slide-1
SLIDE 1

Marco F. Duarte

Image Masking Schemes for 
 Local Manifold Learning Methods

Joint work with Hamid Dadkhahi IEEE ICASSP - April 23 2015

slide-2
SLIDE 2

Manifold Learning

  • Given training points in , learn the mapping

to the underlying K-dimensional articulation manifold

  • Exploit local geometry 


to capture parameter 
 differences by embedding 
 distances

  • ISOMAP, LLE, HLLE, …
  • Ex:

images of

  • rotating teapot
  • articulation space
  • = circle
slide-3
SLIDE 3

Compressive Manifold Learning

  • Isomap algorithm approximates geodesic distances

using distances between neighboring points

  • Random measurements preserve these distances
  • Theorem: If , then the Isomap


residual variance in the projected domain is bounded by the additive error factor

N = 4096 (Full Data) M = 100 M = 50 M = 25 translating
 disk manifold
 (K=2) [Hegde, Wakin,
 Baraniuk 2008]

  • Given training points in , learn the mapping

to the underlying K-dimensional articulation manifold

slide-4
SLIDE 4

Custom Projection Operators

  • Goal of Dimensionality Reduction: To preserve distances

between points in the manifold, i.e., for

  • Collect pairwise differences into set of 


secant vectors

  • Search for projection that preserves norms of secants:

[Hegde, Sankaranarayanan, Baraniuk 2012]

slide-5
SLIDE 5
  • Usual approach: Principal Component Analysis (PCA)
  • Collect all secants into a matrix:
  • Perform eigenvalue decomposition on S:
  • Select top eigenvectors as projections
  • PCA minimizes the average squared distortion over secants, but

can distort individual secants arbitrarily and therefore warp manifold structure

Custom Projection Operators

[Hegde, Sankaranarayanan, Baraniuk 2012]

slide-6
SLIDE 6
  • For target distortion , find matrix featuring the smallest

number of rows that yields

  • This is equivalent to minimizing the rank

  • f the matrix such that
  • Use nuclear norm as proxy for rank to 

  • btain computationally efficient approach
  • Improves over random projections since matrix is 


specifically tailored to manifold observed

  • May be difficult to link target distortion to matrix rank/


number of rows

Custom Projection Operators: NuMax

[Hegde, Sankaranarayanan, Baraniuk 2012]

slide-7
SLIDE 7
  • Projections matrices have entries 


with arbitrary values

  • Physics of sensing process, 


hardware devices restrict types 


  • f projections we can obtain
  • Example: Low-power imaging 


for computational eyeglasses

  • Low-power imaging sensor 


allows for individual selection of 
 pixels to record

  • Power consumption proportional 


to number of pixels sampled

  • Random projections/NuMax involve 


half/all pixels and do not enable 
 power savings

  • How to derive constrained 


projection matrices that involve 


  • nly few pixels?

Issues with Randomness and NuMax

[Mayberry, Hu, Marlin, 
 Salthouse, Ganesan 2014]

slide-8
SLIDE 8
  • Select only a subset of the pixels of

size M that minimizes distortion to manifold structure

  • Emulate strategies for projection

design into mask design

  • Random Masking:


Pick M pixels uniformly at random across image

Masking Strategies for Manifold Data

M = 100 
 pixels

slide-9
SLIDE 9
  • Select only a subset of the pixels of

size M that minimizes distortion to manifold structure

  • Emulate strategies for projection

design into mask design

  • Principal coordinate analysis: 


Pick M coordinates that maximize variance among secants

Masking Strategies for Manifold Data

M = 100 
 pixels

[Dadkhahi and Duarte 2014]

slide-10
SLIDE 10
  • Select only a subset of the pixels of

size M that minimizes distortion to manifold structure

  • Emulate strategies for projection

design into mask design

  • Adaptation of NuMax:
  • Define secants from k-nearest

neighbor graph:


  • Pick masking matrix (row

submatrix of I) to minimize secant norm distortion after scaling:


  • Combinatorial integer program

replaced by greedy approximation

Masking Strategies for Manifold Data

M = 100 
 pixels

[Dadkhahi and Duarte 2014]

slide-11
SLIDE 11

Isomap vs. Locally Linear Embedding

  • While Isomap employs distances between neighbors when

designing the embedding, Locally Linear Embedding (LLE) employs additional local geometry, 
 representing each vector as a weighted 
 linear combination of its neighbors


  • In particular, Isomap embedding is 


sensitive to scaling of the point clouds, 
 while LLE isn’t

slide-12
SLIDE 12

Isomap vs. Locally Linear Embedding

  • While Isomap employs distances between neighbors when

designing the embedding, Locally Linear Embedding (LLE) employs additional local geometry, 
 representing each vector as a weighted 
 linear combination of its neighbors


  • In particular, Isomap embedding is 


sensitive to scaling of the point clouds, 
 while LLE isn’t

slide-13
SLIDE 13

Isomap vs. Locally Linear Embedding

  • While Isomap employs distances between neighbors when

designing the embedding, Locally Linear Embedding (LLE) employs additional local geometry, 
 representing each vector as a weighted 
 linear combination of its neighbors


  • In particular, Isomap embedding is 


sensitive to scaling of the point clouds, 
 while LLE isn’t

  • To preserve this additional local 


information, we expand the set of secants 
 to include distances between neighbors of 
 each point

slide-14
SLIDE 14
  • While Isomap employs distances between neighbors when

designing the embedding, Locally Linear Embedding (LLE) employs additional local geometry, 
 representing each vector as a weighted 
 linear combination of its neighbors


  • In particular, Isomap embedding is 


sensitive to scaling of the point clouds, 
 while LLE isn’t

  • To preserve this additional local 


information, we expand the set of secants 
 to include distances between neighbors of 
 each point

Isomap vs. Locally Linear Embedding

slide-15
SLIDE 15
  • Expand the set of secants considered:


  • Compute squared norms of secants 


in obtained from the original 
 images and from masked images; 
 collect into “norm” vectors 


  • Choose mask that maximizes sum of 


cosine similarities between original and 
 masked “norm” vectors: 
 
 


  • Replace combinatorial optimization by 


greedy forward selection algorithm

  • Cosine similarity is invariant to (local) scaling of point cloud

Manifold-Aware Pixel Selection for LLE

slide-16
SLIDE 16

Manifold-Aware Pixel Selection for LLE

M = 100 pixels M = 100 pixels

slide-17
SLIDE 17

50 100 150 200 250 300 2 4 6 8 10 Embedding Dim./Masking Size m Embedding Error Full Data PCA SPCA PCoA MAPS−LLE MAPS−Isomap Random

Performance Analysis: 
 LLE Embedding Error

M

Full Data PCA SPCA PCoA MAPS−LLE MAPS−Isomap Random

slide-18
SLIDE 18

50 100 150 200 250 300 500 1000 1500 2000 2500 Embedding Dim./Masking Size m Embedding Error M

Performance Analysis: 
 LLE Embedding Error

Full Data PCA SPCA PCoA MAPS−LLE MAPS−Isomap Random

slide-19
SLIDE 19

Performance Analysis: 
 LLE Embedding Error

50 100 150 200 250 300 10 20 30 40 50 Embedding Dim./Masking Size m Embedding Error

M

Full Data PCA SPCA PCoA MAPS−LLE SPCA PCoA MAPS−LLE MAPS−Isomap Random

slide-20
SLIDE 20

Performance Analysis: 2-D LLE

M = 100 pixels

slide-21
SLIDE 21

50 100 150 200 250 300 25 30 35 40 45 Embedding Dim./Masking Size m Average Gaze Estimation Error Full Data PCA SPCA PCoA MAPS−LLE MAPS−Isomap Random

Computational Eyeglasses: Eye Gaze Tracking

M

Error measured
 in “target” pixels

slide-22
SLIDE 22

Conclusions

  • Compressive sensing (CS) for manifold-modeled images

via random or customized projections (NuMax)

  • New sensors enable CS by masking images, 


i.e., restricting the type of projections

  • Our MAPS algorithms find image masks that best

preserve geometric structure used during manifold learning for image datasets

  • Greedy algorithms provide good preservation of learned

manifold embeddings, suitable for parameter estimation

  • While Isomap relies on distances between neighbors,

LLE also leverages local geometric structure; different algorithms are optimal for these cases

  • Concept of subsampling as feature selection -

supervised and unsupervised learning?

http://www.ecs.umass.edu/~mduarte mduarte@ecs.umass.edu