Institut Mines-Télécom
Interest point detection Nicolas ROUGON ARTEMIS Department - - PowerPoint PPT Presentation
Interest point detection Nicolas ROUGON ARTEMIS Department - - PowerPoint PPT Presentation
IMA4103 Extraction dInformation Multimdia Interest point detection Nicolas ROUGON ARTEMIS Department Nicolas.Rougon@telecom-sudparis.eu Institut Mines-Tlcom Problem statement We hereafter review methods for extracting local
Institut Mines-Télécom
Problem statement
IMA 4103 - Nicolas ROUGON
■ We hereafter review methods for extracting
local geometric features of interest in gray level images, useable in a variety of image matching problems
- Image registration
► image stitching | augmented reality
- Image retrieval & object recognition/categorization
► image & video indexing
- 3D scene/object reconstruction
► vision-based 3D photogrammetry
- Tracking & navigation
► simultaneous localization and mapping (SLAM)
Institut Mines-Télécom
Problem statement
IMA 4103 - Nicolas ROUGON
■ Motivation
Matching techniques using local features of interest are significantly more robust to large variations of scene geometry, including than approaches assessing similarity between image (sub)domains
- strong viewpoint
change
- partial occlusion
- object deformation
Institut Mines-Télécom
Problem statement
IMA 4103 - Nicolas ROUGON
- Structural properties
generic sparse ► compactness ► computational efficiency numerous ► robustness
uniformly distributed occlusions | clutter | cropping
■ Requirements
Relevant features of interest should be distinctive, and satisfy properties ensuring stable and efficient detection / matching
Institut Mines-Télécom
Problem statement
IMA 4103 - Nicolas ROUGON
■ Requirements
Relevant features of interest should be distinctive, and satisfy properties ensuring stable and efficient detection / matching
- Invariance properties
► repeatability contrast transforms sensor photometric calibration scene lighting monotonic luminance transforms spatial transforms
sensor geometric calibration viewpoint
isometries | scalings | affine transforms
Institut Mines-Télécom
Problem statement
IMA 4103 - Nicolas ROUGON
- Robustness properties
► repeatability ► accuracy sampling & quantization digital image acquisition coding scheme noise sensor model
■ Requirements
Relevant features of interest should be distinctive, and satisfy properties ensuring stable and efficient detection / matching
Institut Mines-Télécom
Problem statement
IMA 4103 - Nicolas ROUGON
■ Candidate features
Institut Mines-Télécom
Problem statement
IMA 4103 - Nicolas ROUGON
■ Candidate features
Edges are not eligible as features of interest
- Generic, sparse, uniformly distributed
- Reasonably invariant to contrast changes
- Not distinctive
► matching ambiguity occurs along edges
- Not invariant to spatial transforms
t n Ltarget = c Lsource = c
? ? ?
Institut Mines-Télécom
Problem statement
IMA 4103 - Nicolas ROUGON
■ Candidate features
Corners provide relevant features of interest
- Distinctive
- Generic, sparse, numerous, uniformly distributed
- Invariant to contrast changes
- Invariant to spatial transforms except scalings
Institut Mines-Télécom
Problem statement
IMA 4103 - Nicolas ROUGON
■ Interest point matching
- Detection
Extract a set of distinctive & repeatable interest points Define an invariant interest patch around each keypoint
- Description
Normalize & transform patches into invariant local coordinates Compute a patch local descriptor
- Matching
Match local descriptors based
- n some similarity metrics
x1
1
x3
1 1
x2 x3
2 2
x2 x1
2 1
fi
2
fj
1
f1
1
fd
2
f1
2
fd ► ◄ ► ◄
1
fi d( , )
2
fj
▼ ▼
Institut Mines-Télécom
Example applications
IMA 4103 - Nicolas ROUGON
■ Image stitching
View #1 View #2
Institut Mines-Télécom
Example applications
IMA 4103 - Nicolas ROUGON
■ Image stitching
Corners #1 Corners #2
► Corners capture the geometry of textured shapes
Institut Mines-Télécom
Example applications
IMA 4103 - Nicolas ROUGON
■ Image stitching
Corner matching
Institut Mines-Télécom
Example applications
IMA 4103 - Nicolas ROUGON
■ Image stitching
View stitching
Institut Mines-Télécom
IMA 4103 - Nicolas ROUGON
Strong viewpoint changes | Partial occlusions
■ Video Tracking
Example applications
Institut Mines-Télécom
IMA 4103 - Nicolas ROUGON
Strong viewpoint changes | Partial occlusions | Object deformations
■ Video Tracking
Example applications
Institut Mines-Télécom
Example applications
IMA 4103 - Nicolas ROUGON
■ Multi-view 3D scene reconstruction
Feature extraction
> corner points
Feature matching
> motion vectors
Camera + sparse depth estimation
> 3D point cloud
Surface reconstruction + texturing
> 3D textured mesh
Institut Mines-Télécom
Problem statement
IMA 4103 - Nicolas ROUGON
- Good detection
■ Requirements
Expected performances of relevant interest point detectors ◄ generic framework for performance assessment
Institut Mines-Télécom
Performance assessment
IMA 4103 - Nicolas ROUGON
■ Confusion matrix
Type I error Type II error
- True Positives (TP)
Correct detections
- True Negatives (TN)
Correct rejections
- False Positives (FP)
Wrong detections False alarm | Type I error
- False Negatives (FN)
Wrong rejections Miss | Type II error
Type I error Type II error
True Positives False Positives True Negatives False Negatives False True Ground Truth (X) Detection (Y) Negative Positive
Type I error Type II error
Success Y = X Failure Y ≠ X
Institut Mines-Télécom
Problem statement
IMA 4103 - Nicolas ROUGON
- Good detection
few Failures
few false positives few false negatives
■ Requirements
Expected performances of relevant interest point detectors ◄ dedicated error metrics
Institut Mines-Télécom
Performance metrics
IMA 4103 - Nicolas ROUGON
■ Recall | Sensitivity | True Positive Rate
- Probability of relevant samples
to be detected > P[Y=1|X=1]
𝑠𝑓𝑑𝑏𝑚𝑚 = TP TP + FN
- 𝑠𝑓𝑑𝑏𝑚𝑚 ↗ 1 when FN ↘ 0
► assessment of false negatives ► false positives not addressed
Type I error Type II error
True Positives False Positives True Negatives False Negatives False True Ground Truth (X) Detection (Y) Negative Positive
Type I error Type II error
Institut Mines-Télécom
Performance metrics
■ Precision | Positive Predicted Value
Type I error Type II error
𝑞𝑠𝑓𝑑𝑗𝑡𝑗𝑝𝑜 = TP TP + FP
- Probability of detections
to be relevant > P[X=1|Y=1]
Type I error Type II error
True Positives False Positives True Negatives False Negatives False True Ground Truth (X) Detection (Y) Negative Positive
Type I error Type II error
- 𝑞𝑠𝑓𝑑𝑗𝑡𝑗𝑝𝑜 ↗ 1 when FP ↘ 0
► assessment of false positives ► false negatives not addressed
IMA 4103 - Nicolas ROUGON
Institut Mines-Télécom
Performance metrics
IMA 4103 - Nicolas ROUGON
■ Specificity | True Negative Rate
Type I error Type II error
- Probability of irrelevant samples
to be rejected > P[Y=0|X=0]
𝑡𝑞𝑓𝑑𝑗𝑔𝑗𝑑𝑗𝑢𝑧 = TN TN + FP
Type I error Type II error
True Positives False Positives True Negatives False Negatives False True Ground Truth (X) Detection (Y) Negative Positive
Type I error Type II error
- 𝑡𝑞𝑓𝑑𝑗𝑔𝑗𝑑𝑗𝑢𝑧 ↗ 1 when FP ↘ 0
► assessment of false positives
Institut Mines-Télécom
Performance metrics
IMA 4103 - Nicolas ROUGON
■ Negative Predicted Value
Type I error Type II error
- Probability of rejections
to be irrelevant > P[X=0|Y=0]
𝑂𝑄𝑊 = TN TN + FN
Type I error Type II error
True Positives False Positives True Negatives False Negatives False True Ground Truth (X) Detection (Y) Negative Positive
Type I error Type II error
- 𝑂𝑄𝑊 ↗ 1 when FN ↘ 0
► assessment of false negatives
Institut Mines-Télécom
Performance metrics
IMA 4103 - Nicolas ROUGON
𝐵𝐷𝐷 = TP + TN TP + TN + FP + FN
- Probability of correct decision
(detection/rejection) > P[Y=X]
■ Accuracy
Type I error Type II error
True Positives False Positives True Negatives False Negatives False True Ground Truth (X) Detection (Y) Negative Positive
Type I error Type II error
- 𝐵𝐷𝐷 ↗ 1 when (FP, FN) ↘ 0
► joint assessment of false positives & false negatives, from detections & rejections
Institut Mines-Télécom
Performance metrics
IMA 4103 - Nicolas ROUGON
𝐺 = 2 𝑞𝑠𝑓𝑑𝑗𝑡𝑗𝑝𝑜 ∙ 𝑠𝑓𝑑𝑏𝑚𝑚 𝑞𝑠𝑓𝑑𝑗𝑡𝑗𝑝𝑜 + 𝑠𝑓𝑑𝑏𝑚𝑚
- Harmonic mean of precision
and recall
■ F-score
Type I error Type II error
True Positives False Positives True Negatives False Negatives False True Ground Truth (X) Detection (Y) Negative Positive
Type I error Type II error
- 𝐺 ↗ 1 when (FP, FN) ↘ 0
► joint assessment of false positives & false negatives, focusing on detections
= 2 TP 2 TP + FP + FN
Institut Mines-Télécom
Problem statement
IMA 4103 - Nicolas ROUGON
- Good detection
high sensitivity / recall = all (most) true positives high specificity / precision
= few false positives
- Robustness against noise
- Good localization
► accuracy
- Computational efficiency
■ Requirements
Expected performances of relevant interest point detectors
► repeatability
Institut Mines-Télécom
Differential corner detection
IMA 4103 - Nicolas ROUGON
Institut Mines-Télécom
Corner detection
IMA 4103 - Nicolas ROUGON
■ Basic idea
At corner points, comparing an image patch to its neighbors shows dissimilarity in all directions
- Low-texture region ● Edge
- Corner
► similar in all directions ► similar along edge direction ► dissimilar in all directions
Institut Mines-Télécom
ρ Kρ
Corner detection
IMA 4103 - Nicolas ROUGON
■ Basic idea
At corner points, comparing an image patch to its neighbors shows dissimilarity in all directions
- This requires defining a similarity metric between image patches,
and search for its local maxima over a space of admissible shifts u
- A natural choice is (weighted) autocorrelation
Kρ : kernel with extension ρ
► patch support unit | Gaussian Opting for a unit kernel and 8-connected shifts yields the (early) Moravec detector
► not isotropic
x
Institut Mines-Télécom
Corner detection
IMA 4103 - Nicolas ROUGON
■ Quadratic approximation of the autocorrelation metric
- 1st-order Taylor expansion:
- The matrix is known as the structure tensor
► Quadratic approximation
Institut Mines-Télécom
- A robust estimate of the structure tensor is obtained using
regularized image derivatives Gaussian: | Canny-Deriche
- Expanded form
Corner detection
IMA 4103 - Nicolas ROUGON
■ Structure tensor
σ = local scale ρ = integration scale
Institut Mines-Télécom
- Pointwise patch (ρ = 0)
Corner detection
IMA 4103 - Nicolas ROUGON
■ Structure tensor
The information in the tensor (symmetric, positive definite) is described by its eigenvectors (dmax, dmin) and eigenvalues (λmax, λmin) ► same information in and
- describes average gradient properties over patch support
dmax (dmin) : dominant (anti-dominant) orientation λmax , λmin : directional contrast values
Institut Mines-Télécom
Corner detection
IMA 4103 - Nicolas ROUGON
■ Structure tensor
- Edge
1 dominant direction – 1 large directional gradient λmax large – λmin small
Institut Mines-Télécom
Corner detection
IMA 4103 - Nicolas ROUGON
■ Structure tensor
- Low-textured region
no dominant direction – no gradient information λmax small – λmin small
Institut Mines-Télécom
Corner detection
IMA 4103 - Nicolas ROUGON
■ Structure tensor
- High-textured region | Corner
no dominant direction – large directional gradients λmax large – λmin large
Institut Mines-Télécom
Corner detection
IMA 4103 - Nicolas ROUGON
■ Structure tensor
- At corner points, the smallest eigenvalue λmin of the structure
tensor is large enough
- This property is exploited by 2 widely-used corner detectors
Kanade-Lucas-Tomasi (KLT) Harris-Förstner
Institut Mines-Télécom
The KLT detector
IMA 4103 - Nicolas ROUGON
Institut Mines-Télécom
Kanade-Lucas-Tomasi (KLT) detector
IMA 4103 - Nicolas ROUGON
■ Algorithm
Given a threshold λ on λmin and the size D of a square neighborhood
- Compute image gradient
- Initialize a point list L. For each x Ω
− compute structure tensor using a unit (D x D) kernel − compute smallest eigenvalue λmin
− if λmin> λ, insert x into L
- Sort L in decreasing order of λmin ► Lsort
- Scan Lsort from top to bottom
− for each current point x Lsort, discard all points after x in Lsort located in the (D x D) neighborhood of x
Institut Mines-Télécom
Kanade-Lucas-Tomasi (KLT) detector
IMA 4103 - Nicolas ROUGON
■ Hyperparameters
- The threshold λ controls the sensitivity of the detector
λ can be estimated from the histogram of λmin which has usually an obvious valley near 0 however, this valley does not always exist
- The kernel / neighborhood size D is estimated empirically
in most cases: D [2,10] large values of D induce delocalization artifacts and neighboring corner fusion
Institut Mines-Télécom
The Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
Institut Mines-Télécom
- The Harris-Förstner detector makes use of both eigenvalues
- f the structure tensor via their ratio
- To avoid computing (λmax, λmin) explicitly, similitude invariants
- f are used
Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
■ Principles
Institut Mines-Télécom
- The latter are combined into a corner index
setting a threshold on r induces a threshold on α
- This yields the following corner metric
hyperparameter α [0, 0.25]
Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
■ Principles
− large at corner points − small in low-texture regions − negative at edge points
Institut Mines-Télécom
Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
■ Algorithm
Given a value of α and a threshold tR on R
- Compute image gradient
- For each x Ω
− compute structure tensor using a Gaussian kernel Gρ Standard choice: ρ = 2σ − compute corner metric R(x)
- Threshold corner map R above tR and retain only local maxima*
* via Non-Maximal Suppression in the 8-connected neighborhood
- Filter out weak*corners in the ρ-neighborhood of strong*corners
in a way similar to KLT
* w.r.t. the corner metric R
Institut Mines-Télécom
Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
■ Hyperparameters
- The parameter α controls the sensitivity of the detector
and is tuned empirically
sensitivity when α in most cases: α [0.04, 0.06]
- The threshold R is tuned empirically
− usually, R is set close to 0
0.05 α 0.10 0.20 0.22 0.24
Institut Mines-Télécom
Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
■ Hyperparameters
- The parameter α controls the
sensitivity of the detector and is tuned empirically
sensitivity when α in most cases: α [0.04, 0.06]
- The threshold R is tuned empirically
usually, R is set close to 0
0.05 α 0.10 0.20 0.22 0.24
Institut Mines-Télécom
Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
- riginal
Institut Mines-Télécom
Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
Corner metric R
Institut Mines-Télécom
Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
Thresholded corner metric R > R
Institut Mines-Télécom
Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
Corner metric local maxima
Institut Mines-Télécom
Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
Harris points
Institut Mines-Télécom
KLT vs. Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
■ Properties
- Isometry-invariance
Inherited from eigenvalues properties
- Insensitive to affine intensity transforms
Local maxima of λmin / R are preserved R(x) x x R
φ(L) = aL + b
Institut Mines-Télécom
KLT vs. Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
■ Limitations
- Not invariant to scaling
Using J at fixed scale ρ in both views
View #1 View #2
► multiple points detected as edges ► single point detected as a corner
ρ
Institut Mines-Télécom
KLT vs. Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
■ Performance comparison
KLT Harris
Institut Mines-Télécom
KLT vs. Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
■ Performance comparison
KLT Harris
Institut Mines-Télécom
KLT vs. Harris-Förstner detector
IMA 4103 - Nicolas ROUGON
■ KLT detector
- Output is usually closer to human perception of corners
- Often used for motion tracking
► widespread KLT Tracker
- Mostly used in the US
■ Harris-Förstner detector
- Good repeatability under varying rotation and lighting
- Often used for 3D scene reconstruction and image retrieval
- Mostly used in Europe
Institut Mines-Télécom
Temporary conclusion
IMA 4103 - Nicolas ROUGON
■ Topics to be further addressed
- Detection
scale-invariance issue
Fixed by automatic scale selection ► Harris-Laplacian detector
2nd-order detectors ► Hessian detector ► Hessian-Laplace detector
- Description
Define / extract feature vector descriptors around interest points
- Matching
Estimate correspondence between descriptors in each view
Institut Mines-Télécom