Академический Документы
Профессиональный Документы
Культура Документы
Recognition
and Retrieval
What Scaling
does the to
world look like?
billions of images
Object Recognition
High level for large-scale search
image statistics
Focus on scaling rather than understanding image
Content-Based Image Retrieval
• Variety of simple/hand-designed cues:
– Color and/or Texture histograms, Shape, PCA, etc.
• Various distance metrics
– Earth Movers Distance (Rubner et al. ‘98)
• Bag-of-words: Compare
frequencies of prototype
features
[Csurka et al., Sivic & Zisserman, Lazebnik &
Ponce]
Slide credit: Kristen Grauman
Pyramid match kernel
[Grauman & Darrell, ICCV 2005]
m = # features
L = # levels in pyramid
optimal partial
matching
descriptor
space
Pyramid match
Recognition accuracy (%)
Mean number of features per set (m) Mean number of features per set (m)
• Exploit (dis)similarity
constraints to
construct more useful
dissimilar distance function
• Number of existing
techniques for metric
learning
similar
[Weinberger et al. 2004,
Hertz et al. 2004, Frome et
al. 2007, Varma & Ray
2007, Kumar et al. 2007]
Example sources of similarity constraints
Problem-specific
knowledge
Locality Sensitive Hashing (LSH)
• Gionis, A. & Indyk, P. & Motwani, R. (1999)
• Take random projections of data
• Quantize each projection with few bits
0 101
1 Descriptor in
high D space
0
1
1
0
Fast Image Search for Learned Metrics
Jain, Kulis, & Grauman, CVPR 2008
Learn a Malhanobis metric for LSH
h( ) = h( ) h( ) ≠ h( )
Less likely to split pairs like those More likely to split pairs like those
with similarity constraint with dissimilarity constraint
Slide : Kristen Grauman
Results: Flickr dataset
Query time:
Error rate
Query
Image
Semantic
Address Space
Binary
Hash code Images in database
Function
Query address
Quite different Semantically
to a (conventional) similar
randomizing hash images
Exploring different choices of
semantic hash function
Torralba, Fergus, Weiss, CVPR 2008
Compute
Gist ~1ms (in Matlab)
Learn mapping
• Neighborhood Components Analysis [Goldberger et al., 2004]
• Adjust model parameters to move:
– Points of SAME class closer
– Points of DIFFERENT class away
• Learning methods
outperform LSH