Abstract
Bag-of-visual-words is a popular image representation and attains wide application in image processing community. While its potential has been explored in many aspects, its operation still follows a basic mode, namely for a given dataset, using k-means-like clustering methods to train a vocabulary. The vocabulary obtained this way is data dependent, i.e., with a new dataset, we must train a new vocabulary. Based on previous research on determining the optimal vocabulary size, in this paper we research on the possibility of building a universal and limited visual vocabulary with optimal performance. We analyze why such a vocabulary should exist and conduct extensive experiments on three challenging datasets to validate this hypothesis. As a consequence, we believe this work sheds a new light on finally obtaining a universal visual vocabulary of limited size which can be used with any datasets to obtain the best or near-best performance.
| Original language | English |
|---|---|
| Pages (from-to) | 398-407 |
| Number of pages | 10 |
| Journal | Lecture Notes in Computer Science |
| Volume | 6939 LNCS |
| Issue number | PART 2 |
| DOIs | |
| State | Published - 2011 |
| Externally published | Yes |
| Event | 7th International Symposium on Visual Computing, ISVC 2011 - Las Vegas, NV, United States Duration: 26 Sep 2011 → 28 Sep 2011 |
Fingerprint
Dive into the research topics of 'Towards a universal and limited visual vocabulary'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver