
On vocabulary size in bag-of-visual-words representation

  • Jian Hou*
  • Jianxin Kang
  • Naiming Qi
  • *Corresponding author for this work
  • School of Astronautics, Harbin Institute of Technology
  • Northeast Agricultural University

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Bag-of-visual-words is a popular image representation that offers high matching accuracy and efficiency. Although vocabulary size affects matching accuracy, existing research usually selects it empirically. Research on representative local descriptors shows that, with similarity-based clustering, the degree of intra-cluster similarity among descriptors plays the same role in straightforward matching as vocabulary size does in visual words matching. Based on this observation, we propose to use similarity-based clustering to determine the optimal vocabulary size for a given dataset in visual words matching. Preliminary experiments with three datasets produce encouraging results and demonstrate the potential of the proposed approach.
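
To illustrate the idea in the abstract, the following is a minimal sketch of how a similarity threshold can determine the number of clusters, and hence the vocabulary size, instead of fixing that number in advance. The abstract does not specify the clustering procedure, so everything here is an assumption made for illustration: greedy leader clustering, cosine similarity, the function names `cosine_similarity` and `similarity_clustering`, the `threshold` parameter, and the toy two-dimensional descriptors. It should not be read as the authors' method.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def similarity_clustering(descriptors, threshold):
    """Greedy leader clustering (an assumed stand-in, not the paper's algorithm).

    Each descriptor joins the first cluster whose leader it matches with
    similarity >= threshold; otherwise it becomes the leader of a new cluster.
    Returns the list of leaders and one cluster label per descriptor.
    """
    leaders = []
    labels = []
    for d in descriptors:
        for idx, leader in enumerate(leaders):
            if cosine_similarity(d, leader) >= threshold:
                labels.append(idx)
                break
        else:
            # No existing leader is similar enough: start a new visual word.
            leaders.append(d)
            labels.append(len(leaders) - 1)
    return leaders, labels


# Toy descriptors (hypothetical data, two dimensions for readability).
descriptors = [
    [1.0, 0.0],
    [0.99, 0.05],
    [0.0, 1.0],
    [0.05, 0.98],
    [0.7, 0.7],
]

leaders, labels = similarity_clustering(descriptors, threshold=0.95)
vocabulary_size = len(leaders)  # the number of clusters is the vocabulary size
```

In this sketch a stricter threshold yields more, tighter clusters (a larger vocabulary), while a looser threshold merges descriptors into fewer clusters, which mirrors the correspondence between intra-cluster similarity and vocabulary size described above.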

Original language: English
Title of host publication: Advances in Multimedia Information Processing, PCM 2010 - 11th Pacific Rim Conference on Multimedia, Proceedings
Pages: 414-424
Number of pages: 11
Edition: PART 1
State: Published - 2010
Externally published: Yes
Event: 11th Pacific Rim Conference on Multimedia, PCM 2010 - Shanghai, China
Duration: 21 Sep 2010 to 24 Sep 2010

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Number: PART 1
Volume: 6297 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 11th Pacific Rim Conference on Multimedia, PCM 2010
Country/Territory: China
City: Shanghai
Period: 21/09/10 to 24/09/10

Keywords

  • Visual words
  • representative local descriptors
  • straightforward matching
  • vocabulary size
