Cross-media manifold learning for image retrieval & annotation

  • Xianming Liu*
  • Rongrong Ji
  • Hongxun Yao
  • Pengfei Xu
  • Xiaoshuai Sun
  • Tianqiang Liu

*Corresponding author for this work

School of Computer Science and Technology, Harbin Institute of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Fusing visual content with textual information is an effective approach to both content-based and keyword-based image retrieval. However, the performance of visual & textual fusion is strongly affected by noise and redundancy in both the textual data (such as the surrounding text in HTML pages) and the visual data (such as intra-class diversity). This paper presents a manifold-based cross-media optimization scheme that achieves visual & textual fusion within a unified framework. A cross-media manifold co-training mechanism between a Keyword-Based Metric Space and a Vision-Based Metric Space is proposed to infer the best dual-space fusion by minimizing a manifold-based visual & textual energy criterion. We present Isomorphic Manifold Learning to map the annotation affection in the image visual space onto the keyword semantic space by manifold shrinkage, and we demonstrate its correctness and convergence from a mathematical perspective. Retrieval can be performed using either keywords or sample images, on the Keyword-Based Metric Space and the Vision-Based Metric Space respectively, where simple distance classifiers suffice. Two groups of experiments are conducted. The first is carried out on the Corel 5000 image database to validate the method's effectiveness by comparison with the state-of-the-art Generalized Manifold Ranking Based Image Retrieval and SVM. The second is carried out on a real-world Flickr dataset of over 6,000 images to test its effectiveness in a real-world application. The promising results show that our model attains a significant improvement over state-of-the-art algorithms.
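The abstract notes that, once images and keywords are embedded in a metric space, retrieval reduces to a simple distance classifier. The following is a minimal sketch of that final step only, not the paper's co-training or manifold learning procedure. The function `rank_by_distance`, the 2-D vectors, and the item names are all hypothetical stand-ins for a learned Vision-Based or Keyword-Based Metric Space, and Euclidean distance is an assumed choice of metric.

```python
import math


def rank_by_distance(query, database, top_k=3):
    """Return the top_k (item_id, distance) pairs closest to the query.

    `database` maps an item id to its coordinates in the (assumed, already
    learned) metric space. Euclidean distance is an illustrative choice.
    """
    scored = [(item_id, math.dist(query, vec)) for item_id, vec in database.items()]
    scored.sort(key=lambda pair: pair[1])  # smallest distance first
    return scored[:top_k]


# Hypothetical 2-D embeddings; real embeddings would come from the learned space.
embedded_images = {
    "beach_01": (0.9, 0.1),
    "beach_02": (0.8, 0.2),
    "forest_01": (0.1, 0.9),
    "city_01": (0.5, 0.5),
}

# A query by sample image is just another point in the same space.
query_embedding = (0.9, 0.12)
results = rank_by_distance(query_embedding, embedded_images, top_k=2)
print([item_id for item_id, _ in results])
```

A query by keyword would work the same way under this sketch, with the query point taken from the keyword space instead of the visual one.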

Original language: English
Title of host publication: Proceedings of the 1st International ACM Conference on Multimedia Information Retrieval, MIR2008, Co-located with the 2008 ACM International Conference on Multimedia, MM'08
Pages: 141-148
Number of pages: 8
State: Published - 2008
Externally published: Yes
Event: 1st International ACM Conference on Multimedia Information Retrieval, MIR2008, Co-located with the 2008 ACM International Conference on Multimedia, MM'08 - Vancouver, BC, Canada
Duration: 30 Aug 2008 → 31 Aug 2008

Publication series

Name: Proceedings of the 1st International ACM Conference on Multimedia Information Retrieval, MIR2008, Co-located with the 2008 ACM International Conference on Multimedia, MM'08

Conference

Conference: 1st International ACM Conference on Multimedia Information Retrieval, MIR2008, Co-located with the 2008 ACM International Conference on Multimedia, MM'08
Country/Territory: Canada
City: Vancouver, BC
Period: 30/08/08 → 31/08/08

Keywords

  • Automatic image annotation
  • Co-training
  • Content-based image retrieval
  • Manifold learning
  • Web image search
