Skip to main navigation Skip to search Skip to main content

Latent gaussian-multinomial generative model for annotated data

  • Shuoran Jiang
  • , Yarui Chen*
  • , Zhifei Qin
  • , Jucheng Yang
  • , Tingting Zhao
  • , Chuanlei Zhang
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Traditional generative models annotate images by multiple instances independently segmented, but these models have been becoming prohibitively expensive and time-consuming along with the growth of Internet data. Focusing on the annotated data, we propose a latent Gaussian-Multinomial generative model (LGMG), which generates the image-annotations using a multimodal probabilistic models. Specifically, we use a continuous latent variable with prior of Normal distribution as the latent representation summarizing the high-level semantics of images, and a discrete latent variable with prior of Multinomial distribution as the topics indicator for annotation. We compute the variational posteriors from a mapping structure among latent representation, topics indicator and image-annotation. The stochastic gradient variational Bayes estimator on variational objective is realized by combining the reparameterization trick and Monte Carlo estimator. Finally, we demonstrate the performance of LGMG on LabelMe in terms of held-out likelihood, automatic image annotation with the state-of-the-art models.

Original languageEnglish
Title of host publicationAdvances in Knowledge Discovery and Data Mining - 23rd Pacific-Asia Conference, PAKDD 2019, Proceedings
EditorsZhiguo Gong, Qiang Yang, Min-Ling Zhang, Zhi-Hua Zhou, Sheng-Jun Huang
PublisherSpringer Verlag
Pages42-54
Number of pages13
ISBN (Print)9783030161477
DOIs
StatePublished - 2019
Externally publishedYes
Event23rd Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2019 - Macau, China
Duration: 14 Apr 201917 Apr 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11439 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference23rd Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2019
Country/TerritoryChina
CityMacau
Period14/04/1917/04/19

Keywords

  • Annotated data
  • Gaussian-Multinomial
  • Latent representation
  • Multimodal generative models
  • Topics indicator

Fingerprint

Dive into the research topics of 'Latent gaussian-multinomial generative model for annotated data'. Together they form a unique fingerprint.

Cite this