Skip to main navigation Skip to search Skip to main content

From seed discovery to deep reconstruction: Predicting saliency in crowd via deep networks

  • Yanhao Zhang
  • , Lei Qin
  • , Qingming Huang
  • , Kuiyuan Yang
  • , Jun Zhang
  • , Hongxun Yao
  • Harbin Institute of Technology
  • Chinese Academy of Sciences
  • Microsoft USA
  • Hefei University of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Although saliency prediction in crowd has been recently recognized as an essential task for video analysis, it is not comprehensively explored yet. The challenges lie in that eyefixations in crowded scenes are inherently\distinct"and\multimodal", which difiers from those in regular scenes. To this end, the existing saliency prediction schemes typically rely on hand designed features with shallow learning paradigm, which neglect the underlying characteristics of crowded scenes. In this paper, we propose a saliency prediction model dedicated for crowd videos with two novelties: 1) Distinct units are discovered using deep representation learned by a Stacked Denoising Auto-Encoder (SDAE), considering perceptual properties of crowd saliency; 2) Contrast-based saliency is measured through deep reconstruction errors in the second SDAE trained on all units excluding distinct units. A unified model is integrated for online processing crowd saliency. Extensive evaluations on two crowd video benchmark datasets demonstrate that our approach can effectively explore crowd saliency mechanism in two-stage SDAEs and achieve significantly better results than state-of-the-art methods, with robustness to parameters.

Original languageEnglish
Title of host publicationMM 2016 - Proceedings of the 2016 ACM Multimedia Conference
PublisherAssociation for Computing Machinery, Inc
Pages72-76
Number of pages5
ISBN (Electronic)9781450336031
DOIs
StatePublished - 1 Oct 2016
Event24th ACM Multimedia Conference, MM 2016 - Amsterdam, United Kingdom
Duration: 15 Oct 201619 Oct 2016

Publication series

NameMM 2016 - Proceedings of the 2016 ACM Multimedia Conference

Conference

Conference24th ACM Multimedia Conference, MM 2016
Country/TerritoryUnited Kingdom
CityAmsterdam
Period15/10/1619/10/16

Keywords

  • Crowd saliency
  • Deep Auto Encoders
  • Reconstruction errors

Fingerprint

Dive into the research topics of 'From seed discovery to deep reconstruction: Predicting saliency in crowd via deep networks'. Together they form a unique fingerprint.

Cite this