Skip to main navigation Skip to search Skip to main content

Learning siamese network with top-down modulation for visual tracking

  • Harbin Institute of Technology
  • The Chinese University of Hong Kong, Shenzhen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The performance of visual object tracking depends largely on the target appearance model. Benefited from the success of CNN in feature extraction, recent studies have paid much attention to CNN representation learning and feature fusion model. However, the existing feature fusion models ignore the relation between the features of different layers. In this paper, we propose a deep feature fusion model based on the siamese network by considering the connection between feature maps of CNN. To tackle the limitation of different feature map sizes in CNN, we propose to fuse different resolution feature maps by introducing de-convolutional layers in the offline training stage. Specifically, a top-down modulation is adopted for feature fusion. In the tracking stage, a simple matching operation between the fused feature of the examplar and search region is conducted with the learned model, which can maintain the real-time tracking speed. Experimental results show that, the proposed method obtains favorable tracking accuracy against the state-of-the-art trackers with a real-time tracking speed.

Original languageEnglish
Title of host publicationIntelligence Science and Big Data Engineering - 8th International Conference, IScIDE 2018, Revised Selected Papers
EditorsKai Yu, Yuxin Peng, Xingpeng Jiang, Jiwen Lu
PublisherSpringer Verlag
Pages378-388
Number of pages11
ISBN (Print)9783030026974
DOIs
StatePublished - 2018
Event8th International Conference on Intelligence Science and Big Data Engineering, IScIDE 2018 - Lanzhou, China
Duration: 18 Aug 201819 Aug 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11266 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference8th International Conference on Intelligence Science and Big Data Engineering, IScIDE 2018
Country/TerritoryChina
CityLanzhou
Period18/08/1819/08/18

Keywords

  • Feature fusion
  • Siamese network
  • Visual tracking

Fingerprint

Dive into the research topics of 'Learning siamese network with top-down modulation for visual tracking'. Together they form a unique fingerprint.

Cite this