TY - GEN
T1 - Pair-wise event detection using cubic features and sequence discriminant learning
AU - Fang, Xiaoyu
AU - Tian, Yonghong
AU - Wang, Yaowei
AU - Su, Chi
AU - Xu, Teng
AU - Xia, Ziwei
AU - Gao, Wen
PY - 2013
Y1 - 2013
N2 - Event detection in crowded surveillance videos is a challenging yet important problem. This paper focuses on pair-wise events that involve the interaction of two persons (e.g., people embrace, meet or split) in crowded videos. To detect such an event accurately, we should build an effective representation model that can characterize the sequential properties of two persons' interaction. Towards this end, we propose a novel pair-wise event detection approach using cubic features and sequence discriminant learning. A video sequence is first partitioned into several spatio-temporal cubes, and multiple features (e.g., statistics of trajectories, bag of spatio-temporal interest points) are extracted on these cubes and then fused to form a cubic feature descriptor under multiple kernel learning (MKL) framework. After that, the SVM with dynamic time alignment kernel is used to infer the existence of an event in the video sequence. Experimental results show that the proposed approach achieves the encouraging performance on TRECVid SED dataset.
AB - Event detection in crowded surveillance videos is a challenging yet important problem. This paper focuses on pair-wise events that involve the interaction of two persons (e.g., people embrace, meet or split) in crowded videos. To detect such an event accurately, we should build an effective representation model that can characterize the sequential properties of two persons' interaction. Towards this end, we propose a novel pair-wise event detection approach using cubic features and sequence discriminant learning. A video sequence is first partitioned into several spatio-temporal cubes, and multiple features (e.g., statistics of trajectories, bag of spatio-temporal interest points) are extracted on these cubes and then fused to form a cubic feature descriptor under multiple kernel learning (MKL) framework. After that, the SVM with dynamic time alignment kernel is used to infer the existence of an event in the video sequence. Experimental results show that the proposed approach achieves the encouraging performance on TRECVid SED dataset.
KW - Cubic feature
KW - event detection
KW - surveillance
UR - https://www.scopus.com/pages/publications/84885609861
U2 - 10.1109/ICME.2013.6607573
DO - 10.1109/ICME.2013.6607573
M3 - 会议稿件
AN - SCOPUS:84885609861
SN - 9781479900152
T3 - Proceedings - IEEE International Conference on Multimedia and Expo
BT - 2013 IEEE International Conference on Multimedia and Expo, ICME 2013
T2 - 2013 IEEE International Conference on Multimedia and Expo, ICME 2013
Y2 - 15 July 2013 through 19 July 2013
ER -