TY - GEN
T1 - Shot Segmentation Method Based on Image Similarity and Deep Residual Network
AU - Ming, Baolin
AU - Lyu, Desheng
AU - Yu, Dengsha
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/5/20
Y1 - 2021/5/20
N2 - In order to segment the video according to the gradual shots, mutation shots or constant shots, so as to apply to the subsequent video analysis algorithm, this paper designs shot boundary detection method based on image similarity combined with deep residual network. The image similarity feature has better segmentation effect for mutation shots, but poor detection ability for gradual shots. The deep neural network can perform better segmentation of gradual shots, but the amount of calculation is relatively large. Based on the combination of the depth feature of the video frame and the principle of image mutual information, this paper initially extracts video boundary candidate points. At the same time, 3D residual network is designed and the shot boundary segmentation training is carried out according to the Clipshots dataset. Finally, the trained network is used to analyze the real video shot boundary from the candidate points and determine its category. The experimental results show that this method has better shot segmentation effect for VR movies and network user videos.
AB - In order to segment the video according to the gradual shots, mutation shots or constant shots, so as to apply to the subsequent video analysis algorithm, this paper designs shot boundary detection method based on image similarity combined with deep residual network. The image similarity feature has better segmentation effect for mutation shots, but poor detection ability for gradual shots. The deep neural network can perform better segmentation of gradual shots, but the amount of calculation is relatively large. Based on the combination of the depth feature of the video frame and the principle of image mutual information, this paper initially extracts video boundary candidate points. At the same time, 3D residual network is designed and the shot boundary segmentation training is carried out according to the Clipshots dataset. Finally, the trained network is used to analyze the real video shot boundary from the candidate points and determine its category. The experimental results show that this method has better shot segmentation effect for VR movies and network user videos.
KW - deep neural network
KW - feature fusion
KW - image similarity
KW - shot segmentation
UR - https://www.scopus.com/pages/publications/85111442685
U2 - 10.1109/ICVR51878.2021.9483839
DO - 10.1109/ICVR51878.2021.9483839
M3 - 会议稿件
AN - SCOPUS:85111442685
T3 - International Conference on Virtual Rehabilitation, ICVR
SP - 41
EP - 45
BT - 2021 IEEE 7th International Conference on Virtual Reality, ICVR 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th IEEE International Conference on Virtual Reality, ICVR 2021
Y2 - 20 May 2021 through 22 May 2021
ER -