TY - GEN
T1 - Adaptive Spatio-Temporal Convolutional Network for Video Deblurring
AU - Duan, Fengzhi
AU - Yao, Hongxun
N1 - Publisher Copyright:
© 2021, Springer Nature Switzerland AG.
PY - 2021
Y1 - 2021
N2 - Video deblurring is a challenging task due to the spatially variant blur caused by camera shake, object motions, and depth variations, etc. However, for the blurred area in the current video frame, the corresponding pixels of its neighboring video frames are often clear. Based on this observation, we propose an Adaptive Spatio-Temporal Convolutional Network (ASTCN) to compensate for blurry pixels in the current frame by using clear pixels in adjacent frames. In order to use the spatial information of adjacent frames in the current frame, the video frames must be aligned first. Existing methods usually estimate optical flow in the blurry video to align consecutive frames. However, they tend to generate artifacts when the estimated optical flow is not accurate. In order to overcome the limitations of optical flow estimation, we use deformable convolution in ASTCN to complete multi-scale adjacent frame alignment at the feature level. Secondly, we propose an adaptive spatio-temporal feature fusion module based on dynamic filters, which uses the features of the clear regions of adjacent frames to perform adaptive feature transformation on the intermediate frame to remove the blur. Extensive experimental results show that the proposed algorithm has shown superior performance on the benchmark datasets as well as real-world videos.
AB - Video deblurring is a challenging task due to the spatially variant blur caused by camera shake, object motions, and depth variations, etc. However, for the blurred area in the current video frame, the corresponding pixels of its neighboring video frames are often clear. Based on this observation, we propose an Adaptive Spatio-Temporal Convolutional Network (ASTCN) to compensate for blurry pixels in the current frame by using clear pixels in adjacent frames. In order to use the spatial information of adjacent frames in the current frame, the video frames must be aligned first. Existing methods usually estimate optical flow in the blurry video to align consecutive frames. However, they tend to generate artifacts when the estimated optical flow is not accurate. In order to overcome the limitations of optical flow estimation, we use deformable convolution in ASTCN to complete multi-scale adjacent frame alignment at the feature level. Secondly, we propose an adaptive spatio-temporal feature fusion module based on dynamic filters, which uses the features of the clear regions of adjacent frames to perform adaptive feature transformation on the intermediate frame to remove the blur. Extensive experimental results show that the proposed algorithm has shown superior performance on the benchmark datasets as well as real-world videos.
KW - Dynamic filter
KW - Pixel quality compensation
KW - Video deblurring
UR - https://www.scopus.com/pages/publications/85117131615
U2 - 10.1007/978-3-030-87361-5_63
DO - 10.1007/978-3-030-87361-5_63
M3 - 会议稿件
AN - SCOPUS:85117131615
SN - 9783030873608
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 777
EP - 788
BT - Image and Graphics - 11th International Conference, ICIG 2021, Proceedings
A2 - Peng, Yuxin
A2 - Hu, Shi-Min
A2 - Gabbouj, Moncef
A2 - Zhou, Kun
A2 - Elad, Michael
A2 - Xu, Kun
PB - Springer Science and Business Media Deutschland GmbH
T2 - 11th International Conference on Image and Graphics, ICIG 2021
Y2 - 6 August 2021 through 8 August 2021
ER -