TY - GEN
T1 - Low bit-rate video coding via mode-dependent adaptive regression for wireless visual communications
AU - Liu, Xianming
AU - Wu, Xiaolin
AU - Gao, Xinwei
AU - Zhao, Debin
AU - Gao, Wen
PY - 2012
Y1 - 2012
N2 - In this paper, a practical video coding scheme is developed to realize state-of-the-art video coding efficiency with lower encoder complexity at low bit-rates, while supporting standard compliance and error resilience. Such an architecture is particularly attractive for wireless visual communications. At the encoder, multiple descriptions of a video sequence are generated in the spatio-temporal domain by temporal multiplexing and spatial adaptive downsampling. The resulting side descriptions are interleaved with each other in the temporal domain and retain conventional square sample grids in the spatial domain. As such, each side description can be compressed without any change to existing video coding standards. At the decoder, each side description is first decompressed and then reconstructed to the original resolution with the help of the other side description. In this procedure, the decoder recovers the original video sequence through constrained least squares regression, using a 2D or 3D piecewise autoregressive model depending on the prediction mode. In this way, the spatial and temporal correlations are fully exploited to achieve superior quality. Experimental results demonstrate that the proposed video coding scheme outperforms H.264 in rate-distortion performance at low bit-rates and achieves superior visual quality at medium bit-rates as well.
AB - In this paper, a practical video coding scheme is developed to realize state-of-the-art video coding efficiency with lower encoder complexity at low bit-rates, while supporting standard compliance and error resilience. Such an architecture is particularly attractive for wireless visual communications. At the encoder, multiple descriptions of a video sequence are generated in the spatio-temporal domain by temporal multiplexing and spatial adaptive downsampling. The resulting side descriptions are interleaved with each other in the temporal domain and retain conventional square sample grids in the spatial domain. As such, each side description can be compressed without any change to existing video coding standards. At the decoder, each side description is first decompressed and then reconstructed to the original resolution with the help of the other side description. In this procedure, the decoder recovers the original video sequence through constrained least squares regression, using a 2D or 3D piecewise autoregressive model depending on the prediction mode. In this way, the spatial and temporal correlations are fully exploited to achieve superior quality. Experimental results demonstrate that the proposed video coding scheme outperforms H.264 in rate-distortion performance at low bit-rates and achieves superior visual quality at medium bit-rates as well.
KW - Low bit-rates
KW - adaptive regression
KW - mode dependent
KW - wireless visual communications
UR - https://www.scopus.com/pages/publications/84874062461
U2 - 10.1109/VCIP.2012.6410852
DO - 10.1109/VCIP.2012.6410852
M3 - Conference contribution
AN - SCOPUS:84874062461
SN - 9781467344050
T3 - 2012 IEEE Visual Communications and Image Processing, VCIP 2012
BT - 2012 IEEE Visual Communications and Image Processing, VCIP 2012
T2 - 2012 IEEE Visual Communications and Image Processing, VCIP 2012
Y2 - 27 November 2012 through 30 November 2012
ER -