TY - JOUR
T1 - Convolutional Neural Network Based Synthesized View Quality Enhancement for 3D Video Coding
AU - ZHU, Linwei
AU - ZHANG, Yun
AU - WANG, Shiqi
AU - YUAN, Hui
AU - KWONG, Sam
AU - IP, Horace H. S.
PY - 2018/11
Y1 - 2018/11
N2 - The quality of synthesized view plays an important role in the three dimensional (3D) video system. In this paper, to further improve the coding efficiency, a convolutional neural network (CNN) based synthesized view quality enhancement method for 3D High Efficiency Video Coding (HEVC) is proposed. Firstly, the distortion elimination in synthesized view is formulated as an image restoration task with the aim to reconstruct the latent distortion free synthesized image. Secondly, the learned CNN models are incorporated into 3D HEVC codec to improve the view synthesis performance for both view synthesis optimization (VSO) and the final synthesized view, where the geometric and compression distortions are considered according to the specific characteristics of synthesized view. Thirdly, a new Lagrange multiplier in the rate-distortion (RD) cost function is derived to adapt the CNN based VSO process to embrace a better 3D video coding performance. Extensive experimental results show that the proposed scheme can efficiently eliminate the artifacts in the synthesized image, and reduce 25.9% and 11.7% bit rate in terms of peak-signal-to-noise ratio (PSNR) and structural similarity (SSIM) index, which significantly outperforms the state-of-theart methods.
AB - The quality of synthesized view plays an important role in the three dimensional (3D) video system. In this paper, to further improve the coding efficiency, a convolutional neural network (CNN) based synthesized view quality enhancement method for 3D High Efficiency Video Coding (HEVC) is proposed. Firstly, the distortion elimination in synthesized view is formulated as an image restoration task with the aim to reconstruct the latent distortion free synthesized image. Secondly, the learned CNN models are incorporated into 3D HEVC codec to improve the view synthesis performance for both view synthesis optimization (VSO) and the final synthesized view, where the geometric and compression distortions are considered according to the specific characteristics of synthesized view. Thirdly, a new Lagrange multiplier in the rate-distortion (RD) cost function is derived to adapt the CNN based VSO process to embrace a better 3D video coding performance. Extensive experimental results show that the proposed scheme can efficiently eliminate the artifacts in the synthesized image, and reduce 25.9% and 11.7% bit rate in terms of peak-signal-to-noise ratio (PSNR) and structural similarity (SSIM) index, which significantly outperforms the state-of-theart methods.
KW - 3D high efficiency video coding
KW - Convolutional neural network
KW - depth coding
KW - Lagrange multiplier
KW - view synthesis
UR - http://www.scopus.com/inward/record.url?scp=85050404712&partnerID=8YFLogxK
U2 - 10.1109/TIP.2018.2858022
DO - 10.1109/TIP.2018.2858022
M3 - Journal Article (refereed)
SN - 1057-7149
VL - 27
SP - 5365
EP - 5377
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
IS - 11
ER -