Abstract
Cross-component linear model (CCLM) prediction has been repeatedly proven to be effective in reducing the inter-channel redundancies in video compression. Essentially speaking, the linear model is identically trained by employing accessible luma and chroma reference samples at both encoder and decoder, elevating the level of operational complexity due to the least square regression or max-min based model parameter derivation. In this paper, we investigate the capability of the linear model in the context of sub-sampled based cross-component correlation mining, as a means of significantly releasing the operation burden and facilitating the hardware and software design for both encoder and decoder. In particular, the sub-sampling ratios and positions are elaborately designed by exploiting the spatial correlation and the inter-channel correlation. Extensive experiments verify that the proposed method is characterized by its simplicity in operation and robustness in terms of rate-distortion performance, leading to the adoption by Versatile Video Coding (VVC) standard and the third generation of Audio Video Coding Standard (AVS3).
Original language | English |
---|---|
Article number | 9515710 |
Pages (from-to) | 7305-7316 |
Number of pages | 12 |
Journal | IEEE Transactions on Image Processing |
Volume | 30 |
Early online date | 17 Jun 2021 |
DOIs | |
Publication status | Published - 2021 |
Externally published | Yes |
Funding
This work was supported in part by the National Natural Science Foundation of China under Grant 62022002 and Grant 62088102 and in part by the National Science Fund for Distinguished Young Scholars under Grant 62025101.
Keywords
- AVS3
- Cross-component linear model
- cross-component prediction
- video coding
- VVC