Projects per year
Abstract
The Deep Contextual Video Compression framework (DCVC) utilizes a conditional coding paradigm, where the context is extracted and employed as a condition for the contextual encoder-decoder and entropy model. In this paper, we propose enhanced context mining and filtering to improve the compression efficiency of DCVC. Firstly, considering the context of DCVC is generated without supervision and redundancy may exist among context channels, an enhanced context mining model is proposed to mitigate redundancy across context channels to obtain superior context features. Then, we introduce a transformer-based enhancement network as a filtering module to capture long-distance dependencies and further enhance compression efficiency. The transformer-based enhancement adopts a full-resolution pipeline and calculates self-attention across channel dimensions. By combining the local modeling ability of the enhanced context mining model and the non-local modeling ability of the transformer-based enhancement network, our model outperforms LDP configurations of Versatile Video Coding (VVC), achieving an average bit savings of 6.7% in terms of MS-SSIM.
Original language | English |
---|---|
Pages (from-to) | 3814-3826 |
Number of pages | 13 |
Journal | IEEE Transactions on Multimedia |
Volume | 26 |
Early online date | 18 Sept 2023 |
DOIs | |
Publication status | Published - 2024 |
Bibliographical note
Publisher Copyright:© 1999-2012 IEEE.
Funding
Key Project of Science and Technology Innovation 2030 funded by the Ministry of Science and Technology of China (Grant Number: 2018AAA0101301) Hong Kong Innovation and Technology Commission (InnoHK Project CIMDA) Hong Kong GRFRGC General Research Fund (Grant Number: 11203820, 9042598, 11209819 and CityU 9042816)
Keywords
- Codes
- Context modeling
- end-to-end training approach
- enhanced context mining
- Entropy
- Filtering
- Image coding
- in loop filtering
- Learned video compression
- Transformers
- Video compression
Fingerprint
Dive into the research topics of 'Enhanced Context Mining and Filtering for Learned Video Compression'. Together they form a unique fingerprint.Projects
- 2 Finished
-
Adaptive Dynamic Range Enhancement Oriented to High Dynamic Display (面向高動態顯示的自適應動態範圍增強)
KWONG, S. T. W. (PI), KUO, C.-C. J. (CoI), WANG, S. (CoI) & ZHANG, X. (CoI)
Research Grants Council (HKSAR)
1/01/21 → 31/12/24
Project: Grant Research
-
Intelligent Ultra High Definition Video Encoder Optimization for Future Versatile Video Coding (用于未来多功能视频编码的智能超高清视频编码器优化)
KWONG, S. T. W. (PI), ZHOU, M. (CoI), KUO, C.-C. J. (CoI) & WANG, S. (CoI)
Research Grants Council (HKSAR)
1/01/20 → 30/06/23
Project: Grant Research