Exploring Invertible Encoding for Deep Video Compression

Research output: Journal PublicationsJournal Article (refereed)peer-review

2 Citations (Scopus)

Abstract

Deep video compression methods typically use autoencoder-style networks for encoding and decoding, which can result in the loss of information during encoding that cannot be retrieved during decoding. To address this issue, recent work has explored the use of invertible neural networks for enhanced invertible encoding, which has successfully mitigated spatial information loss for better image compression. In this paper, we propose a new approach that extends invertible encoding to temporal information and introduces an encoding-decoding network for deep video compression. Our network incorporates a novel attentive channel squeeze module to improve compression performance while also leveraging a conditional coding framework for motion compression. The entire framework is optimized via a single loss function that balances bit cost and frame quality. The experimental results demonstrate the effectiveness of our approach, which achieves 15.45%/57.92% bit savings in terms of PSNR/MS-SSIM compared with the high-efficiency video coding low-delay P configuration.
Original languageEnglish
Pages (from-to)517-528
Number of pages12
JournalIEEE Transactions on Broadcasting
Volume71
Issue number2
Early online date27 Feb 2025
DOIs
Publication statusPublished - Jun 2025

Bibliographical note

Publisher Copyright:
© 1963-12012 IEEE.

Funding

This work was supported in part by the Hong Kong Innovation and Technology Commission (InnoHK Project CIMDA); in part by the Hong Kong General Research Fund under Grant 11209819 and Grant 11203820; in part by the National Natural Science Foundation of China under Grant 62176027; in part by Chongqing Talent under Grant cstc2024ycjhbgzxm0082; in part by the Central University Operating Expenses under Grant 2024CDJGF-044; and in part by the Key Project of Science and Technology Innovation 2030 funded by the Ministry of Science and Technology of China under Grant 2018AAA0101301.

Keywords

  • attentive channel squeeze
  • better transformation
  • deep video compression
  • Invertible encoding

Fingerprint

Dive into the research topics of 'Exploring Invertible Encoding for Deep Video Compression'. Together they form a unique fingerprint.

Cite this