End-to-End Learning-Based Image Compression With a Decoupled Framework

Zhaobin ZHANG*, Semih ESENLIK, Yaojun WU, Meng WANG, Kai ZHANG*, Li ZHANG*

*Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

3 Citations (Scopus)

Abstract

The autoregressive model has been widely used in learning-based image compression due to its superior context modeling capability. However, its sequential processing nature also undermines the ability of decoding in parallel and hinders the deployment in real applications. In this paper, we propose a decoupled framework to resolve this issue. With the decoupled architecture, the entropy decoding process is independent of the latent sample reconstruction process. The entropy decoding process thus can be finished before the latent sample prediction process begins, which leads to significant decoding time savings by enabling the two processes to be conducted in parallel. To further reduce the decoding time, we introduce wavefront processing, where multiple rows can be processed simultaneously when reconstructing the latent samples. On top of that, we design a series of coding tools to improve the rate-distortion efficiency and reduce the decoding complexity. Device interoperability is also supported by the proposed solution, where the same bitstream can be successfully decoded on different CPU/GPU devices. Comprehensive experiments are conducted to validate the effectiveness of the proposed method. Using objective evaluation metrics required by JPEG AI Call for Proposals (CfP), the proposed method achieves a BD-rate change of -29.6% on average with 2.44 times faster decoding speed compared to VVC image coding. When compared to the commonly used benchmark learning-based methods, the proposed method achieves -30.5% BD-rate changes and 101 times faster decoding speed over cheng2020attn. The proposed solution has been proposed to JPEG AI and IEEE 1857.11 as a response to CfP and the core techniques have been adopted by both.
Original languageEnglish
Pages (from-to)3067-3081
Number of pages15
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume34
Issue number5
Early online date11 Sept 2023
DOIs
Publication statusPublished - May 2024
Externally publishedYes

Keywords

  • Decoupled
  • end-to-end
  • IEEE 1857.11
  • image compression
  • JPEG AI
  • neural network
  • wavefront

Fingerprint

Dive into the research topics of 'End-to-End Learning-Based Image Compression With a Decoupled Framework'. Together they form a unique fingerprint.

Cite this