Leveraging Conv-Attention for Efficient and High-Quality JPEG AI Image Coding

Meng WANG*, Semih ESENLIK, Zhaobin ZHANG, Yaojun WU, Kai ZHANG, Li ZHANG, Shiqi WANG

*Corresponding author for this work

Research output: Book Chapters | Papers in Conference ProceedingsConference paper (refereed)Researchpeer-review

Abstract

In this paper, we present a Conv-Attention, a decoder-friendly attention mechanism, in an effort to advancing the practical application of the artificial intelligence-based image coding. More specifically, the proposed method is tailored for JPEG AI, which is the latest advanced neural-network based image coding standard. By identifying the obstacles by profiling the decoding complexity of JPEG AI, the attention module accounts for a significant proportion, which mainly attributes to the intricate network structure and involvement of less efficient operations. Conv-Attention model is composed with plain convolution and activation computations, equipping with sub-scaling and up-scaling design, such that the non-adjacent features can be well captured, leading to the reduction of decoding complexity and maintenance of the synthesis and attentive capability. Simulation results verify the effectiveness of the proposed method with JPEG AI reference software, wherein the decoding complexity is reduced by 80% with negligible coding performance loss. The proposed method was adopted in the 100th JPEG meeting.

Original languageEnglish
Title of host publicationProceedings : DCC 2024 : 2024 Data Compression Conference
EditorsAli BILGIN, James E. FOWLER, Joan SERRA-SAGRISTA, Yan YE, James A. STORER
PublisherIEEE
Pages43-52
Number of pages10
ISBN (Electronic)9798350385878
DOIs
Publication statusPublished - 2024
Externally publishedYes
Event2024 Data Compression Conference, DCC 2024 - Snowbird, United States
Duration: 19 Mar 202422 Mar 2024

Publication series

NameData Compression Conference Proceedings
ISSN (Print)1068-0314

Conference

Conference2024 Data Compression Conference, DCC 2024
Country/TerritoryUnited States
CitySnowbird
Period19/03/2422/03/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Fingerprint

Dive into the research topics of 'Leveraging Conv-Attention for Efficient and High-Quality JPEG AI Image Coding'. Together they form a unique fingerprint.

Cite this