LFIC-DRASC: Deep Light Field Image Compression Using Disentangled Representation and Asymmetrical Strip Convolution

Research output: Journal PublicationsJournal Article (refereed)peer-review

2 Citations (Scopus)

Abstract

Light-Field (LF) image is emerging 4D data of light rays that is capable of realistically presenting spatial and angular information of 3D scene. However, the large data volume of LF images becomes the most challenging issue in real-time processing, transmission, and storage. In this paper, we propose an end-to-end deep LF Image Compression method Using Disentangled Representation and Asymmetrical Strip Convolution (LFIC-DRASC) to improve coding efficiency. Firstly, we formulate the LF image compression problem as learning a disentangled LF representation network and an image encoding-decoding network. Secondly, we propose two novel feature extractors that leverage the structural prior of LF data by integrating features across different dimensions. Meanwhile, disentangled LF representation network is proposed to enhance the LF feature disentangling and decoupling. Thirdly, we propose the LFIC-DRASC for LF image compression, where two Asymmetrical Strip Convolution (ASC) operators, i.e., horizontal and vertical, are proposed to capture long-range correlation in LF feature space. These two ASC operators can be combined with the square convolution to further decouple LF features, which enhances the model’s ability in representing intricate spatial relationships. Experimental results demonstrate that the proposed LFIC-DRASC achieves an average of 20.5% bit rate reductions compared with the state-of-the-art methods.

Original languageEnglish
Pages (from-to)889-902
Number of pages14
JournalIEEE Transactions on Broadcasting
Volume71
Issue number3
Early online date3 Jul 2025
DOIs
Publication statusPublished - Sept 2025

Bibliographical note

Publisher Copyright:
© 1963-2012 IEEE.

Funding

This work was supported in part by the National Natural Science Foundation of China under Grant 62172400; in part by the Shenzhen Natural Science Foundation under Grant JCYJ20240813180503005 and Grant JCYJ20230807140707015; in part by the Shenzhen Key Science and Technology Program under Grant JCYJ20241202124415021; in part by the Shenzhen-Hong Kong Collaborative Project Tier-A under Grant SGDX2024011505505010; and in part by the Guangdong Basic and Applied Basic Research Foundation under Grant 2025A1515012127.

Keywords

  • Deep learning
  • asymmetrical strip convolution
  • disentangled representation
  • image compression
  • light field

Fingerprint

Dive into the research topics of 'LFIC-DRASC: Deep Light Field Image Compression Using Disentangled Representation and Asymmetrical Strip Convolution'. Together they form a unique fingerprint.

Cite this