End-to-end depth map compression framework via RGB-to-depth structure priors learning

Minghui CHEN, Pingping ZHANG, Zhuo CHEN, Yun ZHANG, Xu WAGN, Sam KWONG

Research output: Book Chapters | Papers in Conference ProceedingsConference paper (refereed)Researchpeer-review

1 Citation (Scopus)

Abstract

In this paper, we propose a novel framework to exploit and utilize the shared information inner RGB-D data for efficient depth map compression. Two main codecs, designed based on the existing end-to-end image compression network, are adopted for RGB image compression and enhanced depth image compression with RGB-to-Depth structure prior, respectively. In particular, we propose a Structure Prior Fusion (SPF) module to extract the structure information from both RGB and depth codecs at multi-scale feature levels and fuse the cross-modal feature to generate more efficient structure priors for depth compression. Extensive experiments show that the proposed framework can achieve competitive rate-distortion performance as well as RGB-D task-specific performance at depth map compression compared with the direct compression scheme.
Original languageEnglish
Title of host publicationProceedings of the 2022 IEEE International Conference on Image Processing
PublisherIEEE
Pages3206-3210
Number of pages5
ISBN (Print)9781665496209
DOIs
Publication statusPublished - Oct 2022
Externally publishedYes
Event29th IEEE International Conference on Image Processing, ICIP 2022 - Bordeaux, France
Duration: 16 Oct 202219 Oct 2022

Conference

Conference29th IEEE International Conference on Image Processing, ICIP 2022
Country/TerritoryFrance
CityBordeaux
Period16/10/2219/10/22

Bibliographical note

This work was supported in part by the National Natural Science Foundation of China (Grant 61871270), in part by the Shenzhen Natural Science Foundation under Grants JCYJ20200109110410133 and 20200812110350001, in part by the A*STAR under YIRG project No. A2084c0176 and AHSF project No. C211118005.

Keywords

  • cross-modal
  • Depth map compression
  • feature fusion

Fingerprint

Dive into the research topics of 'End-to-end depth map compression framework via RGB-to-depth structure priors learning'. Together they form a unique fingerprint.

Cite this