Abstract
In this paper, we propose a novel framework to exploit and utilize the shared information inner RGB-D data for efficient depth map compression. Two main codecs, designed based on the existing end-to-end image compression network, are adopted for RGB image compression and enhanced depth image compression with RGB-to-Depth structure prior, respectively. In particular, we propose a Structure Prior Fusion (SPF) module to extract the structure information from both RGB and depth codecs at multi-scale feature levels and fuse the cross-modal feature to generate more efficient structure priors for depth compression. Extensive experiments show that the proposed framework can achieve competitive rate-distortion performance as well as RGB-D task-specific performance at depth map compression compared with the direct compression scheme.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2022 IEEE International Conference on Image Processing |
Publisher | IEEE |
Pages | 3206-3210 |
Number of pages | 5 |
ISBN (Print) | 9781665496209 |
DOIs | |
Publication status | Published - Oct 2022 |
Externally published | Yes |
Event | 29th IEEE International Conference on Image Processing, ICIP 2022 - Bordeaux, France Duration: 16 Oct 2022 → 19 Oct 2022 |
Conference
Conference | 29th IEEE International Conference on Image Processing, ICIP 2022 |
---|---|
Country/Territory | France |
City | Bordeaux |
Period | 16/10/22 → 19/10/22 |
Bibliographical note
This work was supported in part by the National Natural Science Foundation of China (Grant 61871270), in part by the Shenzhen Natural Science Foundation under Grants JCYJ20200109110410133 and 20200812110350001, in part by the A*STAR under YIRG project No. A2084c0176 and AHSF project No. C211118005.Keywords
- cross-modal
- Depth map compression
- feature fusion