Abstract
Understanding the human visual gaze mechanism is crucial for enhancing 360° video coding technology. This paper presents a Coding Tree Unit (CTU)-level rate control scheme with a λ optimization strategy for 360° Versatile Video Coding (VVC), aiming to improve both rate-distortion performance and bitrate accuracy. Specifically, the Lagrange parameter λ is optimized with consideration of distortion dependency and the identification of key CTUs guided by visual gaze, which are derived from a 360° video path generation network and thereby integrate the characteristics of human visual gaze. Experimental results show that the proposed scheme achieves BD-rate savings in terms of Weighted-to-Spherically-uniform Peak Signal-to-Noise Ratio (WS-PSNR) and Spherical Peak Signal-to-Noise Ratio (S-PSNR) across various coding configurations.
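As a rough illustration of the WS-PSNR metric named in the abstract, the sketch below computes it for a single-channel frame. It assumes an equirectangular projection (ERP), which the abstract does not specify, and uses the commonly cited ERP latitude weight cos((j + 0.5 − H/2)·π/H) for row j of an H-row frame. The function names `erp_row_weights` and `ws_psnr` are hypothetical and are not taken from the paper or from any reference software.

```python
import math
from typing import List, Sequence


def erp_row_weights(height: int) -> List[float]:
    """Per-row weights for an ERP frame (assumed projection).

    Each row is weighted by the cosine of its latitude, so rows near the
    equator count more than rows near the poles.
    """
    return [
        math.cos((j + 0.5 - height / 2) * math.pi / height)
        for j in range(height)
    ]


def ws_psnr(
    ref: Sequence[Sequence[float]],
    rec: Sequence[Sequence[float]],
    max_val: float = 255.0,
) -> float:
    """Weighted-to-spherically-uniform PSNR of one channel, in dB."""
    weights = erp_row_weights(len(ref))
    weighted_sq_err = 0.0
    weight_sum = 0.0
    for w, ref_row, rec_row in zip(weights, ref, rec):
        for a, b in zip(ref_row, rec_row):
            weighted_sq_err += w * (a - b) ** 2
            weight_sum += w
    wmse = weighted_sq_err / weight_sum
    if wmse == 0:
        return math.inf  # identical frames
    return 10.0 * math.log10(max_val ** 2 / wmse)


if __name__ == "__main__":
    ref = [[100.0] * 8 for _ in range(4)]
    rec = [[101.0] * 8 for _ in range(4)]  # uniform error of one level
    print(round(ws_psnr(ref, rec), 2))
```

With a uniform error of one intensity level the weighted MSE is exactly 1, so the result equals 20·log10(255), about 48.13 dB. An error confined to a row near the pole lowers the score less than the same error on a row near the equator, which is the property that distinguishes WS-PSNR from plain PSNR.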
| Original language | English |
|---|---|
| Title of host publication | 2025 IEEE International Conference on Image Processing, ICIP 2025 - Proceedings |
| Pages | 641-646 |
| Number of pages | 6 |
| ISBN (Electronic) | 9798331523794 |
| DOIs | |
| Publication status | Published - 18 Aug 2025 |
| Event | 2025 IEEE International Conference on Image Processing (ICIP) - Anchorage, Alaska, United States. Duration: 14 Sept 2025 → 17 Sept 2025 |
Publication series
| Name | Proceedings - International Conference on Image Processing, ICIP |
|---|---|
| ISSN (Print) | 1522-4880 |
Conference
| Conference | 2025 IEEE International Conference on Image Processing (ICIP) |
|---|---|
| Abbreviated title | ICIP |
| Country/Territory | United States |
| City | Anchorage, Alaska |
| Period | 14/09/25 → 17/09/25 |
Bibliographical note
Publisher Copyright: © 2025 IEEE.
Funding
This work was supported in part by ITF Project GHP/044/21SZ; in part by RGC General Research Fund 11203220/11200323; in part by the National Natural Science Foundation of China (Grant No. 62271336 and Grant No. 62211530110); and in part by the Key Research and Development Program of Sichuan Province (Grant No. 2024YFHZ0289).
Keywords
- 360° video coding
- distortion dependency
- rate control
- scanpath generator
- λ optimization
Title
CTU-Level Rate Control with λ Optimization Based on Visual Gaze Mechanism for 360-Degree Versatile Video Coding