Abstract
In the past few years, 360° video has become part of many aspects of daily life. Although 360° video coding technology has advanced significantly, the human visual gaze mechanism has been somewhat overlooked. In this paper, we propose a rate control scheme for 360° Versatile Video Coding (VVC) based on the human visual gaze mechanism, aiming to improve coding performance and bitrate accuracy. More specifically, for the Equirectangular Projection (ERP) format, latitude information is systematically analyzed and a stripe-level bit allocation scheme is established to better mitigate projection distortion. Subsequently, the Lagrange parameter λ is further optimized by accounting for distortion dependency and by identifying the key Coding Tree Units (CTUs) guided by visual gaze. The proposed rate control scheme is implemented on the VVC Test Model for 360° video. Experimental results show that the proposed scheme achieves BD-rate savings in terms of both Weighted-to-Spherically-uniform Peak Signal-to-Noise Ratio (WS-PSNR) and Spherical Peak Signal-to-Noise Ratio (S-PSNR) under various configurations. Meanwhile, a healthier buffer status and better visual quality are observed, further demonstrating the advantages of the proposed scheme.
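To make the role of latitude concrete, the following is a minimal sketch of the standard WS-PSNR row weighting for ERP frames, where each row is weighted by the cosine of its latitude. The function names are hypothetical, and `stripe_bit_shares` in particular is only an illustrative assumption: it splits a bit budget across horizontal stripes in proportion to their summed weights and is not the allocation scheme proposed in the paper.

```python
import math

def erp_row_weights(height):
    """WS-PSNR weight per ERP row: cosine of the latitude at the row centre."""
    return [math.cos((j + 0.5 - height / 2) * math.pi / height)
            for j in range(height)]

def ws_psnr(ref, rec, max_val=255.0):
    """WS-PSNR between two equally sized ERP frames given as lists of rows."""
    weights = erp_row_weights(len(ref))
    num = 0.0  # weighted sum of squared errors
    den = 0.0  # sum of weights over all samples
    for w, ref_row, rec_row in zip(weights, ref, rec):
        for a, b in zip(ref_row, rec_row):
            num += w * (a - b) ** 2
            den += w
    wmse = num / den
    if wmse == 0:
        return float("inf")  # identical frames
    return 10.0 * math.log10(max_val ** 2 / wmse)

def stripe_bit_shares(height, stripe_rows):
    """Illustrative assumption only: share of a frame's bits per horizontal
    stripe, proportional to the stripe's summed latitude weights."""
    weights = erp_row_weights(height)
    sums = [sum(weights[s:s + stripe_rows])
            for s in range(0, height, stripe_rows)]
    total = sum(sums)
    return [s / total for s in sums]
```

Under this weighting, stripes near the equator receive a larger share than stripes near the poles, which reflects the oversampling of polar regions in ERP.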
| Original language | English |
|---|---|
| Pages (from-to) | 1-11 |
| Number of pages | 11 |
| Journal | IEEE Transactions on Multimedia |
| DOIs | |
| Publication status | E-pub ahead of print - 2 Mar 2026 |
Bibliographical note
Publisher Copyright: © 1999-2012 IEEE.
Funding
This work was supported in part by ITF Project GHP/044/21SZ; in part by RGC General Research Fund 11203220/11200323; in part by the National Natural Science Foundation of China (Grant No. 62271336) and in part by the Key Research and Development Program of Sichuan Province (Grant No. 2024YFHZ0289).
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
- SDG 3 Good Health and Well-being
Keywords
- 360° video coding
- human visual gaze mechanism
- latitude information
- rate control
- scanpath generator
Fingerprint
Dive into the research topics of 'Rate Control for 360° Versatile Video Coding Based on Visual Gaze Mechanism'. Together they form a unique fingerprint.