Skip to main navigation Skip to search Skip to main content

CTU-Level Rate Control with λ Optimization Based on Visual Gaze Mechanism for 360-Degree Versatile Video Coding

  • Zeming ZHAO
  • , Meng WANG
  • , Xiangjie SUI
  • , Xiaohai HE
  • , Shiqi WANG

Research output: Book Chapters | Papers in Conference ProceedingsConference paper (refereed)Referred Conference Paperpeer-review

Abstract

Understanding the human visual gaze mechanism is crucial for enhancing 360° video coding technology. This paper presents a Coding Tree Unit (CTU)-level rate control scheme with λ optimization strategy for 360° Versatile Video Coding (VVC), with the aim of enhancing rate-distortion performance and bitrate accuracy. Specifically, the Lagrange parameter λ is optimized with consideration of distortion dependency and the identification of key CTUs guided by visual gaze, which are derived from a 360° video path generation network, thoroughly integrating the characteristics of the human visual gaze. Experimental results show that the proposed scheme achieves BD-rate savings in terms of Weighted to Spherically uniform-Peak Signal-to-Noise Ratio (WS-PSNR) and Sphere-Peak Signal-to-Noise Ratio (S-PSNR) across various coding configurations.
Original languageEnglish
Title of host publication2025 IEEE International Conference on Image Processing, ICIP 2025 - Proceedings
Pages641-646
Number of pages6
ISBN (Electronic)9798331523794
DOIs
Publication statusPublished - 18 Aug 2025
Event2025 IEEE International Conference on Image Processing (ICIP) - Anchorage, AK, USA, Alaska, United States
Duration: 14 Sept 202517 Sept 2025

Publication series

NameProceedings - International Conference on Image Processing, ICIP
ISSN (Print)1522-4880

Conference

Conference2025 IEEE International Conference on Image Processing (ICIP)
Abbreviated titleICIP
Country/TerritoryUnited States
CityAlaska
Period14/09/2517/09/25

Bibliographical note

Publisher Copyright:
©2025 IEEE.

Funding

This work was supported in part by ITF Project GHP/044/21SZ; in part by RGC General Research Fund 11203220/11200323; in part by the National Natural Science Foundation of China (Grant No. 62271336 and Grant No. 62211530110) and in part by the Key Research and Development Program of Sichuan Province (Grant No. 2024YFHZ0289).

Keywords

  • 360° video coding
  • distortion dependency
  • rate control
  • scanpath generator
  • λ optimization

Fingerprint

Dive into the research topics of 'CTU-Level Rate Control with λ Optimization Based on Visual Gaze Mechanism for 360-Degree Versatile Video Coding'. Together they form a unique fingerprint.

Cite this