Monotonic Neural Network based Rate-Distortion Modeling for H.266/VVC

  • Xiang PAN
  • , Zhenzhong CHEN*
  • *Corresponding author for this work

Research output: Book Chapters | Papers in Conference ProceedingsConference paper (refereed)Researchpeer-review

Abstract

Employing neural networks (NN) for rate-distortion (R-D) modeling in image/video coding has received growing attention. Typically, NNs are used to capture the R-D relationship through the rate-quantizer (R-Q) and distortion-quantizer (D-Q) models in various works. However, existing NN-based schemes do not consider the monotonicity of the R-D relationship, resulting in inferior distortion predictions. We design a monotonic neural network-based scheme to tackle this issue. We first adopt linear scaling to ground truth to make the R-Q and D-Q relationship always monotonic increasing. We then train an NN to learn the mapping from the latent features and quantizer to the rate/distortion while enforcing the derivative of NN predicted R/D w.r.t the QP is strictly positive. As such, the monotonic R-D relationship can be maintained. Simulation results on the Vimeo dataset with H.266/VVC show that the proposed approach is better than the state-of-the-art NN-based scheme in prediction accuracy and maintaining monotonicity.
Original languageEnglish
Title of host publicationIEEE International Symposium on Circuits and Systems 2025, ISCAS 2025: Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Number of pages5
ISBN (Electronic)9798350356830
DOIs
Publication statusPublished - 2025
Externally publishedYes
EventIEEE International Symposium on Circuits and Systems 2025 - London, United Kingdom
Duration: 25 May 202528 May 2025

Publication series

NameIEEE International Symposium on Circuits and Systems Proceedings
PublisherIEEE
ISSN (Print)0271-4310
ISSN (Electronic)2158-1525

Symposium

SymposiumIEEE International Symposium on Circuits and Systems 2025
Abbreviated titleISCAS 2025
Country/TerritoryUnited Kingdom
CityLondon
Period25/05/2528/05/25

Bibliographical note

Publisher Copyright:
© 2025 IEEE.

Fingerprint

Dive into the research topics of 'Monotonic Neural Network based Rate-Distortion Modeling for H.266/VVC'. Together they form a unique fingerprint.

Cite this