Abstract
Employing neural networks (NN) for rate-distortion (R-D) modeling in image/video coding has received growing attention. Typically, NNs are used to capture the R-D relationship through the rate-quantizer (R-Q) and distortion-quantizer (D-Q) models in various works. However, existing NN-based schemes do not consider the monotonicity of the R-D relationship, resulting in inferior distortion predictions. We design a monotonic neural network-based scheme to tackle this issue. We first adopt linear scaling to ground truth to make the R-Q and D-Q relationship always monotonic increasing. We then train an NN to learn the mapping from the latent features and quantizer to the rate/distortion while enforcing the derivative of NN predicted R/D w.r.t the QP is strictly positive. As such, the monotonic R-D relationship can be maintained. Simulation results on the Vimeo dataset with H.266/VVC show that the proposed approach is better than the state-of-the-art NN-based scheme in prediction accuracy and maintaining monotonicity.
| Original language | English |
|---|---|
| Title of host publication | IEEE International Symposium on Circuits and Systems 2025, ISCAS 2025: Proceedings |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Number of pages | 5 |
| ISBN (Electronic) | 9798350356830 |
| DOIs | |
| Publication status | Published - 2025 |
| Externally published | Yes |
| Event | IEEE International Symposium on Circuits and Systems 2025 - London, United Kingdom Duration: 25 May 2025 → 28 May 2025 |
Publication series
| Name | IEEE International Symposium on Circuits and Systems Proceedings |
|---|---|
| Publisher | IEEE |
| ISSN (Print) | 0271-4310 |
| ISSN (Electronic) | 2158-1525 |
Symposium
| Symposium | IEEE International Symposium on Circuits and Systems 2025 |
|---|---|
| Abbreviated title | ISCAS 2025 |
| Country/Territory | United Kingdom |
| City | London |
| Period | 25/05/25 → 28/05/25 |
Bibliographical note
Publisher Copyright:© 2025 IEEE.
Fingerprint
Dive into the research topics of 'Monotonic Neural Network based Rate-Distortion Modeling for H.266/VVC'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver