Abstract
In this paper, we propose a convolutional neural network (CNN)-based rate-distortion (R-D) modeling method for H.265/HEVC. A fully convolutional neural network (CNN) is designed to learn end-to-end, pixels-to-pixels mappings from the original images to the structural similarity (SSIM) maps indicating distortion. The rate information is predicted through a CNN with fully connected layers as well. When compared to traditional CNN methods, the proposed mappings to the distortion or rate information. The experiments demonstrate the feasibility of our CNN-based framework for rate-distortion modeling.
| Original language | English |
|---|---|
| Title of host publication | IEEE Visual Communications and Image Processing, VCIP 2017 |
| Publisher | IEEE |
| Pages | 1-4 |
| Number of pages | 4 |
| ISBN (Electronic) | 9781538604625 |
| ISBN (Print) | 9781538604632 |
| DOIs | |
| Publication status | Published - Dec 2017 |
| Externally published | Yes |
| Event | 2017 IEEE Visual Communications and Image Processing (VCIP 2017) - St. Petersburg, United States Duration: 10 Dec 2017 → 13 Dec 2017 |
Conference
| Conference | 2017 IEEE Visual Communications and Image Processing (VCIP 2017) |
|---|---|
| Country/Territory | United States |
| City | St. Petersburg |
| Period | 10/12/17 → 13/12/17 |
Bibliographical note
Publisher Copyright:© 2017 IEEE.
Funding
This work was supported in part by National Natural Science Foundation of China (No. 61471273), National Hightech R&D Program of China (863 Program, 2015AA015903), Natural Science Foundation of Hubei Province of China (No. 2015CFA053) and LIESMARS Special Research Funding.
Keywords
- convolutional neural network (CNN)
- deep learning
- H.265/HEVC
- Rate-distortion model
- structural similarity