Abstract
In this paper, we propose an effective approach to infer the just noticeable distortion (JND) profile based on patch-level structural visibility learning. Instead of pixel-level JND profile estimation, the image patch, which is regarded as the basic processing unit to better correlate with the human perception, can be further decomposed into three conceptually independent components for visibility estimation. In particular, to incorporate the structural degradation into the patch-level JND model, a deep learning-based structural degradation estimation model is trained to approximate the masking of structural visibility. In order to facilitate the learning process, a JND dataset is further established, including 202 pristine images and 7878 distorted images generated by advanced compression algorithms based on the upcoming Versatile Video Coding (VVC) standard. Extensive experimental results further show the superiority of the proposed approach over the state-of-the-art. Our dataset is available at: https://github.com/ShenXuelin-CityU/PWJNDInfer.
Original language | English |
---|---|
Pages (from-to) | 26-38 |
Number of pages | 13 |
Journal | IEEE Transactions on Image Processing |
Volume | 30 |
Early online date | 3 Nov 2020 |
DOIs | |
Publication status | Published - 2021 |
Externally published | Yes |
Bibliographical note
The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Sérgio De Faria. (Corresponding authors: Shiqi Wang; Sam Kwong.) Xuelin Shen, Zhangkai Ni, Wenhan Yang, and Shiqi Wang are with the Department of Computer Science, City University of Hong Kong, Hong Kong (e-mail: [email protected]; [email protected]; [email protected]; [email protected]).Funding
This work was supported in part by the Hong Kong Research Grants Council (RGC) General Research Funds under Grant 9042816 (CityU 11209819), Grant 9042958 (CityU 11203820), and Grant 9042957 (CityU 11203220), and in part by the Hong Kong Research Grants Council (RGC) Early Career Scheme under Grant 9048122 (CityU 21211018).
Keywords
- Just noticeable distortion
- deep neural network
- perceptual video coding
- visual perception