Projects per year
Abstract
Existing deep learning-based full-reference IQA (FR-IQA) models usually predict the image quality in a deterministic way by explicitly comparing the features, gauging how severely distorted an image is by how far the corresponding feature lies from the space of the reference images. Herein, we look at this problem from a different viewpoint and propose to model the quality degradation in perceptual space from a statistical distribution perspective. As such, the quality is measured based upon the Wasserstein distance in the deep feature domain. More specifically, the 1D Wasserstein distance at each stage of the pre-trained VGG network is measured, based on which the final quality score is performed. The deep Wasserstein distance (DeepWSD) performed on features from neural networks enjoys better interpretability of the quality contamination caused by various types of distortions and presents an advanced quality prediction capability. Extensive experiments and theoretical analysis show the superiority of the proposed DeepWSD in terms of both quality prediction and optimization. The implementation of our method is publicly available at https://github.com/Buka-Xing/DeepWSD.
Original language | English |
---|---|
Title of host publication | Proceedings of the 30th ACM International Conference on Multimedia |
Publisher | Association for Computing Machinery |
Pages | 970-978 |
ISBN (Print) | 9781450392037 |
DOIs | |
Publication status | Published - Oct 2022 |
Externally published | Yes |
Event | The 30th ACM International Conference on Multimedia - Lisbon, Portugal Duration: 10 Oct 2022 → 14 Oct 2022 |
Conference
Conference | The 30th ACM International Conference on Multimedia |
---|---|
Country/Territory | Portugal |
City | Lisbon |
Period | 10/10/22 → 14/10/22 |
Bibliographical note
This work was supported in part by the Key Project of Science and Technology Innovation 2030 supported by the Ministry of Science and Technology of China (Grant No. 2018AAA0101301), the National Natural Science Foundation of China Grant 61672443, 62022002 and 62176027, in part by Hong Kong GRF - RGC General Research Fund 9042816 (CityU 11209819) and 9042958 (CityU 11203820). Also, this work is supported by the Hong Kong Innovation and Technology Commission (InnoHK Project CIMDA).Keywords
- full-reference IQA
- statistical model for image representation
- Wasserstein distance
Fingerprint
Dive into the research topics of 'DeepWSD : Projecting Degradations in Perceptual Space to Wasserstein Distance in Deep Feature Space'. Together they form a unique fingerprint.Projects
- 1 Active
-
Adaptive Dynamic Range Enhancement Oriented to High Dynamic Display (面向高動態顯示的自適應動態範圍增強)
KWONG, S. T. W. (PI), KUO, C.-C. J. (CoI), WANG, S. (CoI) & ZHANG, X. (CoI)
Research Grants Council (HKSAR)
1/01/21 → 31/12/24
Project: Grant Research