Abstract
Blind image quality measurement (BIQM) has achieved great progress due to the deployment of deep neural networks (DNNs) for training end-to-end models. Most of the existing DNN-based BIQM methods simply aggregate the local deep feature maps with a global max or average pooling layer to generate holistic feature vectors for quality prediction. However, such pooling strategy fails to capture the high-order statistics of local feature descriptors. Inspired by the success of dictionary encoding-based BIQM methods, this article proposes a deep dictionary encoding network (Deep-DEN) that can well capture the high-order statistics of local deep features in an end-to-end manner. In Deep-DEN, dictionary encoding is encapsulated into a single learnable layer attached to the end of a backbone network and followed by a fully connected layer for quality prediction. As a result, high-order statistics of the extracted local deep features in the backbone network and quality prediction functions are simultaneously optimized in a fully supervised manner. The performance of Deep-DEN has been extensively evaluated on several benchmarks and the superiority has been well validated by comparisons with other state-of-the-art BIQM methods.
Original language | English |
---|---|
Pages (from-to) | 7398-7410 |
Journal | IEEE Transactions on Instrumentation and Measurement |
Volume | 69 |
Issue number | 10 |
Early online date | 2 Apr 2020 |
DOIs | |
Publication status | Published - Oct 2020 |
Externally published | Yes |
Bibliographical note
This work was supported in part by the Natural Science Foundation of China under Grant 61901236, Grant 61622109, and Grant 61801303, in part by the Natural Science Foundation of Ningbo under Grant 2019A610097, in part by the Zhejiang Natural Science Foundation of China under Grant R18F010008, in part by the Guangdong Basic and Applied Basic Research Foundation under Grant 2019A1515012031, in part by the Shenzhen Science and Technology Plan Basic Research Project under Grant JCYJ20190808161805519, and in part by the National Science Foundation of Shenzhen University under Grant 860-000002110122. It was also sponsored by K. C. Wong Magna Fund in Ningbo University.Keywords
- Blind image quality measurement
- convolutional neural network
- deep learning
- dictionary encoding
- end-to-end