Monotonic and Invertible Network : A General Framework for Learning IQA Model from Mixed Datasets

  • Baoliang CHEN
  • , Kang XIAO
  • , Xuelin SHEN
  • , Shiqi WANG

Research output: Journal PublicationsJournal Article (refereed)peer-review

Abstract

Learning from mixed datasets is a powerful strategy for model generalization improvement. However, this approach becomes particularly challenging for image quality assessment (IQA) where the quality annotations across different datasets are usually not aligned, due to varying quality criteria, score ranges, and viewing conditions involved in their subjective tests. Score rescaling and rank learning are two main strategies for the mixed dataset IQA model learning. In particular, the score rescaling attempts to align the annotations across datasets directly by empirical linear or nonlinear transformations, which may not always be reliable. Rank learning, on the other hand, is restricted to pair comparison within each dataset while the image pairs from different datasets are not fully examined. In this paper, we present a novel mixed dataset learning framework for the IQA, where we align the quality annotation in an implicit manner. Specifically, we first introduce a dataset-shared quality regressor to project images across datasets into a unified proxy score space. We force the proxy scores to serve as proxies of the aligned results of the annotations. To achieve this, two key priors are considered: 1) within each dataset, the proxy scores should maintain the same rank as the annotations; and 2) the score ranges of proxy scores from different datasets overlap when images of similar quality exist in these datasets. To meet the criteria, we propose a monotonic and invertible network as a dataset-specific score mapper to bridge the proxy scores and the annotations with the rank consistency and range intersection established. This constrained learning ultimately results in the proxy scores being the desired alignment results of the annotations. Experiments on extensive IQA datasets validate the effectiveness of our method, and the continuous performance gains when incorporating our learning strategy into different network architectures demonstrate the high generalizability of our framework. The source code is available at https://github.com/KANGX99/MIMI.
Original languageEnglish
Pages (from-to)7924-7945
Number of pages22
JournalInternational Journal of Computer Vision
Volume133
Issue number11
Early online date18 Aug 2025
DOIs
Publication statusPublished - Nov 2025

Bibliographical note

B. Chen and K. Xiao: These authors contributed equally to this work.

Publisher Copyright:
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2025.

Funding

This work was supported in part by the National Natural Science Foundation of China under Grant 62401214, the RGC General Research Fund 11200323, and the CityUHK Applied Research Grant 9667255.

Keywords

  • Image quality assessment
  • Invertible neural network
  • Mixed dataset
  • Monotonic network

Fingerprint

Dive into the research topics of 'Monotonic and Invertible Network : A General Framework for Learning IQA Model from Mixed Datasets'. Together they form a unique fingerprint.

Cite this