Skip to main navigation Skip to search Skip to main content

Quality Assessment for Text-to-Image Generation: A Survey

Research output: Journal PublicationsJournal Article (refereed)peer-review

Abstract

In recent years, there have been notable advancements in text-to-image generation facilitated by artificial intelligence (AI) technology. Text-to-image generation requires higher-level cognitive abilities, posing unique challenges for image quality assessment typically designed for professionally generated content and user-generated content. Existing works have extensively investigated quality assessment from subjective and objective perspectives, covering a range of evaluation dimensions such as text–image alignment, perception, esthetics, fairness, and toxicity. This article provides a comprehensive overview of recent advancements in image quality assessment for text-to-image generation. In particular, we review existing quality assessment studies from subjective and objective perspectives, highlighting representative datasets and objective metrics for assessing different aspects of AI-generated image quality. Additionally, we discuss the limitations of current research and propose future directions.

Original languageEnglish
Pages (from-to)44-52
Number of pages9
JournalIEEE Multimedia
Volume32
Issue number2
DOIs
Publication statusPublished - 6 Feb 2025

Bibliographical note

Publisher Copyright:
© 1994-2012 IEEE.

Funding

This work was supported in part by the Science and Technology Innovation 2030 Key Project under Grant 2018AAA0101301, in part by the Hong Kong Innovation and Technology Commission (InnoHK Project CIM-DA), and in part by the Hong Kong General Research Fund under Grants 11209819, 11203820, 11200323, and 11203220. This article has supplementary down-loadable material available at https://doi.org/10.1109/MMUL.2025.3538862, provided by the authors.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 9 - Industry, Innovation, and Infrastructure
    SDG 9 Industry, Innovation, and Infrastructure

Fingerprint

Dive into the research topics of 'Quality Assessment for Text-to-Image Generation: A Survey'. Together they form a unique fingerprint.

Cite this