Machine Learning for Actionable Warning Identification: A Comprehensive Survey

  • Xiuting GE
  • , Chunrong FANG*
  • , Xuanye LI
  • , Weisong SUN
  • , Daoyuan WU
  • , Juan ZHAI
  • , Shangwei LIN
  • , Zhihong ZHAO
  • , Yang LIU
  • , Zhenyu CHEN*
  • *Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

Abstract

Actionable Warning Identification (AWI) plays a crucial role in improving the usability of static code analyzers. With recent advances in Machine Learning (ML), various approaches have been proposed to incorporate ML techniques into AWI. These ML-based AWI approaches, benefiting from ML's strong ability to learn subtle and previously unseen patterns from historical data, have demonstrated superior performance. However, a comprehensive overview of these approaches is missing, which could hinder researchers and practitioners from understanding the current process and discovering potential for future improvement in the ML-based AWI community. In this article, we systematically review the state-of-the-art ML-based AWI approaches. First, we employ a meticulous survey methodology and gather 51 primary studies from January 1, 2000 to January 9, 2023. Then, we outline a typical ML-based AWI workflow, including warning dataset preparation, preprocessing, AWI model construction, and evaluation stages. In such a workflow, we categorize ML-based AWI approaches based on the warning output format. In addition, we analyze the key techniques used in each stage, along with their strengths, weaknesses, and distribution. Finally, we provide practical research directions for future ML-based AWI approaches, focusing on aspects such as data improvement (e.g., enhancing the warning labeling strategy) and model exploration (e.g., exploring large language models for AWI).

Original languageEnglish
Article number39
Pages (from-to)1-35
Number of pages35
JournalACM Computing Surveys
Volume57
Issue number2
Early online date11 Oct 2024
DOIs
Publication statusPublished - Feb 2025
Externally publishedYes

Bibliographical note

Acknowledgements:
We would like to thank the anonymous reviewers for their insightful comments.

Publisher Copyright:
© 2024 Copyright held by the owner/author(s). Publication rights licensed to ACM.

Funding

This work is partly supported by the National Natural Science Foundation of China (grant nos. 61932012, 62372228, and 62141215) and the Program of the China Scholarship Council (grant no. 202306190140).

Keywords

  • Static analysis warnings
  • actionable warning identification
  • literature review

Fingerprint

Dive into the research topics of 'Machine Learning for Actionable Warning Identification: A Comprehensive Survey'. Together they form a unique fingerprint.

Cite this