A Topic Modelling-based Bibliometric Exploration of Automatic Summarization Research

Xieling CHEN, Haoran XIE*, Xiaohui TAO, Lingling XU, Jingjing WANG, Hong-Ning DAI, Fu Lee WANG

*Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

Abstract

The surge in text data has driven extensive research into developing diverse automatic summarization approaches to effectively handle vast textual information. There are several reviews on this topic, yet no large-scale analysis based on quantitative approaches has been conducted. To provide a comprehensive overview of the field, this study conducted a bibliometric analysis of 3108 papers published from 2010 to 2022, focusing on automatic summarization research regarding topics and trends, top sources, countries/regions, institutions, researchers, and scientific collaborations. We have identified the following trends. First, the number of papers has experienced 65% growth, with the majority being published in computer science conferences. Second, Asian countries and institutions, notably China and India, actively engage in this field and demonstrate a strong inclination toward inter-regional international collaboration, contributing to more than 24% and 20% of the output, respectively. Third, researchers show a high level of interest in multihead and attention mechanisms, graph-based semantic analysis, and topic modeling and clustering techniques, with each topic having a prevalence of over 10%. Finally, scholars have been increasingly interested in self-supervised and zero/few-shot learning, multihead and attention mechanisms, and temporal analysis and event detection. This study is valuable when it comes to enhancing scholars' and practitioners' understanding of the current hotspots and future directions in automatic summarization.
Original languageEnglish
JournalWiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
DOIs
Publication statusE-pub ahead of print - 25 Apr 2024

Bibliographical note

Publisher Copyright:
© 2024 The Authors. WIREs Data Mining and Knowledge Discovery published by Wiley Periodicals LLC.

Keywords

  • automatic summarization
  • text mining
  • topic modeling
  • trend analysis

Fingerprint

Dive into the research topics of 'A Topic Modelling-based Bibliometric Exploration of Automatic Summarization Research'. Together they form a unique fingerprint.

Cite this