Self-Explanation-Augmented Generation for Natural Language Inference

  • Zijian WANG
  • , Zongxi LI*
  • , Kevin HUNG
  • , Weiming WANG
  • , Siu-Kei AU YEUNG
  • *Corresponding author for this work

Research output: Book Chapters | Papers in Conference ProceedingsConference paper (refereed)Researchpeer-review

Abstract

Natural language inference (NLI) serves as a core downstream task for evaluating a model’s reasoning capabilities, and large language models (LLMs) have shown remarkable performance across such tasks. However, existing approaches often repurpose LLMs as feature extractors with discriminative classification heads, overlooking their inherently generative and autoregressive nature. To address this issue, we propose Self-Explanation-Augmented Generation (SEAG), a novel framework that reformulates NLI as a generative task to better align with the pretraining objectives of decoder-only LLMs. SEAG consists of two key components: (1) Generative Label Prediction, which prompts the model to directly generate textual labels as natural language tokens, and (2) Explanation-Augmented Attention, which introduces trainable [Exp] tokens that guide the model to construct internal reasoning pathways. This design activates the LLM’s latent reasoning capacity while improving interpretability. Evaluated on the SNLI dataset, SEAG outperforms two baselines, including the Bert-base and Llama-3.2-3B-Instruct variants. Extensive ablation studies confirm that both the generative label prediction and explanation-augmented attention are crucial to performance gains. Our results underscore the potential of generative approaches to not only match but surpass traditional classification strategies by embracing the intrinsic strengths of LLMs.

Original languageEnglish
Title of host publicationNeural Computing for Advanced Applications: 6th International Conference, NCAA 2025, Hong Kong, China, July 4–6, 2025, Proceedings, Part II
EditorsHaijun ZHANG, Kim Fung TSANG, Fu Lee WANG, Kevin HUNG, Tianyong HAO, Zenghui WANG, Zhou WU, Zhao ZHANG
PublisherSpringer Science and Business Media Deutschland GmbH
Pages88-99
Number of pages12
ISBN (Print)9789819537389
DOIs
Publication statusPublished - 2025
Event6th International Conference on Neural Computing for Advanced Applications, NCAA 2025 - Hong Kong, China
Duration: 4 Jul 20256 Jul 2025

Publication series

NameCommunications in Computer and Information Science
PublisherSpringer
Volume2665
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937
NameInternational Conference on Neural Computing for Advanced Applications
PublisherSpringer
Volume2025

Conference

Conference6th International Conference on Neural Computing for Advanced Applications, NCAA 2025
Country/TerritoryChina
CityHong Kong
Period4/07/256/07/25

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

Funding

This work has been supported by a grant from Hong Kong Metropolitan University (Project Reference No. CP/2022/02).

Keywords

  • Explainable AI
  • Generative Language Models
  • LLM
  • Natural Language Inference

Fingerprint

Dive into the research topics of 'Self-Explanation-Augmented Generation for Natural Language Inference'. Together they form a unique fingerprint.

Cite this