Abstract
Natural language inference (NLI) serves as a core downstream task for evaluating a model’s reasoning capabilities, and large language models (LLMs) have shown remarkable performance across such tasks. However, existing approaches often repurpose LLMs as feature extractors with discriminative classification heads, overlooking their inherently generative and autoregressive nature. To address this issue, we propose Self-Explanation-Augmented Generation (SEAG), a novel framework that reformulates NLI as a generative task to better align with the pretraining objectives of decoder-only LLMs. SEAG consists of two key components: (1) Generative Label Prediction, which prompts the model to directly generate textual labels as natural language tokens, and (2) Explanation-Augmented Attention, which introduces trainable [Exp] tokens that guide the model to construct internal reasoning pathways. This design activates the LLM’s latent reasoning capacity while improving interpretability. Evaluated on the SNLI dataset, SEAG outperforms two baselines, including the Bert-base and Llama-3.2-3B-Instruct variants. Extensive ablation studies confirm that both the generative label prediction and explanation-augmented attention are crucial to performance gains. Our results underscore the potential of generative approaches to not only match but surpass traditional classification strategies by embracing the intrinsic strengths of LLMs.
| Original language | English |
|---|---|
| Title of host publication | Neural Computing for Advanced Applications: 6th International Conference, NCAA 2025, Hong Kong, China, July 4–6, 2025, Proceedings, Part II |
| Editors | Haijun ZHANG, Kim Fung TSANG, Fu Lee WANG, Kevin HUNG, Tianyong HAO, Zenghui WANG, Zhou WU, Zhao ZHANG |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 88-99 |
| Number of pages | 12 |
| ISBN (Print) | 9789819537389 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 6th International Conference on Neural Computing for Advanced Applications, NCAA 2025 - Hong Kong, China Duration: 4 Jul 2025 → 6 Jul 2025 |
Publication series
| Name | Communications in Computer and Information Science |
|---|---|
| Publisher | Springer |
| Volume | 2665 |
| ISSN (Print) | 1865-0929 |
| ISSN (Electronic) | 1865-0937 |
| Name | International Conference on Neural Computing for Advanced Applications |
|---|---|
| Publisher | Springer |
| Volume | 2025 |
Conference
| Conference | 6th International Conference on Neural Computing for Advanced Applications, NCAA 2025 |
|---|---|
| Country/Territory | China |
| City | Hong Kong |
| Period | 4/07/25 → 6/07/25 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
Funding
This work has been supported by a grant from Hong Kong Metropolitan University (Project Reference No. CP/2022/02).
Keywords
- Explainable AI
- Generative Language Models
- LLM
- Natural Language Inference