Abstract
As large language models (LLMs) become ubiquitous in public-facing services, millions of users engage in unauthenticated sessions under the assumption that ''no login'' implies ''no tracking.'' We challenge this assumption by formalizing Dialogue Attribution—the task of identifying the same user across disparate, unauthenticated human-LLM sessions, even under severe topic shifts. To rigorously quantify this threat, we introduce WildAuth, the first benchmark derived from real-world ChatGPT logs, and propose Uncertainty-aware Multi-aspect Attribution (UMA). UMA effectively links users by fusing complementary identity signals—content, stylometrics, interaction, and personality—via a novel uncertainty-aware mechanism that dynamically suppresses noise in ambiguous or short-text scenarios. Our approach consistently outperforms strong stylometric, PLM-based, and LLM-assisted baselines, achieving an AUC of 92.05% (F1 76.15%) in challenging cross-topic settings. More critically, we find that attribution remains highly effective (88.65% AUC) even when relying solely on the LLM's responses. These findings expose a fundamental privacy paradox: the very behavioral signatures that enable natural interaction--including the assistant's stylistic ''mirroring''--render anonymous sessions intrinsically linkable, necessitating a re-evaluation of privacy standards for AI infrastructure.
| Original language | English |
|---|---|
| Title of host publication | WWW 2026 - Proceedings of the ACM Web Conference 2026 |
| Editors | Hakim HACID, Yoelle MAAREK |
| Publisher | Association for Computing Machinery, Inc |
| Pages | 8797-8807 |
| Number of pages | 11 |
| ISBN (Electronic) | 9798400723070 |
| ISBN (Print) | 9798400723070 |
| DOIs | |
| Publication status | Published - 12 Apr 2026 |
| Event | ACM Web Conference 2026 - Dubai , United Arab Emirates Duration: 13 Apr 2026 → 17 Apr 2026 |
Conference
| Conference | ACM Web Conference 2026 |
|---|---|
| Abbreviated title | WWW '26 |
| Country/Territory | United Arab Emirates |
| City | Dubai |
| Period | 13/04/26 → 17/04/26 |
Bibliographical note
Publisher Copyright:© 2026 Owner/Author.
Funding
This work has benefited from the financial support of Lingnan University (ISRG252605) and National Natural Science Foundation of China under Grants 62372028.
Keywords
- Dialogue Attribution
- Human-LLM Interaction
- Privacy Risks
- User Modeling
- Large Language Models
- Behavioral Fingerprinting
- privacy risks
- human-llm interaction
- user modeling
- large language models
- dialogue attribution
- behavioral fingerprinting
Fingerprint
Dive into the research topics of 'The Chatbot Knows It's You: Dialogue Attribution in Unauthenticated Human–LLM Sessions'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver