Abstract
This paper explores computational approaches for detecting parallelism in classical Chinese poetry, a rhetorical device where two verses mirror each other in syntax, meaning, tone, and rhythm. We experiment with five classification methods: (1) verb position matching, (2) integrated semantic, syntactic, and word-segmentation analysis, (3) difference-based character embeddings, (4) structured examples (inner/outer couplets), and (5) GPT-guided classification. We use a manually annotated dataset, containing 6,125 pentasyllabic couplets, to evaluate performance. The results indicate that parallelism detection poses a significant challenge even for powerful LLMs such as GPT-4o, with the highest F1 score below 0.72. Nevertheless, each method contributes valuable insights into the art of parallelism in Chinese poetry, suggesting a new understanding of parallelism as a verbal expression of principal components in a culturally defined vector space.
Original language | English |
---|---|
Title of host publication | Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities |
Editors | Mika HÄMÄLÄINEN, Emily ÖHMAN, So MIYAGAWA, Khalid ALNAJJAR, Yuri BIZZONI |
Place of Publication | Miami |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 200-208 |
Number of pages | 9 |
ISBN (Print) | 9798891761810 |
Publication status | Published - Nov 2024 |
Event | The 4th International Conference on Natural Language Processing for Digital Humanities - Miami, United States Duration: 16 Nov 2024 → 16 Nov 2024 |
Conference
Conference | The 4th International Conference on Natural Language Processing for Digital Humanities |
---|---|
Abbreviated title | NLP4DH 2024 |
Country/Territory | United States |
City | Miami |
Period | 16/11/24 → 16/11/24 |