Abstract
Deep learning-based approaches formulate acoustic echo cancellation (AEC) as a supervised speech separation task, where the mixture signal and the far-end signal are combined directly before or after the encoding stage. However, the mixture signal and the far-end signal are not integrated sufficiently due to the lack of interpretability for the affinity between speech and echo in a noisy mixture. In this paper, we propose DCA-Net, a dual-branch cross-attention neural network, to improve AEC performance by exploring the affinities between speech and echo in the representation space. In particular, the two branches predict speech and echo, respectively, and an interaction module is designed at several intermediate feature domains between the two branches to learn the correlations between these features of the two branches. Such an interaction can leverage features learned from one branch to restore missing information or counteract undesired information of the other by calculating the similarity between these features of two branches using multi-head cross attention. Evaluation results show that the proposed DCA-Net effectively suppresses acoustic echo and noise while preserving good speech quality.
| Original language | English |
|---|---|
| Title of host publication | 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024: Proceedings |
| Publisher | IEEE |
| Pages | 401-405 |
| Number of pages | 5 |
| ISBN (Electronic) | 9798350344851 |
| ISBN (Print) | 9798350344868 |
| DOIs | |
| Publication status | Published - 2024 |
| Externally published | Yes |
| Event | ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - Seoul, Korea, Republic of Duration: 14 Apr 2024 → 19 Apr 2024 |
Publication series
| Name | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings |
|---|---|
| ISSN (Print) | 1520-6149 |
Conference
| Conference | ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
|---|---|
| Country/Territory | Korea, Republic of |
| City | Seoul |
| Period | 14/04/24 → 19/04/24 |
Bibliographical note
Publisher Copyright:© 2024 IEEE.
Funding
This work was supported in part by the National Nature Science Foundation of China (No. 62071342, No.62171326), the Special Fund of Hubei Luojia Laboratory (No. 220100019), the Hubei Province Technological Innovation Major Project (No. 2021BAA034) and the Fundamental Research Funds for the Central Universities (No.2042023kf1033).
Keywords
- Acoustic echo cancellation
- dual-branch
- interaction module
- multi-head cross-attention
Fingerprint
Dive into the research topics of 'Improving Acoustic Echo Cancellation by Exploring Speech and Echo Affinity with Multi-Head Attention'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver