Skip to main navigation Skip to search Skip to main content

Improving Acoustic Echo Cancellation by Mixing Speech Local and Global Features with Transformer

  • Yajie LIU
  • , Xinmeng XU
  • , Weiping TU*
  • , Yuhong YANG
  • , Li XIAO
  • *Corresponding author for this work

Research output: Book Chapters | Papers in Conference ProceedingsConference paper (refereed)Researchpeer-review

Abstract

We propose MiT-Net, a novel mix-transformer neural network with a pyramid encoder operating in the time domain, for the task of acoustic echo cancellation. The MiT-Net formulates acoustic echo cancellation as a supervised speech separation problem, in which near-end speech is separated from a single microphone recording and sent to the far end, and consists of two key components. First, we apply a pyramid encoder, which adopts the coarse-to-fine structure, to extract the latent correlations between double-end signals and to fuse them in a multiscale manner. Second, we propose a mix-transformer, a combination of local and global attention in a parallel way, to leverage local and global speech information for separation. Experimental results show that the proposed method outperforms recent AEC methods in terms of objective evaluation metrics. In addition, exploring the correlation between speech local and global features by using the mix-transformer significantly improves the system performance and shows more robustness than the conventional transformer.
Original languageEnglish
Title of host publicationICASSP 2023: 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, Proceedings
PublisherIEEE
Number of pages5
ISBN (Electronic)9781728163277
DOIs
Publication statusPublished - 2023
Externally publishedYes
EventICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - Rhodes Island, Greece
Duration: 4 Jun 202310 Jun 2023

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume2023-June
ISSN (Print)1520-6149

Conference

ConferenceICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Country/TerritoryGreece
CityRhodes Island
Period4/06/2310/06/23

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

Funding

This work was supported in part by the National Nature Science Foundation of China (No. 62071342, No.62171326), the Special Fund of Hubei Luojia Laboratory (No. 220100019), the Hubei Province Technological Innovation Major Project (No. 2021BAA034) and the Fundamental Research Funds for the Central Universities (No. 2042022kf0001).

Keywords

  • Acoustic echo cancellation
  • local and global information
  • mix-transformer
  • multiscale fusion

Fingerprint

Dive into the research topics of 'Improving Acoustic Echo Cancellation by Mixing Speech Local and Global Features with Transformer'. Together they form a unique fingerprint.

Cite this