Research on the application of multi-modal human-computer interaction technology in phonetics based on the background of big data

Yanziye WEI*

*Corresponding author for this work

Research output: Other Conference Contributions › Conference Paper (other) › Other Conference Paper › peer-review

Abstract

According to McKinsey & Company, the world's leading consulting firm, which first proposed the era of "Big Data," "Data, which has permeated every industry and business function today, has become an important factor in production. The mining and use of vast amounts of data heralds a new wave of productivity growth and consumer surplus." "Big data" has existed for some time in fields such as physics, biology, and environmental ecology, as well as in the military, finance, and communications industries, but it has attracted attention in recent years because of the growth of the Internet and the information industry. Speech recognition technology, an important part of the human-computer interaction field, has advanced considerably in recent years. This paper examines the evolution of speech recognition and human-computer interaction, the problems encountered and how they were solved, the future scope of application, applications in various countries, and future research trends. As speech recognition technology has continued to develop, speech-based human-computer interaction design has made significant progress in various fields. From intelligent voice assistants to voice-controlled smart homes, speech recognition has become an important tool for improving user experience and increasing accessibility. The discussion is set against the algorithmic background of big data.
Original language: English
Pages: 405-410
Number of pages: 6
Publication status: Published - 2 Oct 2024

Bibliographical note

Publisher Copyright:
© 2024 ACM.

Keywords

  • Big Data Context
  • Multi-modal
  • Human-computer interaction
  • Technological convergence
