The use of adaptive frame for speech recognition

Sam KWONG*, Qianhua HE

*Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

3 Citations (Scopus)

Abstract

We propose an adaptive frame speech analysis scheme through dividing speech signal into stationary and dynamic region. Long frame analysis is used for stationary speech, and short frame analysis for dynamic speech. For computation convenience, the feature vector of short frame is designed to be identical to that of long frame. Two expressions are derived to represent the feature vector of short frames. Word recognition experiments on the TIMIT and NON-TIMIT with discrete Hidden Markov Model (HMM) and continuous density HMM showed that steady performance improvement could be achieved for open set testing. On the TIMIT database, adaptive frame length approach (AFL) reduces the error reduction rates from 4.47% to 11.21% and 4.54% to 9.58% for DHMM and CHMM, respectively. In the NON-TIMIT database, AFL also can reduce the error reduction rates from 1.91% to 11.55% and 2.63% to 9.5% for discrete hidden Markov model (DHMM) and continuous HMM (CHMM), respectively. These results proved the effectiveness of our proposed adaptive frame length feature extraction scheme especially for the open testing. In fact, this is a practical measurement for evaluating the performance of a speech recognition system.

Original languageEnglish
Article number281836
Pages (from-to)82-88
Number of pages7
JournalEurasip Journal on Applied Signal Processing
Volume2001
Issue number2
DOIs
Publication statusPublished - Jun 2001
Externally publishedYes

Keywords

  • Adaptive frame
  • Signal analysis
  • Speech coding
  • Speech recognition

Fingerprint

Dive into the research topics of 'The use of adaptive frame for speech recognition'. Together they form a unique fingerprint.

Cite this