Abstract
This paper presents a new approach that uses the maximum model distance (MMD) method for the adaptation of Hidden Markov models (HMMs). This method has the same framework as it is used for constructing speech recognizers with abundant data, and work effectively with any amount of adaptation data. All parameters of the HMMs with or without the adaptation data could be adapted. If the adaptation data is sufficient, then the adapted models will gradually become a speaker-dependent one. Both the dialect and the speaker adaptation experiments were conducted to investigate the effectiveness of the proposed algorithm. In the speaker adaptation experiments, up to 65.55% phoneme error reduction was achieved, and the MMD could reduce the phoneme error by 16.91% even only one adaptation utterance is available.
Original language | English |
---|---|
Pages (from-to) | 270-276 |
Number of pages | 7 |
Journal | IEEE Transactions on Systems, Man, and Cybernetics Part A:Systems and Humans. |
Volume | 34 |
Issue number | 2 |
Early online date | 26 Feb 2004 |
DOIs |
|
Publication status | Published - Mar 2004 |
Externally published | Yes |
Funding
This work was supported in part by the City University of Hong Kong under Grant 7001488. This paper was recommended by Associate Editor D. Zhang. Q. H. He is with the South China University of Technology, Guangzhou 510641 China. S. Kwong and Q. Y. Hong are with the City University of Hong Kong, Hong Kong (e-mail: [email protected]). Digital Object Identifier 10.1109/TSMCA.2003.818884
Keywords
- Hidden Markov model
- Maximum model distance
- Speaker adaptation