Abstract
This paper presents a new approach that uses the maximum model distance (MMD) method for the adaptation of Hidden Markov models (HMMs). This method has the same framework as it is used for constructing speech recognizers with abundant data, and work effectively with any amount of adaptation data. All parameters of the HMMs with or without the adaptation data could be adapted. If the adaptation data is sufficient, then the adapted models will gradually become a speaker-dependent one. Both the dialect and the speaker adaptation experiments were conducted to investigate the effectiveness of the proposed algorithm. In the speaker adaptation experiments, up to 65.55% phoneme error reduction was achieved, and the MMD could reduce the phoneme error by 16.91% even only one adaptation utterance is available.
Original language | English |
---|---|
Pages (from-to) | 270-276 |
Number of pages | 7 |
Journal | IEEE Transactions on Systems, Man, and Cybernetics Part A:Systems and Humans. |
Volume | 34 |
Issue number | 2 |
Early online date | 26 Feb 2004 |
DOIs |
|
Publication status | Published - Mar 2004 |
Externally published | Yes |
Bibliographical note
Funding Information:This work was supported in part by the City University of Hong Kong under Grant 7001488.
Keywords
- Hidden Markov model
- Maximum model distance
- Speaker adaptation