Rapid acoustic model adaptation using inverse MLLR-based feature generation

Arata Ito, Sunao Hara, Norihide Kitaoka, Kazuya Takeda

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We propose a technique for generating a large amount of target speaker-like speech features by converting a large amount of prepared speech features of many speakers into features similar to those of the target speaker using a transformation matrix. To generate a large amount of target speaker-like features, the system only needs a very small amount of the target speaker's utterances. This technique enables the system to adapt the acoustic model efficiently from a small amount of the target speaker's utterances. To evaluate the proposed method, we prepared 100 reference speakers and 12 target (test) speakers. We conducted the experiments in an isolated word recognition task using a speech database collected by real PC-based distributed environments and compared our proposed method with MLLR, MAP and the method theoretically equivalent to the SAT. Experimental results proved that the proposed method needed a significantly smaller amount of the target speaker's utterances than conventional MLLR, MAP and SAT.

Original languageEnglish
Title of host publication20th International Congress on Acoustics 2010, ICA 2010 - Incorporating Proceedings of the 2010 Annual Conference of the Australian Acoustical Society
Pages3783-3788
Number of pages6
Volume5
Publication statusPublished - 2010
Externally publishedYes
Event20th International Congress on Acoustics 2010, ICA 2010 - Incorporating the 2010 Annual Conference of the Australian Acoustical Society - Sydney, NSW, Australia
Duration: Aug 23 2010Aug 27 2010

Other

Other20th International Congress on Acoustics 2010, ICA 2010 - Incorporating the 2010 Annual Conference of the Australian Acoustical Society
CountryAustralia
CitySydney, NSW
Period8/23/108/27/10

    Fingerprint

ASJC Scopus subject areas

  • Acoustics and Ultrasonics

Cite this

Ito, A., Hara, S., Kitaoka, N., & Takeda, K. (2010). Rapid acoustic model adaptation using inverse MLLR-based feature generation. In 20th International Congress on Acoustics 2010, ICA 2010 - Incorporating Proceedings of the 2010 Annual Conference of the Australian Acoustical Society (Vol. 5, pp. 3783-3788)