Abstract
This paper proposes a sub-band speech synthesis approach for developing high-quality Text-to-Speech (TTS). Hidden Markov Model (HMM)-based speech synthesis is used for the low-frequency band, and waveform-based speech synthesis is used for the high-frequency band. Both methods are widely known to perform well, and each has its own benefits and shortcomings. One motivation is to apply the right speech synthesis method in the right frequency band. Experimental results show that the proposed approach outperforms waveform-based speech synthesis in smoothness and outperforms HMM-based speech synthesis in clarity. Consequently, the proposed approach combines the inherent benefits of both waveform-based and HMM-based speech synthesis.
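The band assignment described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the bands are split or where the crossover lies, so the brick-wall spectral mask, the function names (`dft`, `inverse_dft_real`, `combine_subbands`), and the `crossover_hz` parameter are all assumptions made for illustration. The two inputs stand in for the outputs of an HMM-based synthesizer and a waveform-based synthesizer.

```python
import cmath
import math


def dft(signal):
    """Naive discrete Fourier transform (O(n^2)); adequate for short toy signals."""
    n = len(signal)
    return [
        sum(signal[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def inverse_dft_real(spectrum):
    """Naive inverse DFT that returns only the real part of each sample."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
        for t in range(n)
    ]


def combine_subbands(low_source, high_source, sample_rate, crossover_hz):
    """Take content below crossover_hz from low_source and the rest from high_source.

    Assumption: a hard (brick-wall) split in the frequency domain. The
    crossover frequency is a hypothetical parameter, not a value from the paper.
    """
    if len(low_source) != len(high_source):
        raise ValueError("both signals must have the same length")
    n = len(low_source)
    low_spec = dft(low_source)
    high_spec = dft(high_source)
    mixed = []
    for k in range(n):
        # Bins k and n - k carry the same physical frequency for a real signal,
        # so both are assigned to the same source to keep the output real.
        freq_hz = min(k, n - k) * sample_rate / n
        mixed.append(low_spec[k] if freq_hz < crossover_hz else high_spec[k])
    return inverse_dft_real(mixed)
```

Calling `combine_subbands(hmm_output, waveform_output, sample_rate, crossover_hz)` on two equal-length sample lists returns a signal whose low band comes from the first input and whose high band comes from the second. A practical system would more likely use an FFT library and filters with a smooth transition band rather than a hard cut.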
Original language | English |
---|---|
Title of host publication | 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (electronic) | 9786163618238 |
DOI | |
Publication status | Published - Feb 12 2014 |
Event | 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014 - Chiang Mai. Duration: Dec 9 2014 → Dec 12 2014 |
Other
Other | 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014 |
---|---|
国/地域 | Thailand |
City | Chiang Mai |
Period | 12/9/14 → 12/12/14 |
ASJC Scopus subject areas
- Signal Processing
- Information Systems