Abstract
This paper proposes syllable-based F0 units(SBUs) for F0 contour generation and a two-stage strategy. The two-stage strategy provides a flexible F0 generation framework by introducing a global model and local model. The local model consists of the SBUs which make it possible to precisely estimate F0 contour using segmental information. Experimental results show that the proposed approach can generate a good global model(the measured multiple correlation coefficient is 0.843), and can precisely estimate average F0(the measured multiple correlation coefficient is 0.875). It is also confirmed that generating SBUs according to syllable positions is important in precisely estimating F0 contour. Listening tests show that speech synthesized with the proposed model is preferred to the output of the conventional model. We expect that the approach will prove to be useful and powerful for synthesizing various types of speech.
Original language | English |
---|---|
Title of host publication | ICASSP 1992 - 1992 International Conference on Acoustics, Speech, and Signal Processing |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 53-56 |
Number of pages | 4 |
Volume | 2 |
ISBN (Electronic) | 0780305329 |
DOIs | |
Publication status | Published - 1992 |
Externally published | Yes |
Event | 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992 - San Francisco, United States Duration: Mar 23 1992 → Mar 26 1992 |
Other
Other | 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992 |
---|---|
Country/Territory | United States |
City | San Francisco |
Period | 3/23/92 → 3/26/92 |
ASJC Scopus subject areas
- Software
- Signal Processing
- Electrical and Electronic Engineering