Abstract
This paper proposes a new application of speech modification called `speech morphing'. In image processing, morphing is a well known technique that gradually changes one person's face to that of someone else. Speech morphing produces similar results for speech; i.e., one person's speech is gradually changed to that of someone else. Speech morphing makes it possible to create movies or multi-media entertainment together with image morphing. The proposed algorithm pitch-synchronously modifies fundamental frequency(F0) and DFT spectrum and outputs high quality speech. To clarify the balance of F0 modification and spectrum modification, listening tests were carried out using 20 male speakers. The results yielded the relationship between the amount of modification and speaker identity. In terms of overall performance, listening tests show that the proposed algorithm successfully generates smooth, high quality voice changes.
Original language | English |
---|---|
Pages | 2235-2238 |
Number of pages | 4 |
Publication status | Published - Dec 1 1996 |
Externally published | Yes |
Event | Proceedings of the 1996 International Conference on Spoken Language Processing, ICSLP. Part 1 (of 4) - Philadelphia, PA, USA Duration: Oct 3 1996 → Oct 6 1996 |
Other
Other | Proceedings of the 1996 International Conference on Spoken Language Processing, ICSLP. Part 1 (of 4) |
---|---|
City | Philadelphia, PA, USA |
Period | 10/3/96 → 10/6/96 |
ASJC Scopus subject areas
- Computer Science(all)