TY - GEN
T1 - A segment-based approach to voice conversion
AU - Abe, Masanobu
PY - 1991/12/1
Y1 - 1991/12/1
N2 - A voice conversion algorithm that uses speech segments as conversion units is proposed. Input speech is decomposed into speech segments by a speech recognition module, and the segments are replaced by speech segments uttered by another speaker. This algorithm makes it possible to convert not only the static characteristics but also the dynamic characteristics of speaker individuality. The proposed voice conversion algorithm was used with two male speakers. Spectrum distortion between target speech and the converted speech was reduced to one-third the natural spectrum distortion between the two speakers. A listening experiment showed that, in terms of speaker identification accuracy, the speech converted by segment-sized units gave a score 20% higher than the speech converted frame-by-frame.
UR - http://www.scopus.com/inward/record.url?scp=0026369941&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0026369941&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:0026369941
SN - 078030033
T3 - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
SP - 765
EP - 768
BT - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
PB - IEEE
T2 - Proceedings of the 1991 International Conference on Acoustics, Speech, and Signal Processing - ICASSP 91
Y2 - 14 May 1991 through 17 May 1991
ER -