Cross-language voice conversion

Masanobu Abe, Kiyohiro Shikano, Hisao Kuwabara

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

11 Citations (Scopus)

Abstract

First, the part of spectral difference that is due to the difference in language is assessed. This is investigated using a bilingual speaker's speech data. It is found that the interlanguage (between English and Japanese) difference is smaller than the interspeaker difference. Listening tests indicate that the difference between English and Japanese is very small. Second, a model for cross-language voice conversion is described. In this approach, voice conversion is considered a mapping problem between two speakers' spectrum spaces. The spectrum spaces are represented by codebooks. From this point of view, a cross-language voice conversion model and measures for the model are proposed. The converted speech from male to female is as understandable as the unconverted speech and, moreover, it is recognized as female speech.
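The abstract frames conversion as a mapping between two speakers' spectrum spaces, each represented by a codebook. The following is a minimal sketch of that general idea, not the authors' implementation: the function names (`nearest`, `build_mapping`, `convert`), the squared Euclidean distance, the histogram-weighted averaging, and the assumption that aligned source/target frame pairs are available are all illustrative choices that the abstract does not specify.

```python
from collections import defaultdict


def nearest(codebook, vec):
    """Return the index of the codeword closest to vec (squared Euclidean)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], vec)),
    )


def build_mapping(src_cb, tgt_cb, aligned_pairs):
    """Map each source codeword to a weighted average of target codewords.

    aligned_pairs is a hypothetical list of (source_frame, target_frame)
    spectral vectors assumed to correspond to each other.
    """
    # counts[i][j] = how often source codeword i co-occurs with target codeword j
    counts = defaultdict(lambda: defaultdict(int))
    for src_vec, tgt_vec in aligned_pairs:
        counts[nearest(src_cb, src_vec)][nearest(tgt_cb, tgt_vec)] += 1

    dim = len(tgt_cb[0])
    mapping = {}
    for i, hist in counts.items():
        total = sum(hist.values())
        mapping[i] = [
            sum(n * tgt_cb[j][d] for j, n in hist.items()) / total
            for d in range(dim)
        ]
    return mapping


def convert(src_cb, mapping, frames):
    """Quantize each source frame and replace it with its mapped vector.

    Source codewords never seen during training fall back to themselves.
    """
    converted = []
    for frame in frames:
        i = nearest(src_cb, frame)
        converted.append(mapping.get(i, src_cb[i]))
    return converted


# Toy two-dimensional "spectra" for illustration only.
src_cb = [[0.0, 0.0], [1.0, 1.0]]
tgt_cb = [[10.0, 10.0], [20.0, 20.0]]
pairs = [([0.1, 0.0], [10.5, 10.0]), ([0.9, 1.0], [19.0, 20.0])]
mapping = build_mapping(src_cb, tgt_cb, pairs)
result = convert(src_cb, mapping, [[0.2, 0.1], [0.8, 0.9]])
```

In this toy run each source frame is replaced by the target-speaker vector associated with its nearest source codeword. A real system would operate on spectral parameter vectors and would also need a way to obtain the correspondence between speakers, which the abstract does not detail.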

Original language: English
Title of host publication: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Publisher: Publ by IEEE
Pages: 345-348
Number of pages: 4
Volume: 1
Publication status: Published - 1990
Externally published: Yes
Event: 1990 International Conference on Acoustics, Speech, and Signal Processing: Speech Processing 2, VLSI, Audio and Electroacoustics Part 2 (of 5) - Albuquerque, New Mexico, USA
Duration: Apr 3 1990 to Apr 6 1990

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Acoustics and Ultrasonics

Cite this

Abe, M., Shikano, K., & Kuwabara, H. (1990). Cross-language voice conversion. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. 1, pp. 345-348). Publ by IEEE.

@inproceedings{4334ab13ac4545d58da2684b6559fa30,
title = "Cross-language voice conversion",
abstract = "First, the part of spectral difference that is due to the difference in language is assessed. This is investigated using a bilingual speaker's speech data. It is found that the interlanguage (between English and Japanese) difference is smaller than the interspeaker difference. Listening tests indicate that the difference between English and Japanese is very small. Second, a model for cross-language voice conversion is described. In this approach, voice conversion is considered a mapping problem between two speakers' spectrum spaces. The spectrum spaces are represented by codebooks. From this point of view, a cross-language voice conversion model and measures for the model are proposed. The converted speech from male to female is as understandable as the unconverted speech and, moreover, it is recognized as female speech.",
author = "Masanobu Abe and Kiyohiro Shikano and Hisao Kuwabara",
year = "1990",
language = "English",
volume = "1",
pages = "345--348",
booktitle = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",
publisher = "Publ by IEEE",
}

TY  - GEN
T1  - Cross-language voice conversion
AU  - Abe, Masanobu
AU  - Shikano, Kiyohiro
AU  - Kuwabara, Hisao
PY  - 1990
Y1  - 1990
AB  - First, the part of spectral difference that is due to the difference in language is assessed. This is investigated using a bilingual speaker's speech data. It is found that the interlanguage (between English and Japanese) difference is smaller than the interspeaker difference. Listening tests indicate that the difference between English and Japanese is very small. Second, a model for cross-language voice conversion is described. In this approach, voice conversion is considered a mapping problem between two speakers' spectrum spaces. The spectrum spaces are represented by codebooks. From this point of view, a cross-language voice conversion model and measures for the model are proposed. The converted speech from male to female is as understandable as the unconverted speech and, moreover, it is recognized as female speech.
UR  - http://www.scopus.com/inward/record.url?scp=0025590356&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=0025590356&partnerID=8YFLogxK
M3  - Conference contribution
VL  - 1
SP  - 345
EP  - 348
BT  - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
PB  - Publ by IEEE
ER  -