Waveform-based speech synthesis approach with a formant frequency modification

Hideyuki Mizuno, Masanobu Abe, Tomohisa Hirokawa

Research output: Chapter in Book/Report/Conference proceedingConference contribution

16 Citations (Scopus)

Abstract

This paper proposes a new approach to speech synthesis based on waveform segments. One novel point of this approach is its new formant frequency modification algorithm which makes it possible to flexibly change formant frequency and so reproduce the desired speech quality. The algorithm characterizes speech formants not only by formant frequencies and formant bandwidths, but also by spectral intensities of formant frequencies. The desirable formant structure, which is specified by the parameters, is obtained by iteratively modifying the formant bandwidths. Using the specified formant structure, the speech signal is synthesized by FFT. Evaluation by the acoustic distance measure, and by listening tests, confirms the good performance of the approach.

Original languageEnglish
Title of host publicationProceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
PublisherPubl by IEEE
Volume2
ISBN (Print)0780309464
Publication statusPublished - 1993
Externally publishedYes
Event1993 IEEE International Conference on Acoustics, Speech and Signal Processing - Minneapolis, MN, USA
Duration: Apr 27 1993Apr 30 1993

Other

Other1993 IEEE International Conference on Acoustics, Speech and Signal Processing
CityMinneapolis, MN, USA
Period4/27/934/30/93

Fingerprint

Speech synthesis
waveforms
synthesis
Bandwidth
bandwidth
Fast Fourier transforms
fast Fourier transformations
Acoustics
acoustics
evaluation

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Acoustics and Ultrasonics

Cite this

Mizuno, H., Abe, M., & Hirokawa, T. (1993). Waveform-based speech synthesis approach with a formant frequency modification. In Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing (Vol. 2). Publ by IEEE.

Waveform-based speech synthesis approach with a formant frequency modification. / Mizuno, Hideyuki; Abe, Masanobu; Hirokawa, Tomohisa.

Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing. Vol. 2 Publ by IEEE, 1993.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Mizuno, H, Abe, M & Hirokawa, T 1993, Waveform-based speech synthesis approach with a formant frequency modification. in Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing. vol. 2, Publ by IEEE, 1993 IEEE International Conference on Acoustics, Speech and Signal Processing, Minneapolis, MN, USA, 4/27/93.
Mizuno H, Abe M, Hirokawa T. Waveform-based speech synthesis approach with a formant frequency modification. In Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing. Vol. 2. Publ by IEEE. 1993
Mizuno, Hideyuki ; Abe, Masanobu ; Hirokawa, Tomohisa. / Waveform-based speech synthesis approach with a formant frequency modification. Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing. Vol. 2 Publ by IEEE, 1993.
@inproceedings{0e10131c826048caa1bd57e023791298,
title = "Waveform-based speech synthesis approach with a formant frequency modification",
abstract = "This paper proposes a new approach to speech synthesis based on waveform segments. One novel point of this approach is its new formant frequency modification algorithm which makes it possible to flexibly change formant frequency and so reproduce the desired speech quality. The algorithm characterizes speech formants not only by formant frequencies and formant bandwidths, but also by spectral intensities of formant frequencies. The desirable formant structure, which is specified by the parameters, is obtained by iteratively modifying the formant bandwidths. Using the specified formant structure, the speech signal is synthesized by FFT. Evaluation by the acoustic distance measure, and by listening tests, confirms the good performance of the approach.",
author = "Hideyuki Mizuno and Masanobu Abe and Tomohisa Hirokawa",
year = "1993",
language = "English",
isbn = "0780309464",
volume = "2",
booktitle = "Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing",
publisher = "Publ by IEEE",

}

TY - GEN

T1 - Waveform-based speech synthesis approach with a formant frequency modification

AU - Mizuno, Hideyuki

AU - Abe, Masanobu

AU - Hirokawa, Tomohisa

PY - 1993

Y1 - 1993

N2 - This paper proposes a new approach to speech synthesis based on waveform segments. One novel point of this approach is its new formant frequency modification algorithm which makes it possible to flexibly change formant frequency and so reproduce the desired speech quality. The algorithm characterizes speech formants not only by formant frequencies and formant bandwidths, but also by spectral intensities of formant frequencies. The desirable formant structure, which is specified by the parameters, is obtained by iteratively modifying the formant bandwidths. Using the specified formant structure, the speech signal is synthesized by FFT. Evaluation by the acoustic distance measure, and by listening tests, confirms the good performance of the approach.

AB - This paper proposes a new approach to speech synthesis based on waveform segments. One novel point of this approach is its new formant frequency modification algorithm which makes it possible to flexibly change formant frequency and so reproduce the desired speech quality. The algorithm characterizes speech formants not only by formant frequencies and formant bandwidths, but also by spectral intensities of formant frequencies. The desirable formant structure, which is specified by the parameters, is obtained by iteratively modifying the formant bandwidths. Using the specified formant structure, the speech signal is synthesized by FFT. Evaluation by the acoustic distance measure, and by listening tests, confirms the good performance of the approach.

UR - http://www.scopus.com/inward/record.url?scp=0027191578&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0027191578&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0027191578

SN - 0780309464

VL - 2

BT - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing

PB - Publ by IEEE

ER -