A Hybrid Method Oriented to Concatenative Text-to-Speech Synthesis

Iriondo, Ignasi; Alías-Pujol, Francesc; Sanchís Bernabeu, Francisco Javier; Melenchón, Javier; Iriondo, Ignasi; Alías-Pujol, Francesc; Sanchís Bernabeu, Francisco Javier; Melenchón, Javier

Fecha de publicación

2003-09-01

URI http://hdl.handle.net/20.500.14342/2892

Resumen

In this paper we present a speech synthesis method for diphonebased text-to-speech systems. Its main goal is to achieve prosodic modifications that result in more natural-sounding synthetic speech. This improvement is especially useful for emotional speech synthesis, which requires high-quality prosodic modification. We present a hybrid method based on TD-PSOLA and the harmonic plus noise model, which incorporates a novel method to jointly modify pitch and time-scale. Preliminary results show an improvement in the synthetic speech quality when high pitch modification is required.

Tipo de documento

Artículo

Versión publicada

Lengua

Inglés

Palabras clave

Parla

Processament de la parla

Páginas

4 p.

Publicado por

8th European Conference on Speech Communication and Technology. EUROSPEECH 2003 - INTERSPEECH 2003

Número del acuerdo de la subvención

info:eu-repo/grantAgreement/DURSI/FI/2000 FI-00679

Citación recomendada

Esta citación se ha generado automáticamente.

Mostrar el registro completo del ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)

Contribucions a congressos [221]