A Hybrid Method Oriented to Concatenative Text-to-Speech Synthesis
Ver/Abrir
Otros/as autores/as
Fecha de publicación
2003-09-01Resumen
In this paper we present a speech synthesis method for diphonebased text-to-speech systems. Its main goal is to achieve
prosodic modifications that result in more natural-sounding synthetic speech. This improvement is especially useful for emotional speech synthesis, which requires high-quality prosodic modification. We present a hybrid method based on TD-PSOLA and the harmonic plus noise model, which incorporates a novel method to jointly modify pitch and time-scale. Preliminary results show an improvement in the synthetic speech quality when high pitch modification is required.
Tipo de documento
Artículo
Versión publicada
Lengua
Inglés
Palabras clave
Páginas
4 p.
Publicado por
8th European Conference on Speech Communication and Technology. EUROSPEECH 2003 - INTERSPEECH 2003
Número del acuerdo de la subvención
info:eu-repo/grantAgreement/DURSI/FI/2000 FI-00679
Este ítem aparece en la(s) siguiente(s) colección(ones)
Derechos
© ISCA. Tots els drets reservats