A Hybrid Method Oriented to Concatenative Text-to-Speech Synthesis
Author
Iriondo Sanz, Ignasi
Alías Pujol, Francesc
Sanchís Bernabeu, Francisco Javier
Melenchón Maldonado, Javier
Other authors
Universitat Ramon Llull. La Salle
Publication date
2003-09-01
Abstract
In this paper we present a speech synthesis method for diphone-based text-to-speech systems. Its main goal is to achieve prosodic modifications that result in more natural-sounding synthetic speech. This improvement is especially useful for emotional speech synthesis, which requires high-quality prosodic modification. We present a hybrid method based on TD-PSOLA and the harmonic plus noise model, which incorporates a novel method to jointly modify pitch and time-scale. Preliminary results show an improvement in the synthetic speech quality when high pitch modification is required.
Document type
Article
Published version
Language
English
Keywords
Speech
Speech processing
Pages
4 p.
Published by
8th European Conference on Speech Communication and Technology. EUROSPEECH 2003 - INTERSPEECH 2003
Grant agreement number
info:eu-repo/grantAgreement/DURSI/FI/2000 FI-00679
Rights
© ISCA. All rights reserved