A Hybrid Method Oriented to Concatenative Text-to-Speech Synthesis
Visualitza/Obre
Autor/a
Iriondo Sanz, Ignasi
Alías Pujol, Francesc
Sanchís Bernabeu, Francisco Javier
Melenchón Maldonado, Javier
Altres autors/es
Universitat Ramon Llull. La Salle
Data de publicació
2003-09-01Resum
In this paper we present a speech synthesis method for diphonebased text-to-speech systems. Its main goal is to achieve
prosodic modifications that result in more natural-sounding synthetic speech. This improvement is especially useful for emotional speech synthesis, which requires high-quality prosodic modification. We present a hybrid method based on TD-PSOLA and the harmonic plus noise model, which incorporates a novel method to jointly modify pitch and time-scale. Preliminary results show an improvement in the synthetic speech quality when high pitch modification is required.
Tipus de document
Article
Versió publicada
Llengua
English
Paraules clau
Parla
Processament de la parla
Pàgines
4 p.
Publicat per
8th European Conference on Speech Communication and Technology. EUROSPEECH 2003 - INTERSPEECH 2003
Número de l'acord de la subvenció
info:eu-repo/grantAgreement/DURSI/FI/2000 FI-00679
Aquest element apareix en la col·lecció o col·leccions següent(s)
Drets
© ISCA. Tots els drets reservats