Evolutionary weight tuning based on diphone pairsfor unit selection speech synthesis
Autor/a
Alías Pujol, Francesc
Llorà Fàbrega, Xavier
Altres autors/es
Universitat Ramon Llull. La Salle
Illinois Genetic Algorithms Lab
University of Illinois. National Center for Supercomputing Applications
Data de publicació
2003-09-01Resum
Unit selection text-to-speech (TTS) conversion is an ongoingresearch for the speech synthesis community. This paper isfocused on tuning the weights involved in the target and con-catenation cost metrics. We propose a method for automati-cally adjusting these weights simultaneously by means of di-phone and triphone pairs. This method is based on techniquesprovided by the evolutionary computation community, takingadvantage of their robustness in noisy domains. The experi-ments and their analyses demonstrate its good performance inthis problem, thus, overcoming some constraints assumed byprevious works and leading to a new interesting framework forfurther investigations.
Tipus de document
Article
Versió publicada
Llengua
English
Paraules clau
Processament de la parla
Parla
Pàgines
4 p.
Publicat per
8th European Conference on Speech Communication and Technology. EUROSPEECH 2003 - INTERSPEECH 2003
Número de l'acord de la subvenció
info:eu-repo/grantAgreement/DURSI/FI/2000 FI-00679
Aquest element apareix en la col·lecció o col·leccions següent(s)
Drets
© ISCA. Tots els drets reservats