Evolutionary weight tuning based on diphone pairsfor unit selection speech synthesis
Author
Alías Pujol, Francesc
Llorà Fàbrega, Xavier
Other authors
Universitat Ramon Llull. La Salle
Illinois Genetic Algorithms Lab
University of Illinois. National Center for Supercomputing Applications
Publication date
2003-09-01Abstract
Unit selection text-to-speech (TTS) conversion is an ongoingresearch for the speech synthesis community. This paper isfocused on tuning the weights involved in the target and con-catenation cost metrics. We propose a method for automati-cally adjusting these weights simultaneously by means of di-phone and triphone pairs. This method is based on techniquesprovided by the evolutionary computation community, takingadvantage of their robustness in noisy domains. The experi-ments and their analyses demonstrate its good performance inthis problem, thus, overcoming some constraints assumed byprevious works and leading to a new interesting framework forfurther investigations.
Document Type
Article
Published version
Language
English
Keywords
Processament de la parla
Parla
Pages
4 p.
Publisher
8th European Conference on Speech Communication and Technology. EUROSPEECH 2003 - INTERSPEECH 2003
Grant agreement number
info:eu-repo/grantAgreement/DURSI/FI/2000 FI-00679
This item appears in the following Collection(s)
Rights
© ISCA. Tots els drets reservats