Evolutionary weight tuning based on diphone pairs for unit selection speech synthesis
Author
Alías Pujol, Francesc
Llorà Fàbrega, Xavier
Other authors
Universitat Ramon Llull. La Salle
Illinois Genetic Algorithms Lab
University of Illinois. National Center for Supercomputing Applications
Publication date
2003-09-01
Abstract
Unit selection text-to-speech (TTS) conversion is an ongoing research for the speech synthesis community. This paper is focused on tuning the weights involved in the target and concatenation cost metrics. We propose a method for automatically adjusting these weights simultaneously by means of diphone and triphone pairs. This method is based on techniques provided by the evolutionary computation community, taking advantage of their robustness in noisy domains. The experiments and their analyses demonstrate its good performance in this problem, thus, overcoming some constraints assumed by previous works and leading to a new interesting framework for further investigations.
Document type
Article
Published version
Language
English
Keywords
Speech processing
Speech
Pages
4 p.
Published by
8th European Conference on Speech Communication and Technology. EUROSPEECH 2003 - INTERSPEECH 2003
Grant agreement number
info:eu-repo/grantAgreement/DURSI/FI/2000 FI-00679
Rights
© ISCA. All rights reserved