Evolutionary weight tuning based on diphone pairs for unit selection speech synthesis
Publication date
2003-09-01
Abstract
Unit selection text-to-speech (TTS) conversion is an ongoing research topic in the speech synthesis community. This paper is focused on tuning the weights involved in the target and concatenation cost metrics. We propose a method for automatically adjusting these weights simultaneously by means of diphone and triphone pairs. This method is based on techniques provided by the evolutionary computation community, taking advantage of their robustness in noisy domains. The experiments and their analyses demonstrate its good performance on this problem, thus overcoming some constraints assumed by previous works and leading to a new interesting framework for further investigations.
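The general idea of tuning cost weights with an evolutionary search can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not specify the fitness function, the operators, or how the unit pairs are used. The sketch assumes that each candidate unit is described by a vector of sub-costs (target and concatenation terms), that the total cost is a weighted sum of them, and that training data comes as hypothetical (preferred, rejected) pairs of such vectors. All names (`total_cost`, `fitness`, `evolve`) and parameter values are illustrative.

```python
import random

def total_cost(weights, subcosts):
    """Weighted sum of sub-costs (target and concatenation terms)."""
    return sum(w * c for w, c in zip(weights, subcosts))

def fitness(weights, pairs):
    """Assumed fitness: number of pairs in which the preferred
    candidate receives a lower total cost than the rejected one."""
    return sum(total_cost(weights, good) < total_cost(weights, bad)
               for good, bad in pairs)

def normalize(weights):
    """Scale the weights so that they sum to 1."""
    s = sum(weights)
    return [w / s for w in weights]

def evolve(pairs, n_weights, pop_size=30, generations=50, sigma=0.1, seed=0):
    """Simple elitist genetic algorithm over weight vectors."""
    rng = random.Random(seed)
    pop = [normalize([rng.random() + 1e-9 for _ in range(n_weights)])
           for _ in range(pop_size)]
    for _ in range(generations):
        # Keep the better half unchanged (elitism).
        pop.sort(key=lambda w: fitness(w, pairs), reverse=True)
        parents = pop[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            # Uniform crossover: each gene comes from one of the two parents.
            child = [rng.choice(genes) for genes in zip(a, b)]
            # Gaussian mutation, clamped so that weights stay positive.
            child = [max(1e-9, g + rng.gauss(0, sigma)) for g in child]
            children.append(normalize(child))
        pop = parents + children
    return max(pop, key=lambda w: fitness(w, pairs))

# Toy data: in both pairs the preferred candidate has the lower first sub-cost.
toy_pairs = [([0.1, 0.9], [0.9, 0.1]), ([0.2, 0.8], [0.7, 0.3])]
best = evolve(toy_pairs, n_weights=2)
print(best, fitness(best, toy_pairs))
```

Because all weights are evolved together against a single fitness value, they are adjusted simultaneously rather than one at a time, which is the property the abstract highlights.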
Document Type
Article
Published version
Language
English
Keywords
Speech processing
Speech
Pages
4 p.
Publisher
8th European Conference on Speech Communication and Technology. EUROSPEECH 2003 - INTERSPEECH 2003
Grant agreement number
info:eu-repo/grantAgreement/DURSI/FI/2000 FI-00679
Rights
© ISCA. All rights reserved