Evolutionary weight tuning based on diphone pairsfor unit selection speech synthesis

Alías-Pujol, Francesc; Llorà Fàbrega, Xavier; Alías-Pujol, Francesc; Llorà Fàbrega, Xavier

Data de publicació

2003-09-01

URI http://hdl.handle.net/20.500.14342/2896

Resum

Unit selection text-to-speech (TTS) conversion is an ongoingresearch for the speech synthesis community. This paper isfocused on tuning the weights involved in the target and con-catenation cost metrics. We propose a method for automati-cally adjusting these weights simultaneously by means of di-phone and triphone pairs. This method is based on techniquesprovided by the evolutionary computation community, takingadvantage of their robustness in noisy domains. The experi-ments and their analyses demonstrate its good performance inthis problem, thus, overcoming some constraints assumed byprevious works and leading to a new interesting framework forfurther investigations.

Tipus de document

Article

Versió publicada

Llengua

Anglès

Paraules clau

Processament de la parla

Parla

Pàgines

4 p.

Publicat per

8th European Conference on Speech Communication and Technology. EUROSPEECH 2003 - INTERSPEECH 2003

Número de l'acord de la subvenció

info:eu-repo/grantAgreement/DURSI/FI/2000 FI-00679

Citació recomanada

Aquesta citació s'ha generat automàticament.

Mostra el registre complet de l'element

Aquest element apareix en la col·lecció o col·leccions següent(s)

Contribucions a congressos [221]