Evolutionary weight tuning based on diphone pairsfor unit selection speech synthesis

Alías-Pujol, Francesc; Llorà Fàbrega, Xavier; Alías-Pujol, Francesc; Llorà Fàbrega, Xavier

Publication date

2003-09-01

URI http://hdl.handle.net/20.500.14342/2896

Abstract

Unit selection text-to-speech (TTS) conversion is an ongoingresearch for the speech synthesis community. This paper isfocused on tuning the weights involved in the target and con-catenation cost metrics. We propose a method for automati-cally adjusting these weights simultaneously by means of di-phone and triphone pairs. This method is based on techniquesprovided by the evolutionary computation community, takingadvantage of their robustness in noisy domains. The experi-ments and their analyses demonstrate its good performance inthis problem, thus, overcoming some constraints assumed byprevious works and leading to a new interesting framework forfurther investigations.

Document Type

Article

Published version

Language

English

Keywords

Processament de la parla

Parla

Pages

4 p.

Publisher

8th European Conference on Speech Communication and Technology. EUROSPEECH 2003 - INTERSPEECH 2003

Grant agreement number

info:eu-repo/grantAgreement/DURSI/FI/2000 FI-00679

Recommended citation

This citation was generated automatically.

Show full item record

This item appears in the following Collection(s)

Contribucions a congressos [244]