A Hybrid Method Oriented to Concatenative Text-to-Speech Synthesis

Iriondo, Ignasi; Alías-Pujol, Francesc; Sanchís Bernabeu, Francisco Javier; Melenchón, Javier

Publication date

2003-09-01

URI http://hdl.handle.net/20.500.14342/2892

Abstract

In this paper we present a speech synthesis method for diphone-based text-to-speech systems. Its main goal is to achieve prosodic modifications that result in more natural-sounding synthetic speech. This improvement is especially useful for emotional speech synthesis, which requires high-quality prosodic modification. We present a hybrid method based on TD-PSOLA and the harmonic plus noise model, which incorporates a novel method to jointly modify pitch and time-scale. Preliminary results show an improvement in synthetic speech quality when large pitch modifications are required.

Document type

Article

Published version

Language

English

Keywords

Speech

Speech processing

Pages

4 p.

Published by

8th European Conference on Speech Communication and Technology. EUROSPEECH 2003 - INTERSPEECH 2003

Grant agreement number

info:eu-repo/grantAgreement/DURSI/FI/2000 FI-00679

This item appears in the following collection(s)

Conference contributions [221]