A Hybrid Method Oriented to Concatenative Text-to-Speech Synthesis

Iriondo, Ignasi; Alías-Pujol, Francesc; Sanchís Bernabeu, Francisco Javier; Melenchón, Javier

Publication date

2003-09-01

URI http://hdl.handle.net/20.500.14342/2892

Abstract

In this paper we present a speech synthesis method for diphone-based text-to-speech systems. Its main goal is to achieve prosodic modifications that result in more natural-sounding synthetic speech. This improvement is especially useful for emotional speech synthesis, which requires high-quality prosodic modification. We present a hybrid method based on TD-PSOLA and the harmonic plus noise model, which incorporates a novel method to jointly modify pitch and time-scale. Preliminary results show an improvement in synthetic speech quality when large pitch modifications are required.

Document Type

Article

Published version

Language

English

Keywords

Speech

Speech processing

Pages

4 p.

Publisher

8th European Conference on Speech Communication and Technology. EUROSPEECH 2003 - INTERSPEECH 2003

Grant agreement number

info:eu-repo/grantAgreement/DURSI/FI/2000 FI-00679
