Linguistic and Mixed Excitation Improvements on a HMM-based speech synthesis for Castilian Spanish

Gonzalvo Fructuoso, Xavier; Socoró Carrié, Joan Claudi; Iriondo Sanz, Ignasi; Monzo Sánchez, Carlos; Martínez Marroquín, Elisa

Publication date

2007-08

URI http://hdl.handle.net/20.500.14342/2970

Abstract

Hidden Markov Model-based text-to-speech (HMM-TTS) synthesis is one of the techniques for generating speech from trained statistical models, in which the spectrum and prosody of basic speech units are modelled together. This paper presents the advances in our Spanish HMM-TTS system, and a perceptual test is conducted to compare it with an extended PSOLA-based concatenative (E-PSOLA) system. The improvements concern the phonetic information and contextual factors specific to Castilian Spanish, and speech generation using a mixed excitation (ME) technique. The results show a preference for the new HMM-TTS system over the previous system and a better MOS than a real E-PSOLA system in terms of acceptability, intelligibility and stability.

Document type

Conference object

Language

English

Subjects (UDC)

62 - Engineering. Technology

Keywords

Speech processing

Prosodic analysis (Linguistics)

Pages

6 p.

Published by

6th ISCA Workshop on Speech Synthesis, Bonn, 22-24 August 2007

Published in

Proceedings of the 6th ISCA Workshop on Speech Synthesis
