Linguistic and Mixed Excitation Improvements on a HMM-based speech synthesis for Castilian Spanish

Gonzalvo Fructuoso, Xavier; Socoró Carrié, Joan Claudi; Iriondo Sanz, Ignasi; Monzo Sánchez, Carlos; Martínez Marroquín, Elisa

Publication date

2007-08

URI http://hdl.handle.net/20.500.14342/2970

Abstract

Hidden Markov Model-based text-to-speech (HMM-TTS) synthesis is one of the techniques for generating speech from trained statistical models, in which the spectrum and prosody of basic speech units are modelled together. This paper presents the advances in our Spanish HMM-TTS system and reports a perceptual test conducted to compare it with an extended PSOLA-based concatenative (E-PSOLA) system. The improvements concern the phonetic information and contextual factors, adapted to the Castilian Spanish language, and speech generation using a mixed excitation (ME) technique. The results show that the new HMM-TTS system is preferred over the previous system and achieves a better mean opinion score (MOS) than a real E-PSOLA system in terms of acceptability, intelligibility and stability.

Document type

Conference object

Language

English

Subjects (UDC)

62 - Engineering. Technology

Keywords

Speech processing

Prosodic analysis (Linguistics)

Pages

6 p.

Published by

6th ISCA Workshop on Speech Synthesis, Bonn, 22-24 August 2007

Published in

Proceedings of the 6th ISCA Workshop on Speech Synthesis

