Linguistic and Mixed Excitation Improvements on a HMM-based speech synthesis for Castilian Spanish
Author
Gonzalvo Fructuoso, Xavier
Socoró Carrié, Joan Claudi
Iriondo Sanz, Ignasi
Monzo Sánchez, Carlos
Martínez Marroquín, Elisa
Other authors
Universitat Ramon Llull. La Salle
Publication date
2007-08
Abstract
Hidden Markov Models based text-to-speech (HMM-TTS) synthesis is one of the techniques for generating speech from trained statistical models where spectrum and prosody of basic speech units are modelled altogether. This paper presents the advances in our Spanish HMM-TTS and a perceptual test is conducted to compare it with an extended PSOLA-based concatenative (E-PSOLA) system. The improvements have been performed on phonetic information and contextual factors according to the Castilian Spanish language and speech generation using a mixed excitation (ME) technique. The results show the preference of the new HMM-TTS system in front of the previous system and a better MOS in comparison with a real E-PSOLA in terms of acceptability, intelligibility and stability.
Document type
Conference object
Language
English
Subjects (UDC)
62 - Engineering. Technology
Keywords
Speech processing
Prosodic analysis (Linguistics)
Pages
6 p.
Published by
6th ISCA Workshop on Speech Synthesis, Bonn, 22-24 of August 2007
Published in
Proceedings of the 6th ISCA Workshop on Speech Synthesis
Rights
© International Speech Communication Association. All rights reserved