Linguistic and Mixed Excitation Improvements on a HMM-based speech synthesis for Castilian Spanish
Visualitza/Obre
Autor/a
Gonzalvo Fructuoso, Xavier
Socoró Carrié, Joan Claudi
Iriondo Sanz, Ignasi
Monzo Sánchez, Carlos
Martínez Marroquín, Elisa
Altres autors/es
Universitat Ramon Llull. La Salle
Data de publicació
2007-08Resum
HiddenMarkov Models based text-to-speech(HMM-TTS)synthesis is one of the techniques for generating speech from
trained statistical models where spectrum and prosody of basic speech units are modelled altogether. This paper presents
the advancesin our Spanish HMM-TTS and a perceptualtest is
conducted to compare it with an extended PSOLA-based concatenative (E-PSOLA) system. The improvements have been
performed on phonetic information and contextual factors according to the Castilian Spanish language and speech generation using a mixed excitation(ME) technique. The resultsshow
the preference of the new HMM-TTS system in front of the
previous system and a better MOS in comparison with a real
E-PSOLA in terms of acceptability, intelligibility and stability.
Tipus de document
Objecte de conferència
Llengua
English
Matèries (CDU)
62 - Enginyeria. Tecnologia
Paraules clau
Processament de la parla
Anàlisi prosòdica (Lingüística)
Pàgines
6 p.
Publicat per
6th ISCA Workshop on Speech Synthesis, Bonn, 22-24 of August 2007
Publicat a
Proceedings of the 6th ISCA Workshop on Speech Synthesis
Aquest element apareix en la col·lecció o col·leccions següent(s)
Drets
© International Speech Communication Association. Tots els drets reservats