Linguistic and Mixed Excitation Improvements on a HMM-based speech synthesis for Castilian Spanish
View/Open
Author
Gonzalvo Fructuoso, Xavier
Socoró Carrié, Joan Claudi
Iriondo Sanz, Ignasi
Monzo Sánchez, Carlos
Martínez Marroquín, Elisa
Other authors
Universitat Ramon Llull. La Salle
Publication date
2007-08Abstract
HiddenMarkov Models based text-to-speech(HMM-TTS)synthesis is one of the techniques for generating speech from
trained statistical models where spectrum and prosody of basic speech units are modelled altogether. This paper presents
the advancesin our Spanish HMM-TTS and a perceptualtest is
conducted to compare it with an extended PSOLA-based concatenative (E-PSOLA) system. The improvements have been
performed on phonetic information and contextual factors according to the Castilian Spanish language and speech generation using a mixed excitation(ME) technique. The resultsshow
the preference of the new HMM-TTS system in front of the
previous system and a better MOS in comparison with a real
E-PSOLA in terms of acceptability, intelligibility and stability.
Document Type
Object of conference
Language
English
Subject (CDU)
62 - Engineering. Technology in general
Keywords
Processament de la parla
Anàlisi prosòdica (Lingüística)
Pages
6 p.
Publisher
6th ISCA Workshop on Speech Synthesis, Bonn, 22-24 of August 2007
Is part of
Proceedings of the 6th ISCA Workshop on Speech Synthesis
This item appears in the following Collection(s)
Rights
© International Speech Communication Association. Tots els drets reservats