Local minimum generation error criterion for hybrid HMM speech synthesis
Author
Gonzalvo Fructuoso, Xavier
Gutkin, Alexander
Socoró Carrié, Joan Claudi
Iriondo Sanz, Ignasi
Taylor, Paul
Other authors
Universitat Ramon Llull. La Salle
Phonetic Arts
Yahoo! Europe
Publication date
2009-09
Abstract
This paper presents an HMM-driven hybrid speech synthesis
approach in which unit selection concatenative synthesis is used
to improve the quality of the statistical system by applying a Local
Minimum Generation Error (LMGE) criterion during the synthesis stage.
The idea behind this approach is to combine the robustness of
HMMs with the naturalness of concatenated units. Unlike
conventional hybrid approaches to speech synthesis, which use
concatenative synthesis as a backbone, the proposed system employs
stable regions of natural units to improve the statistically
generated parameters. We show that this approach improves the
generation of vocal tract parameters, smooths bad joints,
and increases overall quality.
Document type
Conference object
Language
English
Subjects (UDC)
62 - Engineering. Technology
Keywords
Speech processing
Prosodic analysis (Linguistics)
Pages
4 p.
Published by
10th Annual Conference of the International Speech Communication Association, Brighton, 6-10 September 2009
Rights
© International Speech Communication Association. All rights reserved