Local minimum generation error criterion for hybrid HMM speech synthesis
Author
Publication date
2009-09
Abstract
This paper presents an HMM-driven hybrid speech synthesis
approach in which unit selection concatenative synthesis is used
to improve the quality of the statistical system by applying a Local
Minimum Generation Error (LMGE) criterion during the synthesis stage.
The idea behind this approach is to combine the robustness of
HMMs with the naturalness of concatenated units. Unlike
conventional hybrid approaches to speech synthesis, which use
concatenative synthesis as a backbone, the proposed system employs
stable regions of natural units to improve the statistically
generated parameters. We show that this approach improves the
generation of vocal tract parameters, smooths bad joints,
and increases overall quality.
Document Type
Conference object
Language
English
Subject (UDC)
62 - Engineering. Technology in general
Keywords
Speech processing
Prosodic analysis (Linguistics)
Pages
4 p.
Publisher
10th Annual Conference of the International Speech Communication Association, Brighton, 6-10 September 2009
Rights
© International Speech Communication Association. All rights reserved