Local minimum generation error criterion for hybrid HMM speech synthesis

Gonzalvo Fructuoso, Xavier; Gutkin, Alexander; Socoró Carrié, Joan Claudi; Iriondo Sanz, Ignasi; Taylor, Paul; Gonzalvo Fructuoso, Xavier; Gutkin, Alexander; Socoró Carrié, Joan Claudi; Iriondo Sanz, Ignasi; Taylor, Paul

Data de publicació

2009-09

URI http://hdl.handle.net/20.500.14342/2957

Resum

This paper presents an HMM-driven hybrid speech synthesis approach in which unit selection concatenative synthesis is used to improve the quality of the statistical system using a Local Minimum Generation Error (LMGE) during the synthesis stage. The idea behind this approach is to combine the robustness due to HMMs with the naturalness of concatenated units. Unlike the conventional hybrid approaches to speech synthesis that use concatenative synthesis as a backbone, the proposed system employs stable regions of natural units to improve the statistically generated parameters. We show that this approach improves the generation of vocal tract parameters, smoothes the bad joints and increases the overall quality.

Tipus de document

Objecte de conferència

Llengua

Anglès

Matèries (CDU)

62 - Enginyeria. Tecnologia

Paraules clau

Processament de la parla

Anàlisi prosòdica (Lingüística)

Pàgines

4 p.

Publicat per

10th Annual Conference of the International Speech Communication Associations, Brighton, 6-10 of September 2009

Citació recomanada

Aquesta citació s'ha generat automàticament.

Mostra el registre complet de l'element

Aquest element apareix en la col·lecció o col·leccions següent(s)

Contribucions a congressos [221]