Linguistic and Mixed Excitation Improvements on a HMM-based speech synthesis for Castilian Spanish

Gonzalvo Fructuoso, Xavier; Socoró Carrié, Joan Claudi; Iriondo Sanz, Ignasi; Monzo Sánchez, Carlos; Martínez Marroquín, Elisa

Publication date

2007-08

URI http://hdl.handle.net/20.500.14342/2970

Abstract

Hidden Markov Model-based text-to-speech (HMM-TTS) synthesis is one of the techniques for generating speech from trained statistical models, where the spectrum and prosody of basic speech units are modelled together. This paper presents the advances in our Spanish HMM-TTS system, and a perceptual test is conducted to compare it with an extended PSOLA-based concatenative (E-PSOLA) system. The improvements concern the phonetic information and contextual factors, adapted to the Castilian Spanish language, and speech generation using a mixed excitation (ME) technique. The results show a preference for the new HMM-TTS system over the previous system and a better MOS than a real E-PSOLA system in terms of acceptability, intelligibility and stability.

Document Type

Conference object

Language

English

Subject (UDC)

62 - Engineering. Technology in general

Keywords

Speech processing

Prosodic analysis (Linguistics)

Pages

6 p.

Publisher

6th ISCA Workshop on Speech Synthesis, Bonn, 22-24 August 2007

Is part of

Proceedings of the 6th ISCA Workshop on Speech Synthesis
