Voice Quality Modelling for Expressive Speech Synthesis

Monzo Sánchez, Carlos; Iriondo Sanz, Ignasi; Socoró Carrié, Joan Claudi

Fecha de publicación

2014-01-22

URI http://hdl.handle.net/20.500.14342/3425

DOI

http://dx.doi.org/10.1155/2014/627189

Resumen

This paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (, duration, and energy), from a neutral style into a number of expressive ones. The main goal was to validate the usefulness of VoQ in the enhancement of expressive synthetic speech in terms of speech quality and style identification. A harmonic plus noise model (HNM) was used to modify VoQ and prosodic parameters that were extracted from an expressive speech corpus. Perception test results indicated the improvement of obtained expressive speech styles using VoQ modelling along with prosodic characteristics.

Tipo de documento

Artículo

Versión publicada

Lengua

Inglés

Materias (CDU)

62 - Ingeniería. Tecnología

Palabras clave

Processament de la parla

Anàlisi prosòdica (Lingüística)

Páginas

13 p.

Publicado por

Hindawi

Publicado en

Scientific World Journal, 2014, Vol. 2014 (Gener)

Mostrar el registro completo del ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)

Articles publicats en revistes [574]

Derechos

Excepto si se señala otra cosa, la licencia del ítem se describe como http://creativecommons.org/licenses/by/4.0/