Voice quality modelling for expressive speech synthesis
Visualitza/Obre
Autor/a
Monzo Sánchez, Carlos
Iriondo Sanz, Ignasi
Socoró Carrié, Joan Claudi
Altres autors/es
Universitat Ramon Llull. La Salle
Universitat Oberta de Catalunya. Computer Science, Multimedia and Telecomunications Studies
Data de publicació
2014-01Resum
This paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (, duration, and energy), from a neutral style into a number of expressive ones. The main goal was to validate the usefulness of VoQ in the enhancement of expressive synthetic speech in terms of speech quality and style identification. A harmonic plus noise model (HNM) was used to modify VoQ and prosodic parameters that were extracted from an expressive speech corpus. Perception test results indicated the improvement of obtained expressive speech styles using VoQ modelling along with prosodic characteristics
Tipus de document
Article
Versió publicada
Llengua
English
Matèries (CDU)
81 - Lingüística i llengües
Paraules clau
Parla
Pàgines
13 p.
Publicat per
Hindawi Publishing Corporation
Publicat a
The Scientific World Journal. 2014
Aquest element apareix en la col·lecció o col·leccions següent(s)
Drets
© L'autor/a
Excepte que s'indiqui una altra cosa, la llicència de l'ítem es descriu com http://creativecommons.org/licenses/by/4.0/