Voice Quality Modelling for Expressive Speech Synthesis
Author
Monzo Sánchez, Carlos
Iriondo Sanz, Ignasi
Socoró Carrié, Joan Claudi
Other authors
Universitat Ramon Llull. La Salle
Universitat Oberta de Catalunya.
Publication date
2014-01-22
Abstract
This paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (F0, duration, and energy), from a neutral style into a number of expressive ones. The main goal was to validate the usefulness of VoQ in the enhancement of expressive synthetic speech in terms of speech quality and style identification. A harmonic plus noise model (HNM) was used to modify VoQ and prosodic parameters that were extracted from an expressive speech corpus. Perception test results indicated the improvement of obtained expressive speech styles using VoQ modelling along with prosodic characteristics.
Document Type
Article
Published version
Language
English
Subject (CDU)
62 - Engineering. Technology in general
Keywords
Speech processing
Prosodic analysis (Linguistics)
Pages
13 p.
Publisher
Hindawi
Is part of
Scientific World Journal, 2014, Vol. 2014 (January)
Rights
© The author
Except where otherwise noted, this item's license is described as http://creativecommons.org/licenses/by/4.0/