Voice Quality Modelling for Expressive Speech Synthesis

Monzo Sánchez, Carlos; Iriondo Sanz, Ignasi; Socoró Carrié, Joan Claudi; Monzo Sánchez, Carlos; Iriondo Sanz, Ignasi; Socoró Carrié, Joan Claudi

doi:http://dx.doi.org/10.1155/2014/627189

Publication date

2014-01-22

URI http://hdl.handle.net/20.500.14342/3425

DOI

http://dx.doi.org/10.1155/2014/627189

Abstract

This paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (, duration, and energy), from a neutral style into a number of expressive ones. The main goal was to validate the usefulness of VoQ in the enhancement of expressive synthetic speech in terms of speech quality and style identification. A harmonic plus noise model (HNM) was used to modify VoQ and prosodic parameters that were extracted from an expressive speech corpus. Perception test results indicated the improvement of obtained expressive speech styles using VoQ modelling along with prosodic characteristics.

Document Type

Article

Published version

Language

English

Subject (CDU)

62 - Engineering. Technology in general

Keywords

Processament de la parla

Anàlisi prosòdica (Lingüística)

Pages

13 p.

Publisher

Hindawi

Is part of

Scientific World Journal, 2014, Vol. 2014 (Gener)

Recommended citation

This citation was generated automatically.

Show full item record

This item appears in the following Collection(s)

Articles publicats en revistes [736]

Rights

Except where otherwise noted, this item's license is described as http://creativecommons.org/licenses/by/4.0/