Voice quality modelling for expressive speech synthesis

Monzo Sánchez, Carlos; Iriondo Sanz, Ignasi; Socoró Carrié, Joan Claudi; Monzo Sánchez, Carlos; Iriondo Sanz, Ignasi; Socoró Carrié, Joan Claudi

doi:https://doi.org/10.1155/2014/627189

Publication date

2014-01

URI http://hdl.handle.net/20.500.14342/3439

DOI

https://doi.org/10.1155/2014/627189

Abstract

This paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (, duration, and energy), from a neutral style into a number of expressive ones. The main goal was to validate the usefulness of VoQ in the enhancement of expressive synthetic speech in terms of speech quality and style identification. A harmonic plus noise model (HNM) was used to modify VoQ and prosodic parameters that were extracted from an expressive speech corpus. Perception test results indicated the improvement of obtained expressive speech styles using VoQ modelling along with prosodic characteristics

Document Type

Article

Published version

Language

English

Subject (CDU)

81 - Linguistics and languages

Keywords

Parla

Pages

13 p.

Publisher

Hindawi Publishing Corporation

Is part of

The Scientific World Journal. 2014

Recommended citation

This citation was generated automatically.

Show full item record

This item appears in the following Collection(s)

Articles publicats en revistes [805]

Rights

Except where otherwise noted, this item's license is described as http://creativecommons.org/licenses/by/4.0/