Contribution of Vocal Tract and Glottal Source Spectral Cues in the Generation of Acted Happy and Aggressive Spanish Vowels

Freixes, Marc; Socoró, Joan Claudi; Alías-Pujol, Francesc

dc.contributor	Universitat Ramon Llull. La Salle
dc.contributor.author	Freixes, Marc
dc.contributor.author	Socoró, Joan Claudi
dc.contributor.author	Alías-Pujol, Francesc
dc.date.accessioned	2025-07-09T16:23:49Z
dc.date.available	2025-07-09T16:23:49Z
dc.date.issued	2022-02-16
dc.identifier.issn	2076-3417	ca
dc.identifier.uri	http://hdl.handle.net/20.500.14342/5382
dc.description.abstract	The source-filter model is one of the main techniques applied to speech analysis and synthesis. Recent advances in voice production by means of three-dimensional (3D) source-filter models have overcome several limitations of classic one-dimensional techniques. Despite the development of preliminary attempts to improve the expressiveness of 3D-generated voices, they are still far from achieving realistic results. Towards this goal, this work analyses the contribution of both the the vocal tract (VT) and the glottal source spectral (GSS) cues in the generation of happy and aggressive speech through a GlottDNN-based analysis-by-synthesis methodology. Paired neutral expressive utterances are parameterised to generate different combinations of expressive vowels, applying the target expressive GSS and/or VT cues on the neutral vowels after transplanting the expressive prosody on these utterances. The conducted objective tests focused on Spanish [a], [i] and [u] vowels show that both GSS and VT cues significantly reduce the spectral distance to the expressive target. The results from the perceptual test show that VT cues make a statistically significant contribution in the expression of happy and aggressive emotions for [a] vowels, while the GSS contribution is significant in [i] and [u] vowels.	ca
dc.format.extent	14 p.	ca
dc.language.iso	eng	ca
dc.publisher	MDPI AG	ca
dc.relation.ispartof	Applied Sciences, vol. 12, núm. 4, febrer 2022,	ca
dc.rights	© L'autor/a	ca
dc.rights	Attribution 4.0 International	ca
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	*
dc.subject.other	Expressive speech synthesis	ca
dc.subject.other	Emotional database;	ca
dc.subject.other	Speech analysis	ca
dc.subject.other	Inverse filtering	ca
dc.subject.other	Glottal source	ca
dc.subject.other	Vocal tract	ca
dc.subject.other	Numerical voice production	ca
dc.subject.other	GlottDNN	ca
dc.subject.other	Síntesi expressiva de la parla	ca
dc.subject.other	Base de dades emocional	ca
dc.subject.other	Anàlisi de la parla	ca
dc.subject.other	Filtratge invers	ca
dc.subject.other	Font glotal	ca
dc.subject.other	Tracte vocal	ca
dc.subject.other	Producció numèrica de la veu	ca
dc.title	Contribution of Vocal Tract and Glottal Source Spectral Cues in the Generation of Acted Happy and Aggressive Spanish Vowels	ca
dc.type	info:eu-repo/semantics/article	ca
dc.rights.accessLevel	info:eu-repo/semantics/openAccess
dc.embargo.terms	cap	ca
dc.subject.udc	004	ca
dc.subject.udc	531/534	ca
dc.subject.udc	62	ca
dc.identifier.doi	https://doi.org/10.3390/app12042055	ca
dc.description.version	info:eu-repo/semantics/publishedVersion	ca