
dc.contributor: Universitat Ramon Llull. La Salle
dc.contributor.author: Freixes, Marc
dc.contributor.author: Socoró, Joan Claudi
dc.contributor.author: Alías-Pujol, Francesc
dc.date.accessioned: 2025-07-09T16:23:49Z
dc.date.available: 2025-07-09T16:23:49Z
dc.date.issued: 2022-02-16
dc.identifier.issn: 2076-3417 [ca]
dc.identifier.uri: http://hdl.handle.net/20.500.14342/5382
dc.description.abstract: The source-filter model is one of the main techniques applied to speech analysis and synthesis. Recent advances in voice production by means of three-dimensional (3D) source-filter models have overcome several limitations of classic one-dimensional techniques. Despite the development of preliminary attempts to improve the expressiveness of 3D-generated voices, they are still far from achieving realistic results. Towards this goal, this work analyses the contribution of both the vocal tract (VT) and the glottal source spectral (GSS) cues in the generation of happy and aggressive speech through a GlottDNN-based analysis-by-synthesis methodology. Paired neutral expressive utterances are parameterised to generate different combinations of expressive vowels, applying the target expressive GSS and/or VT cues on the neutral vowels after transplanting the expressive prosody on these utterances. The conducted objective tests focused on Spanish [a], [i] and [u] vowels show that both GSS and VT cues significantly reduce the spectral distance to the expressive target. The results from the perceptual test show that VT cues make a statistically significant contribution in the expression of happy and aggressive emotions for [a] vowels, while the GSS contribution is significant in [i] and [u] vowels. [ca]
dc.format.extent: 14 p. [ca]
dc.language.iso: eng [ca]
dc.publisher: MDPI AG [ca]
dc.relation.ispartof: Applied Sciences, vol. 12, no. 4, February 2022 [ca]
dc.rights: © The author(s) [ca]
dc.rights: Attribution 4.0 International [ca]
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/ [*]
dc.subject.other: Expressive speech synthesis [ca]
dc.subject.other: Emotional database [ca]
dc.subject.other: Speech analysis [ca]
dc.subject.other: Inverse filtering [ca]
dc.subject.other: Glottal source [ca]
dc.subject.other: Vocal tract [ca]
dc.subject.other: Numerical voice production [ca]
dc.subject.other: GlottDNN [ca]
dc.subject.other: Síntesi expressiva de la parla [ca]
dc.subject.other: Base de dades emocional [ca]
dc.subject.other: Anàlisi de la parla [ca]
dc.subject.other: Filtratge invers [ca]
dc.subject.other: Font glotal [ca]
dc.subject.other: Tracte vocal [ca]
dc.subject.other: Producció numèrica de la veu [ca]
dc.title: Contribution of Vocal Tract and Glottal Source Spectral Cues in the Generation of Acted Happy and Aggressive Spanish Vowels [ca]
dc.type: info:eu-repo/semantics/article [ca]
dc.rights.accessLevel: info:eu-repo/semantics/openAccess
dc.embargo.terms: cap [ca]
dc.subject.udc: 004 [ca]
dc.subject.udc: 531/534 [ca]
dc.subject.udc: 62 [ca]
dc.identifier.doi: https://doi.org/10.3390/app12042055 [ca]
dc.description.version: info:eu-repo/semantics/publishedVersion [ca]



© The author(s)
Except where otherwise noted, this item's license is described as http://creativecommons.org/licenses/by/4.0/