Objective Viseme Extraction and Audiovisual Uncertainty: Estimation Limits between Auditory and Visual Modes

Melenchón Maldonado, Javier; Simó, Jordi; Cobo Rodríguez, Germán; Martínez Marroquín, Elisa

dc.contributor	Universitat Ramon Llull. La Salle
dc.contributor.author	Melenchón Maldonado, Javier
dc.contributor.author	Simó, Jordi
dc.contributor.author	Cobo Rodríguez, Germán
dc.contributor.author	Martínez Marroquín, Elisa
dc.date.accessioned	2021-06-01T19:54:34Z
dc.date.accessioned	2023-07-13T09:53:09Z
dc.date.available	2021-06-01T19:54:34Z
dc.date.available	2023-07-13T09:53:09Z
dc.date.created	2007-08
dc.date.issued	2007-08
dc.identifier.uri	http://hdl.handle.net/20.500.14342/2972
dc.description.abstract	An objective way to obtain consonant visemes for any given Spanish speaking person is proposed. Its face is recorded while speaking a balanced set of sentences and stored as an audiovisual sequence. Visual and auditory modes are segmented by allophones and a distance matrix is built to find visually similar perceived allophones. Results show high correlation with tedious subjective earlier evaluations regardless of being in English. In addition, estimation between modes is also studied, revealing a tradeoff between performances in both modes: given a set of auditory groups and another of visual ones for each grouping criteria, increasing the estimation performance of one mode is translated to decreasing that of the other one. Moreover, the tradeoff is very similar (<7% between maximum and minimum values) in all observed examples	eng
dc.format.extent	4 p.	ca
dc.language.iso	eng	ca
dc.publisher	International Conference on Auditory-Visual Speech Processing, Hilvarenbeek, August 31 to September 3 2007	ca
dc.rights	© International Speech Communication Association. Tots els drets reservats
dc.source	RECERCAT (Dipòsit de la Recerca de Catalunya)
dc.subject.other	Comunicació audiovisual	ca
dc.subject.other	Percepció auditiva	ca
dc.title	Objective Viseme Extraction and Audiovisual Uncertainty: Estimation Limits between Auditory and Visual Modes	ca
dc.type	info:eu-repo/semantics/conferenceObject	ca
dc.rights.accessLevel	info:eu-repo/semantics/openAccess
dc.embargo.terms	cap	ca
dc.subject.udc	531/534

Fitxers en aquest element

Nom:: av07_P13.pdf
Grandària:: 157.0Kb
Format:: PDF
Descripció:: AVSP.2007

Visualitza/Obre

Aquest element apareix en la col·lecció o col·leccions següent(s)

Contribucions a congressos [241]

Mostra el registre parcial de l'element

Objective Viseme Extraction and Audiovisual Uncertainty: Estimation Limits between Auditory and Visual Modes

Fitxers en aquest element

Aquest element apareix en la col·lecció o col·leccions següent(s)

Visualitza

El meu compte