Text Classification based on Associative Relational Networks for Multi-Domain Text-to-Speech Synthesis

Alías-Pujol, Francesc; Sevillano, Xavier; Socoró, Joan Claudi

Fecha de publicación

2006-08

URI http://hdl.handle.net/20.500.14342/2961

Resumen

This work is a step further in our research towards developing a new strategy for high quality text-to-speech (TTS) synthesis among different domains. In this context, it is necessary to select the most appropriate domain for synthesizing the text input to the TTS system, task that can be solved including a text classifier (TC) in the classic TTS architecture. Since speech speaking style and prosody depend on the sequentiality and text structure of the message, the TC should consider not only thematic but also stylistic aspects of text. To this end, we introduce a new text modelling scheme based on an associative relational network, which represents texts as a weighted word-based graph. The conducted experiments validate the proposal in terms of both objective (text classification efficiency) and subjective (perceived synthetic speech quality) evaluation criteria.

Tipo de documento

Objeto de conferencia

Lengua

Inglés

Palabras clave

Processament de la parla

Parla

Páginas

5 p.

Publicado por

The SIGIR Workshop on Directions in Computational Analysis of Stylistics in Text Retrieval, Seattle, August 2006

Publicado en

Proceedings of the SIGIR Workshop on Directions in Computational Analysis of Stylistics in Text Retrieval

Mostrar el registro completo del ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)

Contribucions a congressos [202]