Text Classification based on Associative Relational Networks for Multi-Domain Text-to-Speech Synthesis
View/Open
Other authors
Publication date
2006-08Abstract
This work is a step further in our research towards developing a new strategy for high quality text-to-speech (TTS)
synthesis among different domains. In this context, it is
necessary to select the most appropriate domain for synthesizing the text input to the TTS system, task that can be
solved including a text classifier (TC) in the classic TTS architecture. Since speech speaking style and prosody depend
on the sequentiality and text structure of the message, the
TC should consider not only thematic but also stylistic aspects of text. To this end, we introduce a new text modelling
scheme based on an associative relational network, which
represents texts as a weighted word-based graph. The conducted experiments validate the proposal in terms of both
objective (text classification efficiency) and subjective (perceived synthetic speech quality) evaluation criteria.
Document Type
Object of conference
Language
English
Keywords
Processament de la parla
Parla
Pages
5 p.
Publisher
The SIGIR Workshop on Directions in Computational Analysis of Stylistics in Text Retrieval, Seattle, August 2006
Is part of
Proceedings of the SIGIR Workshop on Directions in Computational Analysis of Stylistics in Text Retrieval
This item appears in the following Collection(s)
Rights
© The Pennsylvania State University. Tots els drets reservats