INTEGRATING PROSODIC INFORMATION INTO A SPEECH RECOGNISER
Abstract
Over the last decade there has been a growing tendency to incorporate language engineering strategies into speech technology. This approach combines linguistic and mathematical information in a range of applications: machine translation, natural language processing, speech synthesis and automatic speech recognition (ASR). In speech synthesis, this hybrid approach (linguistic and mathematical/statistical) has led to the design of efficient models for reproducing the acoustic features of natural language. In ASR, however, the incorporation of language engineering strategies is only just beginning. In this paper, we present a theoretical framework for integrating linguistic information into an ASR system. The objective is to design a model that can detect the suprasegmental features of the speech input, mainly those related to the fundamental frequency (F0), which can clarify the function of pauses, intonation contours, and interruptions. This specification model has been designed within the framework of a dialogue system.
License
The Research Group “La lengua inglesa en el ámbito universitario” HUM-397 retains the copyright of the published articles and permits their reuse under a Creative Commons license, Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International: the articles may be copied, used, distributed, transmitted and publicly displayed, provided that (a) the authorship and the original source of publication are cited, (b) they are not used for commercial purposes, (c) no derivative works are created by transforming them, and (d) the existence and specifications of this license are mentioned.