ACL'02 Tutorial / Toward Spontaneous Speech Recognition and Understanding

ACL-02 Tutorial

Toward Spontaneous Speech Recognition and Understanding

Sadaoki Furui, Tokyo Institute of Technology, Department of Computer Science

Contents

Introduction
Difficulties of spontaneous speech recognition
Acoustic and linguistic models for spontaneous speech recognition
Paradigm shift from speech recognition to understanding
Message-driven speech recognition and understanding
Speech summarization as speech understanding
Research projects on spontaneous speech corpus and processing technology
Research directions

Abstract
Although high-recognition accuracy can be obtained for speech in the form of reading a written text or similar by using state-of-the art speech recognition technology, the accuracy is quite poor for freely spoken spontaneous speech. The principal reason for this is that acoustic and linguistic models used in speech recognition have been built using written language or speech reading text, while spontaneous speech and written language considerably differ in both acoustically and linguistically. It is indispensable to build a large corpus of spontaneous speech, since our knowledge of the structure of spontaneous speech is very limited. A paradigm shift from speech recognition to understanding is essential, where underlying messages of the speaker are extracted, instead of transcribing all the spoken words. >From these perspectives, several research projects for raising the technological level of spontaneous speech recognition and understanding have recently been conducted in several countries. This tutorial finally predicts future directions in spontaneous speech understanding technologies. It presents the most important research problems, and tries to forecast where progress will be made in the near future.
Mailing address
Sadaoki Furui
Tokyo Institute of Technology
Department of Computer Science
2-12-1 Ookayama, Meguro-ku, Tokyo, 152-8552 Japan
Tel/Fax: +81-3-5734-3480, Email: furui@cs.titech.ac.jp

Back to the tutorials page.