Electrical Engineering

Research Projects

Rich Transcription of Multi-lingual Spoken Documents for Translation and Distillation

Principal Investigators
Mari Ostendorf, Katrin Kirchhoff

Sponsor(s)
SRI International (flow through from DARPA)

Award Period
09/01/2005 - 08/31/2010

Abstract
This project will develop speech recognition and annotation technology for spoken documents, such as news broadcasts, with the goal of enabling more accurate and more efficient information retrieval, extraction, and summarization. The technology will be developed for multiple languages, including English, Mandarin, and Arabic, with modules designed to interact with machine translation for the non-English languages. The intended output is English transcripts, complete with punctuation and capitalization and enriched with information such as speaker role and emphasis.
