General Info

 

Participants

The MIDAS project is cooperation between 3 research groups: ESAT (K.U.Leuven), CLST (R.U.Nijmegen) and NUANCE. It is funded by NTU (Nederlandse Taalunie)

 

ESAT

 

Project info

Title:

MIDAS -- MIssing DAta Solutions for robust Speech Recognition

Start:

1 October 2006

Duration:

4 years

Project coordination:

ESAT speech group

Promoters:

Prof. dr. ir. Hugo Van hamme (ESAT)

Summary:

Robustness to noise in automatic speech recognition is essential for the development of successful applications. Noise reduction techniques have been applied with some success in the past, but there remains a large performance gap between the best ASR implementations and human recognition, especially when the noise is non-stationary. This project tackles the noise robustness problem in ASR through missing data techniques (MDT) by addressing important open R&D issues for accuracy improvement and computational efficiency. Detectors of missing data will make minimal assumptions on the noise, while incorporating more knowledge about speech. The acoustic model in the recognizer's back-end will be refined and its evaluation will be made faster through algorithmic research. The developed algorithms will be integrated in the result of the SPRAAK software (also a project of the NTU) and made available through its distribution channels. This project addresses three STEVIN priorities: 1) robustness of speech recognition, 2) tools and data for the development of robust speech recognition, and 3) confidence measures. In this project we will base our research on a 'real-life' test suite that contains test material from the Dutch SpeechDat Car and Speecon databases.

 

Contact

Professor Hugo Van hamme
K.U.Leuven - ESAT/PSI
Kasteelpark Arenberg 10
3001 Heverlee
BELGIUM
E-mail: 
hugo.vanhamme@esat.kuleuven.be