Special Session
The aim of this special session is for researchers to apply state-of-the-art feature extraction, acoustic modeling, and adaptation algorithms to the problem of (hands-free) speech recognition in the cockpit. Two main conditions have to be addressed: (i) speech signals corrupted by additive noise, and (ii) non-native speaker input.
For this special session, the HIWIRE database, collected and packaged under the auspices of the IST-EU STREP project HIWIRE (Human Input that Works in Real Environments), is made freely available to the participants. The database contains 8100 English utterances pronounced by non-native speakers (31 French, 20 Greek, 20 Italian, and 10 Spanish speakers); the collected utterances correspond to human input in a command and control aeronautics application. The speech was recorded in a studio, and noise recorded in a cockpit was then artificially added to it. A description of the database can be found here.
The signals are provided in four conditions: clean (studio recordings with a close-talking microphone) and low-, mid-, and high-noise (cockpit noise artificially added to the data). Two research tracks are proposed: the "robust non-native" (RNN) task and the "non-native adaptation" (NNA) task. You are free to participate in one or both tracks. Baseline HTK setup scripts are provided along with the HIWIRE database for both tracks. Baseline word error rates and sentence error rates are also provided for comparison of results. The training scripts use the TIMIT database (not included in the distribution; it must be acquired separately).
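For readers who want to check their own figures against the baseline, the following is a minimal sketch of how word error rate and sentence error rate are conventionally computed. It is not the baseline HTK scoring script; the function names and the example command phrases are hypothetical, and the sketch assumes whitespace-separated transcripts with the reference and hypothesis already paired by utterance.

```python
from typing import List, Sequence, Tuple


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two word sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rates(pairs: List[Tuple[str, str]]) -> Tuple[float, float]:
    """Return (WER, SER) in percent for (reference, hypothesis) pairs."""
    total_words = 0
    total_errors = 0
    wrong_sentences = 0
    for ref_text, hyp_text in pairs:
        ref, hyp = ref_text.split(), hyp_text.split()
        errors = edit_distance(ref, hyp)
        total_errors += errors
        total_words += len(ref)
        if errors > 0:
            wrong_sentences += 1
    wer = 100.0 * total_errors / total_words
    ser = 100.0 * wrong_sentences / len(pairs)
    return wer, ser


if __name__ == "__main__":
    # Hypothetical command phrases, not taken from the HIWIRE database.
    pairs = [
        ("select radio one", "select radio one"),
        ("set heading two seven zero", "set heading two zero"),
    ]
    wer, ser = error_rates(pairs)
    print(f"WER = {wer:.1f}%, SER = {ser:.1f}%")
```

Here the word error rate counts substitutions, deletions, and insertions against the total number of reference words, while the sentence error rate counts utterances containing at least one error.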
Authors may choose to present results on a subset of the data, e.g., clean data only for track 1. Submissions will be treated as regular papers in the Interspeech paper submission procedure.
For questions, clarifications or bug reporting on task definitions and HTK scripts please contact or
Organizing committee: Thibaut Ehrette (Thales Research), Dominique Fohr (LORIA), Petros Maragos (National Technical University of Athens), Marco Matassoni (ITC-IRST), Alexandros Potamianos (Technical University of Crete), Jose C. Segura (Universidad de Granada).
Co-organizer: David van Leeuwen (TNO Human Factors)