EasyCall Corpus

The EasyCall corpus is  a database of command speech recorded from healthy individuals and dysarthric patients. This dataset has been collected through a collaboration between the Italian Institute of Technology (IIT), the University of Ferrara and the Sant'Anna Hospital of Ferrara, and it  aims at providing a new resource for future developments of ASR-based assistive technologies.
In particular, it may be exploited to develop a voice-controlled contact application for commercial smartphones, and improve dysarthric patients' ability to communicate with their family and caregivers.

It  currently consists of  16683 audio recordings from 21 healthy and 26 dysarthric speakers.  For each speech-impaired individual, dysarthria has been assessed by a neurologist through the Therapy Outcome Measure.   The recordings focus on a small vocabulary, including basic smartphone commands, such as “open contacts”, “start call”, “end call”.  Specifically, these commands are the result of a survey administered to patients  that evaluates which commands are more likely to be employed by dysarthric individuals to use a speech command-based contact application.  In addition, the dataset includes a list of non-commands (i.e., words near/inside commands or phonetically close to commands) that can be leveraged to build a more robust ASR system.

PROJECT TEAM:
Rosanna Turrisi, Arianna Braccia, Marco Emanuele, Simone Giulietti, Luciano Fadiga, Mariachiara Sensi, Leonardo  Badino.

License CC BY-NC

DOWNLOAD