- 20 hours of speech (European Portuguese) from 284 children aged 6-10 years old performing reading tasks.
- Manual annotation of 5h30m (104 children), fully tagging several types of reading disfluencies.
- Automatic annotation of the remaining utterances.
The reading tasks performed are of sentences and pseudowords.
It falls under the Creative Commons Attribution-ShareAlike 4.0 International license (CC BY-SA 4.0).
If you use this data, please cite the following paper:
J. Proença, D. Celorico, S. Candeias, C. Lopes, F. Perdigão, The LetsRead Corpus of Portuguese Children Reading Aloud for Performance Evaluation, ELRA International Conf. on Language Resources and Evaluation - LREC, Portorož, Slovenia, May, 2016
|Further description: LetsreadDB(PDF)|
|Download: LetsReadDB.zip (1.61GB)|