The Nijmegen Corpus of Casual French contains 35 hours of high-quality recordings featuring 46 French speakers conversing among friends. The speech has been orthographically annotated by professional transcribers. The transcriptions are stored in Transcriber XML and Praat TextGrid files.
The corpus is available to researchers in academia and industry. Instructions on how to obtain a copy of the corpus will be posted soon. In the meantime, you can contact Mirjam Ernestus by e-mail (Mirjam.Ernestus@mpi.nl).
A detailed description of the corpus is provided in:
This project is funded by a European Young Investigator Award to Mirjam Ernestus. The corpus was recorded by Francisco Torreira at the Laboratoire de Phonétique et Phonologie (UMR7018) in Paris during November 2008 as part of his dissertation work at Radboud University Nijmegen. The orthographic transcription was carried out in collaboration with Martine Adda-Decker (CNRS-LIMSI, France).