Natalia Grabar
Project CLEAR
Simple Corpus for Medical French
Source work
Natalia Grabar
ATA 2018 (ENLG workshop on Automatic Text Adaptation)
8 November 2018, Tilburg, The Netherlands
See pdf
Download the datasets of comparable medical corpora in French:
- encyclopedia articles: 6 MB archive
- drug leaflets: 146 MB archive
- Cochrane summaries: 7 MB archive
Download the dataset of comparable general-language corpora in French:
- encyclopedia articles: 155 MB archive
The dataset contains three corpora of documents with comparable contents.
Each corpus provides both technical and simple or simplified French texts on a given topic.