Rechercher

Paramétrage

Thèmes

Accessibility

Accessibility

Project CLEAR

Simple Corpus for Medical French

Source work

Natalia Grabar

ATA 2018 (ENLG workshop on Automatic Text Adaptation)
8 November 2018, Tilburg, The Netherlands
See pdf

Download the datasets with medical comparable corpora in French:

  1. encyclopedia articles: 6Mo archive
  2. drug leaflets: 146Mo archive
  3. Cochrane summaries: 7Mo archive

Download the dataset with general language comparable corpora in French:

  1. encyclopedia articles: 155Mo archive

The dataset contains three corpora of documents with comparable contents.
Each corpus provides technical and simple/simplified texts on a given topic in French.