 Corpus Name: EMEA
     Package: EMEA in Moses format
     Website: http://opus.nlpl.eu/EMEA-v3.php
     Release: v3
Release date: Sat Mar  3 21:51:43 EET 2018

This corpus is part of OPUS - the open collection of parallel corpora
OPUS Website: http://opus.nlpl.eu

Please cite the following article if you use any part of the corpus in your own work: J. Tiedemann, 2012, Parallel Data, Tools and Interfaces in OPUS. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012)

This is a parallel corpus made out of PDF documents from the European Medicines Agency. All files are automatically converted from PDF to plain text using pdftotext with the command line arguments -layout -nopgbrk -eol unix. There are some known problems with tables and multi-column layouts - some of them are fixed in the current version.source: http://www.emea.europa.eu/
NEW: Dutch EMEA Treebank (parsed with Alpino)
