You are here

Global view on Corpora

Technology

Cross-Lingual Textual Entailment Dataset

A Machine Translation Dataset Annotated with Binary Quality Judgements

En-Ita corpus with annotated bilingual terms in IT domain

English-Italian Word Alignment Gold Standard

A cross-lingual entailment corpus, obtained by translating the RTE-3 dataset

A multilingual dataset of news and talk show transcriptions and translations

A ready-to-use version for MT research purposes of the multilingual transcriptions of TED talks