You are here



Moses -

A statistical machine translation system


A toolkit featuring algorithms and data structures to store and access very large n-gram language models

onlineMGIZA++ -

An extension of MGIZA++, which allows to align sentence pair in an online mode


Adaptive Quality Estimation tool for Machine Translation


CLTE Benchmark -

Cross-Lingual Textual Entailment Dataset

RTE3-derived CLTE dataset -

A cross-lingual entailment corpus, obtained by translating the RTE-3 dataset

BinQE -

A Machine Translation Dataset Annotated with Binary Quality Judgements

TOSCA-MP Speech Ground Truth -

A multilingual dataset of news and talk show transcriptions and translations

BitterCorpus -

En-Ita corpus with annotated bilingual terms in IT domain

WIT3 -

A ready-to-use version for MT research purposes of the multilingual transcriptions of TED talks


English-Italian Word Alignment Gold Standard