Machine Translation

Most of the research on machine translation in CAMeL is focused on Arabic as source or target language. We list below three ongoing efforts.

Project Name Description
AraParl
CAMeL researchers, funded by a generous grant from NYUAD, are creating an Arabic translation of a portion of the European Parliamentary Proceedings (EuroParl), thus created a large scale development and test set to support research in translation between Arabic and languages of Europe.
Pivot Machine Translation for Morphologically Rich Languages Many languages do not have the necessary large-scale parallel corpora to allow building statistical machine translation systems. As such, in practice machine translation is done by pivoting through English. Since English is a poor language morphologically, this causes a reduction in the quality in the translation especially when translating between two morphologically rich languages. In collaboration with Columbia University, we have developed techniques for improving the quality of pivot machine translation and demonstrated it on Hebrew-Arabic and Persian-Arabic.
Arabic Dialect Machine Translation CAMeL researchers are working in collaboration with researchers in Columbia University on the problem of translating between Arabic dialects and English by exploiting standard Arabic resources.