Publications

All CAMeL lab publications on Google Scholar


    Alhafni, Bashar, Nizar Habash, and Houda Bouamor. "The Arabic Parallel Gender Corpus 2.0: Extensions and Analyses." In Proceedings of the Language Resources and Evaluation Conference (LREC). Marseille, France. 2022.

    Abdulrahim, Dana, Go Inoue, Latifa Shamsan, Salam Khalifa, and Nizar Habash. "The Bahrain Corpus: A Multi-genre Corpus of Bahraini Arabic." In Proceedings of the Language Resources and Evaluation Conference (LREC). Marseille, France. 2022.

    Baimukan, Nurpeiis, Nizar Habash, and Houda Bouamor. "Aggregating Hierarchical Dialectal Data for Arabic Dialect Classification." In Proceedings of the Language Resources and Evaluation Conference (LREC). Marseille, France. 2022.

    Batsuren, Khuyagbaatar, Omer Goldman, Salam Khalifa, Nizar Habash, Witold Kieraś, Gábor Bella, Brian Leonard, Garrett Nicolai, Yustinus Ghanggo Ate, Maria Ryskina, Kyle Gorman, Sabrina J. Mielke, Charbel El-Khaissi, Tiago Pimentel, Michael Gasser, William Abbott Lane, Matt Coler, Jaime Rafael Montoya Samame, Delio Siticonatzi Camaiteri, Esaú Zumaeta Rojas, Didier López Francis, Arturo Oncevay, Juan López Bautista, Gema Celeste Silva Villegas, Lucas Torroba Hennigen, Adam Ek, Jean-Philippe Bernardy, Andrey Scherbakov, Aziyana Bayyr-ool, Antonios Anastasopoulos, Roberto Zariquiey, Karina Sheifer, Sofya Ganieva, Matvey Plugaryov, Elena Klyachko, Ali Salehi, Candy Angulo, Andrew Krizhanovsky, Natalia Krizhanovskaya, Elizabeth Salesky, Clara Vania, Sardana Ivanova, Jennifer White, Rowan Hall Maudslay, Josef Valvoda, Ran Zmigrod, Paula Czarnowska, Irene Nikkarinen, Aelita Salchak, Christopher Straughn, Zoey Liu, Jonathan North Washington, Yuval Pinter, Duygu Ataman, Marcin Wolinski, Totok Suhardijanto, Anna Yablonskaya, Niklas Stoehr, Zahroh Nuriah, Francis M. Tyers, Edoardo M. Ponti, Grant Aiton, Aryaman Arora, Richard J. Hatcher, Ritesh Kumar, Mohit Raj, Daria Rodionova, Anastasia Yemelina, Dorina Lakatos, Hilaria Cruz, Botond Barta, Gábor Szolnok, Judit Ács, Taras Andrushko, Igor Marchenko, Polina Mashkovtseva, Alexandra Serova, Emily Prud'hommeaux, Maria Nepomniashchaya, Elena Budianskaya, Eleanor Chodroff, Mans Hulden, Miikka Silfverberg, Fausto Giunchiglia, David Yarowsky, Ryan Cotterell, Reut Tsarfaty, and Ekaterina Vylomova. "UniMorph 4.0: Universal Morphology." In Proceedings of the Language Resources and Evaluation Conference (LREC). Marseille, France. 2022.

    Habash, Nizar, Muhammed AbuOdeh, Dima Taji, Reem Faraj, Jamila El Gizuli, and Omar Kallas. "Camel Treebank: An Open Multi-genre Arabic Dependency Treebank." In Proceedings of the Language Resources and Evaluation Conference (LREC). Marseille, France. 2022.

    Habash, Nizar, and David Palfreyman. "ZAEBAC: An Annotated Arabic-English Bilingual Writer Corpus: Guidelines, Processes, and Insights." In Proceedings of the Language Resources and Evaluation Conference (LREC). Marseille, France. 2022.

    Inoue, Go, Salam Khalifa, and Nizar Habash. "Morphosyntactic Tagging with Pre-trained Language Models for Arabic and its Dialects." In Findings of the Association for Computational Linguistics: ACL 2022.

    Kamal Eddine, Moussa, Nadi Tomeh, Nizar Habash, Joseph Le Roux, and Michalis Vazirgiannis. "AraBART: a Pretrained Arabic Sequence-to-Sequence Model for Abstractive Summarization." arXiv preprint arXiv:2203.10945, 2022.

    Salloum, Wael, and Nizar Habash. "Unsupervised Arabic Dialect Segmentation for Machine Translation." In Natural Language Engineering, Volume 28, Issue 2, pp. 223-248. 2022.

Previous Publications