Publications
All CAMeL lab publications on Google Scholar
-
Publications
Elgamal, Salman, Ossama Obeid, Tameem Kabbani, Go Inoue, Nizar Habash. Arabic Diacritics in the Wild: Exploiting Opportunities for Improved Diacritization. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Bangkok, Thailand, 2024.
Wang, Yuxia, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Osama Mohammed Afzal, Tarek Mahmoud, Giovanni Puccetti, Thomas Arnold, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav Nakov. M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Bangkok, Thailand, 2024.
Koto, Fajri, Haonan Li, Sara Shatnawi, Jad Doughman, Abdelrahman Boda Sadallah, Aisha Alraeesi, Khalid Almubarak, Zaid Alyafeai, Neha Sengupta, Shady Shehata, Nizar Habash, Preslav Nakov, Timothy Baldwin. ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Bangkok, Thailand, 2024.
Hamed, Injy, Fadhl Eryani, David Palfreyman and Nizar Habash. ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus. In Proceedings of the LREC-COLING 2024 - The Joint International Conference on Computational Linguistics, Language Resources and Evaluation, Turin, Italy. 2024.
Alhafni, Bashar, Reem Hazim, Juan David Pineros Liberato, Muhamed Al Khalil and Nizar Habash. The SAMER Arabic Text Simplification Corpus. In Proceedings of the LREC-COLING 2024 - The Joint International Conference on Computational Linguistics, Language Resources and Evaluation, Turin, Italy. 2024.
Kallas, Omar, Go Inoue and Nizar Habash. EMAD: A Bridge Tagset for Unifying Arabic POS Annotations. In Proceedings of the LREC-COLING 2024 - The Joint International Conference on Computational Linguistics, Language Resources and Evaluation, Turin, Italy. 2024.
Khairallah, Christian, Salam Khalifa, Reham Marzouk, Mayar Mohamadein Nassar and Nizar Habash. Camel Morph MSA: A Large-Scale Open-Source Morphological Analyzer for Modern Standard Arabic. In Proceedings of the LREC-COLING 2024 - The Joint International Conference on Computational Linguistics, Language Resources and Evaluation, Turin, Italy. 2024.
AbuOdeh, Muhammed, Long Phan, Ahmed Farouk Zakaria Elshabrawy and Nizar Habash. Palmyra 3.0: A User-Friendly Cloud-Based Platform for Morphology and Dependency Syntax Annotation. In Proceedings of the LREC-COLING 2024 - The Joint International Conference on Computational Linguistics, Language Resources and Evaluation, Turin, Italy. 2024.
Micallef, Kurt, Nizar Habash, Claudia Borg, Fadhl Eryani, and Houda Bouamor. Cross-Lingual Transfer from Related Languages: Treating Low-Resource Maltese as Multilingual Code-Switching. In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, Malta, 2024.
Wang, Yuxia, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Chenxi Whitehouse, Osama Mohammed Afzal, Tarek Mahmoud, Toru Sasaki, Thomas Arnold, Alham Aji, Nizar Habash, Iryna Gurevych, and Preslav Nakov. M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection. In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, Malta, 2024.
Khairallah, Christian, Reham Marzouk, Salam Khalifa, Mayar Nassar, and Nizar Habash. Computational Morphology and Lexicography Modeling of Modern Standard Arabic Nominals. In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, Malta, 2024.
Previous Publications
-
Chierici, Alberto, Soojin Lee, Nizar Habash, Aaron Sherwood, Bishnu Dev, Gautham Kumar, Muhammad Ali. Boundless Conversations: AI-Powered Video Interactions across Domains, Languages, and Time. In Proceedings of SIGGRAPH Asia 2023 Emerging Technologies, Sydney, Australia, 2023.
Alhafni, Bashar, Go Inoue, Christian Khairallah, and Nizar Habash. Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, 2023.
Hamed, Injy, Nizar Habash, and Thang Vu. Data Augmentation Techniques for Machine Translation of Code-Switched Texts: A Comparative Study. In Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, 2023.
Elshabrawy, Ahmed, Muhammed AbuOdeh, Go Inoue, and Nizar Habash. CamelParser2.0: A State-of-the-Art Dependency Parser for Arabic. In Proceedings of ArabicNLP 2023, Singapore, 2023.
Abdul-Mageed, Muhammad, AbdelRahim Elmadany, Chiyu Zhang, El Moatez Billah Nagoudi, Houda Bouamor, and Nizar Habash. NADI 2023: The Fourth Nuanced Arabic Dialect Identification Shared Task. In Proceedings of ArabicNLP 2023, Singapore, 2023.
Alkheder, Hasan, Houda Bouamor, Nizar Habash, and Ahmet Zengin. Benchmarking Dialectal Arabic-Turkish Machine Translation. In Proceedings of Machine Translation Summit XIX, Macau SAR, China, 2023.
Chierici, Alberto and Nizar Habash. Tell Me More, Tell Me More: AI-Generated Question Suggestions for the Creation of Interactive Video Recordings. In Proceedings of the 32nd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), Busan, Korea, 2023.
Micallef, Kurt, Fadhl Eryani, Nizar Habash, Houda Bouamor, and Claudia Borg. 2023. Exploring the Impact of Transliteration on NLP Performance: Treating Maltese as an Arabic Dialect. In Proceedings of the Workshop on Computation and Written Language (CAWL 2023), Toronto, Canada.
Alhafni, Bashar, Ossama Obeid, and Nizar Habash. 2023. The User-Aware Arabic Gender Rewriter. In Proceedings of the First Workshop on Gender-Inclusive Translation Technologies, Tampere, Finland.
Gaser, Marwa, Manuel Mager, Injy Hamed, Nizar Habash, Slim Abdennadher and Ngoc Thang Vu. Exploring Segmentation Approaches for Neural Machine Translation of Code-Switched Egyptian Arabic-English Text. In Proceedings of the Conference of the European Association for Computational Linguistics (EACL 2023), Dubrovnik, Croatia, 2023.
Hamed, Injy, Nizar Habash, Slim Abdennadher, and Ngoc Thang Vu. 2023. Investigating Lexical Replacements for Arabic-English Code-Switched Data Augmentation. In Proceedings of the Sixth Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT 2023), Dubrovnik, Croatia.
Hamed, Injy, Amir Hussein, Oumnia Chellah, Shammur Chowdhury, Hamdy Mubarak, Sunayana Sitaram, Nizar Habash, Ahmed Ali. Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition. In Proceedings of the IEEE Spoken Language Technology Workshop (SLT 2022). Doha, Qatar, 2023.
-
Obeid, Ossama, Go Inoue and Nizar Habash. "Camelira: An Arabic Multi-Dialect Morphological Disambiguator." In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2022), Demo. Abu Dhabi, United Arab Emirates. 2022.
Hazim, Reem, Hind Saddiki, Bashar Alhafni, Muhamed Al Khalil, Nizar Habash. "Arabic Word-level Readability Visualization for Assisted Text Simplification." In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2022), Demo. Abu Dhabi, United Arab Emirates. 2022.
Hamed, Injy, Amir Hussein, Oumnia Chellah, Shammur Chowdhury, Hamdy Mubarak, Sunayana Sitaram, Nizar Habash, Ahmed Ali. "Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition." In Proceedings of the IEEE Spoken Language Technology Workshop (SLT 2022). Doha, Qatar. 2022.
Hamed, Injy, Nizar Habash, Slim Abdennadher and Ngoc Thang Vu. "ArzEn-ST: A Three-way Speech Translation Corpus for Code-Switched Egyptian Arabic - English." In Proceedings of the Seventh Arabic Natural Language Processing Workshop. Abu Dhabi, United Arab Emirates. 2022.
Kamal Eddine, Moussa, Nadi Tomeh, Nizar Habash, Joseph Le Roux and Michalis Vazirgiannis. "AraBART: a Pretrained Arabic Sequence-to-Sequence Model for Abstractive Summarization." In Proceedings of the Seventh Arabic Natural Language Processing Workshop. Abu Dhabi, United Arab Emirates. 2022.
Dibas, Shahd, Christian Khairallah, Nizar Habash, Omar Fayez Sadi, Tariq Sairafy, Karmel Sarabta and Abrar Ardah. "Maknuune: A Large Open Palestinian Arabic Lexicon." In Proceedings of the Seventh Arabic Natural Language Processing Workshop. Abu Dhabi, United Arab Emirates. 2022.
Abdul-Mageed, Muhammad, Chiyu Zhang, AbdelRahim Elmadany, Houda Bouamor and Nizar Habash. "NADI 2022: The Third Nuanced Arabic Dialect Identification Shared Task." In Proceedings of the Seventh Arabic Natural Language Processing Workshop. Abu Dhabi, United Arab Emirates. 2022.
Alhafni, Bashar, Nizar Habash, Houda Bouamor, Ossama Obeid, Sultan Alrowili, Daliyah AlZeer, Kawla Mohmad Shnqiti, Ahmed Elbakry, Muhammad ElNokrashy, Mohamed Gabr, Abderrahmane Issam, Abdelrahim Qaddoumi, Vijay Shanker and Mahmoud Zyate. "The Shared Task on Gender Rewriting." In Proceedings of the Seventh Arabic Natural Language Processing Workshop. Abu Dhabi, United Arab Emirates. 2022.
Alhafni, Bashar, Nizar Habash, and Houda Bouamor. "The Arabic Parallel Gender Corpus 2.0: Extensions and Analyses." In Proceedings of the Language Resources and Evaluation Conference (LREC). Marseille, France. 2022.
Abdulrahim, Dana, Go Inoue, Latifa Shamsan, Salam Khalifa, and Nizar Habash. "The Bahrain Corpus: A Multi-genre Corpus of Bahraini Arabic." In Proceedings of the Language Resources and Evaluation Conference (LREC). Marseille, France. 2022.
Baimukan, Nurpeiis, Nizar Habash, and Houda Bouamor. "Hierarchical Aggregation of Dialectal Data for Arabic Dialect Classification." In Proceedings of the Language Resources and Evaluation Conference (LREC). Marseille, France. 2022.
Batsuren, Khuyagbaatar, Omer Goldman, Salam Khalifa, Nizar Habash, Witold Kieraś, Gábor Bella, Brian Leonard, Garrett Nicolai, Yustinus Ghanggo Ate, Maria Ryskina, Kyle Gorman, Sabrina J. Mielke, Charbel El-Khaissi, Tiago Pimentel, Michael Gasser, William Abbott Lane, Matt Coler, Jaime Rafael Montoya Samame, Delio Siticonatzi Camaiteri, Esaú Zumaeta Rojas, Didier López Francis, Arturo Oncevay, Juan López Bautista, Gema Celeste Silva Villegas, Lucas Torroba Hennigen, Adam Ek, Jean-Philippe Bernardy, Andrey Scherbakov, Aziyana Bayyr-ool, Antonios Anastasopoulos, Roberto Zariquiey, Karina Sheifer, Sofya Ganieva, Matvey Plugaryov, Elena Klyachko, Ali Salehi, Candy Angulo, Andrew Krizhanovsky, Natalia Krizhanovskaya, Elizabeth Salesky, Clara Vania, Sardana Ivanova, Jennifer White, Rowan Hall Maudslay, Josef Valvoda, Ran Zmigrod, Paula Czarnowska, Irene Nikkarinen, Aelita Salchak, Christopher Straughn, Zoey Liu, Jonathan North Washington, Yuval Pinter, Duygu Ataman, Marcin Wolinski, Totok Suhardijanto, Anna Yablonskaya, Niklas Stoehr, Zahroh Nuriah, Francis M. Tyers, Edoardo M. Ponti, Grant Aiton, Aryaman Arora, Richard J. Hatcher, Ritesh Kumar, Mohit Raj, Daria Rodionova, Anastasia Yemelina, Dorina Lakatos, Hilaria Cruz, Botond Barta, Gábor Szolnok, Judit Ács, Taras Andrushko, Igor Marchenko, Polina Mashkovtseva, Alexandra Serova, Emily Prud'hommeaux, Maria Nepomniashchaya, Elena Budianskaya, Eleanor Chodroff, Mans Hulden, Miikka Silfverberg, Fausto Giunchiglia, David Yarowsky, Ryan Cotterell, Reut Tsarfaty and Ekaterina Vylomova. "UniMorph 4.0: Universal Morphology." In Proceedings of the Language Resources and Evaluation Conference (LREC). Marseille, France. 2022.
Habash, Nizar, Muhammed AbuOdeh, Dima Taji, Reem Faraj, Jamila El Gizuli, and Omar Kallas. "Camel Treebank: An Open Multi-genre Arabic Dependency Treebank." In Proceedings of the Language Resources and Evaluation Conference (LREC). Marseille, France. 2022.
Habash, Nizar, and David Palfreyman. "ZAEBUC: An Annotated Arabic-English Bilingual Writer Corpus: Guidelines, Processes, and Insights." In Proceedings of the Language Resources and Evaluation Conference (LREC). Marseille, France. 2022.
Inoue, Go, Salam Khalifa, and Nizar Habash. "Morphosyntactic Tagging with Pre-trained Language Models for Arabic and its Dialects." In Findings of the Association for Computational Linguistics: ACL 2022.
Salloum, Wael, and Nizar Habash. "Unsupervised Arabic dialect segmentation for machine translation." In Natural Language Engineering, Volume 28, Issue 2, pp. 223-248. 2022.
El-Haj, Mahmoud, Paul Rayson, Elvis de Souza, Nouran Khallaf, Nizar Habash. "AraSAS: The Open Source Arabic Semantic Tagger." In Proceedings of the 5th Workshop on Open-Source Arabic Corpora and Processing Tools with Shared Tasks on Qur'an QA and Fine-Grained Hate Speech Detection. 2022.
Habash, Nizar, Reham Marzouk, Christian Khairallah, Salam Khalifa. "Morphotactic Modeling in an Open-source Multi-dialectal Arabic Morphological Analyzer and Generator". In Proceedings of the 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology. 2022.
Jensen, Jeffrey L, Daniel Karell, Cole Tanigawa-Lau, Nizar Habash, Mai Oudah, Dhia Fairus Shofia Fani. "Language Models in Sociological Research: An Application to Classifying Large Administrative Data and Measuring Religiosity." In Sociological Methodology. 2022.
-
Jensen, Jeffrey, Daniel Karell, Cole Tanigawa-Lau, Nizar Habash, Mai Oudah, Dhia Fairus Shofia Fani. "Language Models in Sociological Research: An Application to Classifying Large Administrative Data and Measuring Religiosity". Sociological Methodology. 2021.
Darwish, Kareem, Nizar Habash, Mourad Abbas, Hend Al-Khalifa, Hussein T. Al-Natsheh, Houda Bouamor, Karim Bouzoubaa, Violetta Cavalli-Sforza, Samhaa R. El-Beltagy, Wassim El-Hajj, Mustafa Jarrar, and Hamdy Mubarak. "A Panoramic Survey of Natural Language Processing in the Arab World". Special Issue on the Arab World. Communications of the ACM 64, no. 4 (2021): 72-81.
Habash, Nizar. "Arabic Computational Linguistics". In Ryding, Karin and David Wilmsen, eds. "Handbook of Arabic Linguistics". Cambridge University Press. 2021.
Habash, Nizar. "Arabic Dialect Processing". In Zampieri, Marcos and Preslav Nakov, eds. "Similar Languages, Varieties, and Dialects: A Computational Perspective". Cambridge University Press. 2021.
Belkebir, Riadh, and Nizar Habash. "Automatic Error Type Annotation for Arabic". In Proceedings of the Conference on Computational Natural Language Learning (CONLL), Virtual, 2021.
Bendevski, Filip, Jumana Ibrahim, Tina Krulec, Theodore Waters, Nizar Habash, Hanan Salam, Himadri Mukherjee, Christin Camia. "Towards Automatic Narrative Coherence Prediction". In Proceedings of the ACM International Conference on Multimodal Interaction (ICMI '21), Virtual, 2021.
Pimentel, Tiago, Maria Ryskina, Sabrina J. Mielke, Shijie Wu, Eleanor Chodroff, Brian Leonard, Garrett Nicolai, Yustinus Ghanggo Ate, Salam Khalifa, Nizar Habash, Charbel El-Khaissi, Omer Goldman, Michael Gasser, William Lane, Matt Coler, Arturo Oncevay, Jaime Rafael Montoya Samame, Gema Celeste Silva Villegas, Adam Ek, Jean-Philippe Bernardy, Andrey Shcherbakov, Aziyana Bayyr-ool, Karina Sheifer, Sofya Ganieva, Matvey Plugaryov, Elena Klyachko, Ali Salehi, Andrew Krizhanovsky, Natalia Krizhanovsky, Clara Vania, Sardana Ivanova, Aelita Salchak, Christopher Straughn, Zoey Liu, Jonathan North Washington, Duygu Ataman, Witold Kieraś, Marcin Woliński, Totok Suhardijanto, Niklas Stoehr, Zahroh Nuriah, Shyam Ratan, Francis M. Tyers, Edoardo M. Ponti, Grant Aiton, Richard J. Hatcher, Emily Prud'hommeaux, Ritesh Kumar, Mans Hulden, Botond Barta, Dorina Lakatos, Gábor Szolnok, Judit Ács, Mohit Raj, David Yarowsky, Ryan Cotterell, Ben Ambridge, Ekaterina Vylomova. "SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages". In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, Virtual, 2021.
Alhafni, Bashar, Nizar Habash, and Houda Bouamor. "The Arabic Parallel Gender Corpus 2.0: Extensions and Analyses". arXiv. 2021.
Inoue, Go, Salam Khalifa, and Nizar Habash. "Morphosyntactic Tagging with Pre-trained Language Models for Arabic and its Dialects". arXiv. 2021.
Chierici, Alberto, and Nizar Habash. "A View From the Crowd: Evaluation Challenges for Time-Offset Interaction Applications". Proceedings of the Workshop on Human Evaluation of NLP Systems (HumEval2021). Online. 2021.
Chierici, Alberto, Tyeece Kiana Fredorcia Hensley, Wahib Kamran, Kertu Koss, Armaan Agrawal, Erin Meekhof, Goffredo Puccetti, and Nizar Habash. "A Cloud-based User-Centered Time-Offset Interaction Application". Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL2021). Online. 2021.
Inoue, Go, Bashar Alhafni, Nurpeiis Baimukan, Houda Bouamor and Nizar Habash. "The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models". Proceedings of the Sixth Arabic Natural Language Processing Workshop (WANLP2021). Online. 2021.
Eryani, Fadhl and Nizar Habash. "Automatic Romanization of Arabic Bibliographic Records". Proceedings of the Sixth Arabic Natural Language Processing Workshop (WANLP2021). Online. 2021.
Abdul-Mageed, Muhammad, Chiyu Zhang, AbdelRahim Elmadany, Houda Bouamor, and Nizar Habash. "NADI 2021: The Second Nuanced Arabic Dialect Identification Shared Task". Proceedings of the Sixth Arabic Natural Language Processing Workshop (WANLP2021). Online. 2021.
-
Badaro, Gilbert, Hazem Hajj, and Nizar Habash. "A Link Prediction Approach for Accurately Mapping a Large-scale Arabic Lexical Resource to English WordNet." ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) 19, no. 6 (2020): 1-38.
Salloum, Wael and Nizar Habash. "Unsupervised Arabic dialect segmentation for machine translation." Natural Language Engineering. Cambridge University Press. 2020.
Zalmout, Nasser. "Morphological Tagging and Disambiguation in Dialectal Arabic Using Deep Learning Architectures." PhD diss., New York University Tandon School of Engineering, 2020.
Erdmann, Alexander, Micha Elsner, Shijie Wu, Ryan Cotterell, and Nizar Habash. The Paradigm Discovery Problem. In Proceedings of Conference of the Association for Computational Linguistics (ACL 2020), Online.
Zalmout, Nasser and Nizar Habash. Joint Diacritization, Lemmatization, Normalization, and Fine-Grained Morphological Tagging. In Proceedings of Conference of the Association for Computational Linguistics (ACL 2020), Online.
Zalmout, Nasser and Nizar Habash. Utilizing Subword Entities in Character-Level Sequence-to-Sequence Lemmatization Models. In Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020, Main), Online.
Kankanampati, Yash, Joseph Le Roux, Nadi Tomeh, Dima Taji, Nizar Habash. Multitask Easy-First Dependency Parsing: Exploiting Complementarities of Different Dependency Representations. In Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020, Main), Online.
Jiang, Zhengyang, Nizar Habash and Muhamed Al Khalil. An Online Readability Leveled Arabic Thesaurus. In Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020, Main), Online.
Abdul-Mageed, Muhammad, Chiyu Zhang, Houda Bouamor, Nizar Habash. NADI 2020: The First Nuanced Arabic Dialect Identification Shared Task. In Proceedings of the Fifth Arabic Natural Language Processing Workshop (COLING 2020, Arabic NLP Workshop), Online.
Shazal, Ali, Aiza Usman, Nizar Habash. A Unified Model for Arabizi Detection and Transliteration using Sequence-to-Sequence Models. In Proceedings of the Fifth Arabic Natural Language Processing Workshop (COLING 2020, Arabic NLP Workshop).
Taji, Dima and Nizar Habash. PALMYRA 2.0: A Configurable Multilingual Platform Independent Tool for Morphology and Syntax Annotation. In Proceedings of the Fourth Workshop on Universal Dependencies (COLING 2020, Workshop on Universal Dependencies), Online.
Alhafni, Bashar, Nizar Habash and Houda Bouamor. Gender-Aware Reinflection using Linguistically Enhanced Neural Models. Proceedings of the Second Workshop on Gender Bias in Natural Language Processing (COLING 2020, Workshop on Gender Bias in NLP), Online.
Al Khalil, Muhamed, Nizar Habash, Zhengyang Jiang. A Large-Scale Leveled Readability Lexicon for Standard Arabic. In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2020), Marseille, France, 2020.
Chierici, Alberto M., Nizar Habash, Margarita Bicec. The Margarita Dialogue Corpus: A Data Set for Time-Offset Interactions and Unstructured Dialogue Systems. In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2020), Marseille, France, 2020.
Eryani, Fadhl, Nizar Habash, Houda Bouamor, Salam Khalifa. A Spelling Correction Corpus for Multiple Arabic Dialects. In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2020), Marseille, France, 2020.
Khalifa, Salam, Nasser Zalmout, Nizar Habash. Morphological Analysis and Disambiguation for Gulf Arabic: The Interplay between Resources and Methods. In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2020), Marseille, France, 2020.
Obeid, Ossama, Nasser Zalmout, Salam Khalifa, Dima Taji, Mai Oudah, Bashar Alhafni, Go Inoue, Fadhl Eryani, Alexander Erdmann, Nizar Habash. CAMeL Tools: An Open Source Python Toolkit for Arabic Natural Language Processing. In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2020), Marseille, France, 2020.
-
Elsner, Micha, Andrea D. Sims, Alexander Erdmann, Antonio Hernandez, Evan Jaffe, Lifeng Jin, Martha Booker Johnson et al. "Modeling morphological learning, typology, and change: What can the neural sequence-to-sequence framework contribute?" Journal of Language Modelling 7, no. 1 (2019).
Zalmout, Nasser, Kapil Thadani, and Aasish Pappu. "Unsupervised Neologism Normalization Using Embedding Space Mapping." In Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019). Hong Kong, China, 2019.
Noll, Ella, Mai Oudah, and Nizar Habash. Simple Automatic Post-editing for Arabic-Japanese Machine Translation. arXiv preprint arXiv:1907.06210.
Zalmout, Nasser, and Nizar Habash. Joint Diacritization, Lemmatization, Normalization, and Fine-Grained Morphological Tagging. arXiv preprint arXiv:1910.02267.
Ali, Ahmed, Salam Khalifa, and Nizar Habash. Towards Variability Resistant Dialectal Speech Evaluation. In Proceedings of Interspeech. Graz, Austria, 2019.
Erdmann, Alexander, Salam Khalifa, Mai Oudah, Houda Bouamor, Nizar Habash. A Little Linguistics Goes a Long Way: Unsupervised Segmentation with Limited Language Specific Guidance. In Proceedings of SIGMORPHON, Florence, Italy, 2019.
Oudah, Mai, Amjad Almahairi, and Nizar Habash. The Impact of Preprocessing on Arabic-English Statistical and Neural Machine Translation. In Proceedings of the Machine Translation Summit, Dublin, Ireland, 2019.
Bouamor, Houda, Sabit Hassan, and Nizar Habash. The MADAR Shared Task on Arabic Fine-Grained Dialect Identification. In Proceedings of Workshop on Arabic Natural Language Processing, Florence, Italy, 2019.
Held, William and Nizar Habash. The Effectiveness of Simple Hybrid Systems for Hypernym Discovery. In Proceedings of Conference of the Association for Computational Linguistics (ACL), Florence, Italy, 2019.
Alshargi, Faisal, Shahd Dibas, Sakhar Alkhereyf, Reem Faraj, Basmah Abdulkareem, Sane Yagi, Ouafaa Kacha, Nizar Habash and Owen Rambow. Morphologically Annotated Corpora for Seven Arabic Dialects: Taizi, Sanaani, Najdi, Jordanian, Syrian, Iraqi and Moroccan. In Proceedings of Workshop on Arabic Natural Language Processing, Florence, Italy, 2019.
Zalmout, Nasser and Nizar Habash. Adversarial Multitask Learning for Joint Multi-Feature and Multi-Dialect Morphological Modeling. In Proceedings of Conference of the Association for Computational Linguistics (ACL), Florence, Italy, 2019.
Habash, Nizar, Houda Bouamor and Christine Chung. Automatic Gender Identification and Reinflection in Arabic. In Proceedings of the 1st ACL Workshop on Gender Bias for Natural Language Processing, Florence, Italy, 2019.
Obeid, Ossama, Mohammad Salameh, Houda Bouamor and Nizar Habash. ADIDA: Automatic Dialect Identification for Arabic. In Proceedings of the Conference of the North American Association for Computational Linguistics (NAACL), Minneapolis, 2019.
Badaro, Gilbert, Ramy Baly, Hazem Hajj, Wassim El-Hajj, Khaled Shaban, Nizar Habash, Ahmad Sallab, and Ali Hamdi. A Survey of Opinion Mining in Arabic: A Comprehensive System Perspective Covering Challenges and Advances in Tools, Resources, Models, Applications and Visualizations. In ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP). 2019.
-
Khan, Shehroze, Jihyun Kim, Tarik Zulfikarpasic, Peter Chen and Nizar Habash. A Cross-lingual Messenger with Keyword Searchable Phrases for the Travel Domain. In Proceedings of the Computational Linguistics Conference (COLING), Santa Fe, New Mexico, 2018.
Boukaram, Halim-Antoine, Nizar Habash, Micheline Ziadee and Majd Sakr. Improving Domain Independent Question Parsing with Synthetic Treebanks. In Proceedings of the COLING Workshop on Linguistic Annotation, Multiword Expressions and Constructions, Santa Fe, USA, 2018.
Salameh, Mohammad, Houda Bouamor, and Nizar Habash. Fine-Grained Arabic Dialect Identification. In Proceedings of the Computational Linguistics Conference (COLING), Santa Fe, New Mexico, 2018.
Erdmann, Alexander and Nizar Habash. Complementary Strategies for Low Resourced Morphological Modeling. In Proceedings of the SIGMORPHON workshop co-located with the Empirical Methods for Natural Language Processing Conference, Brussels, 2018.
Taji, Dima, Salam Khalifa, Ossama Obeid, Fadhl Eryani and Nizar Habash. An Arabic Morphological Analyzer and Generator with Copious Features. In Proceedings of the SIGMORPHON workshop co-located with the Empirical Methods for Natural Language Processing Conference, Brussels, 2018.
Taji, Dima, Jamila El Gizuli and Nizar Habash. An Arabic Dependency Treebank in the Travel Domain. In Proceedings of the Workshop on Free/Open-Source Arabic Corpora and Corpora Processing Tools (OSACT), LREC, Miyazaki, Japan, 2018.
Al-Khalil, Muhamed, Hind Saddiki, Nizar Habash, and Latifa Alfalasi. A Leveled Reading Corpus of Modern Standard Arabic. In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan, 2018.
Badaro, Gilbert, Hussein Jundi, Hazem Hajj, Wassim El-Hajj and Nizar Habash. ArSEL: A Large-scale Arabic Sentiment and Emotion Lexicon. In Proceedings of the Workshop on Free/Open-Source Arabic Corpora and Corpora Processing Tools (OSACT), LREC, Miyazaki, Japan, 2018.
Bouamor, Houda, Nizar Habash, Mohammad Salameh, Wajdi Zaghouani, Owen Rambow, Dana Abdulrahim, Ossama Obeid, Salam Khalifa, Fadhl Eryani, Alexander Erdmann and Kemal Oflazer. The MADAR Arabic Dialect Corpus and Lexicon. In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan, 2018.
Zalmout, Nasser, Alexander Erdmann, and Nizar Habash. Noise-Robust Morphological Disambiguation for Dialectal Arabic. In Proceedings of Conference of the North American Association for Computational Linguistics (NAACL), New Orleans, 2018.
More, Amir, Özlem Çetinoğlu, Çağrı Çöltekin, Nizar Habash, Benoît Sagot, Djamé Seddah, Reut Tsarfaty and Dima Taji. CoNLL-UL: Universal Morphological Lattices for Universal Dependency Parsing. In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan, 2018.
Erdmann, Alexander, Nasser Zalmout, and Nizar Habash. "Addressing noise in multidialectal word embeddings." In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), vol. 2, pp. 558-565. 2018.
Habash, Nizar, Fadhl Eryani, Salam Khalifa, Owen Rambow, Dana Abdulrahim, Alexander Erdmann, Reem Faraj et al. "Unified guidelines and resources for Arabic dialect orthography." In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018). 2018.
Wrisley, David Joseph, and Hind Saddiki. "“Moon:” A Spatial Analysis of the Gumar Corpus of Gulf Arabic Internet Fiction." Digital Humanities 2018: Book of Abstracts/Libro de resúmenes. (2018).
Ali, Dana Abu, Muaz Ahmad, Hayat Al Hassan, Paula Dozsa, Ming Hu, Jose Varias, and Nizar Habash. "A Bilingual Interactive Human Avatar Dialogue System." In Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, pp. 241-244. 2018.
Saddiki, Hind, Nizar Habash, Violetta Cavalli-Sforza, and Muhamed Al Khalil. "Feature Optimization for Predicting Readability of Arabic L1 and L2." In Proceedings of the 5th Workshop on Natural Language Processing Techniques for Educational Applications, pp. 20-29. 2018.
Khalifa, Salam, Nizar Habash, Fadhl Eryani, Ossama Obeid, Dana Abdulrahim, and Meera Al Kaabi. "A Morphologically Annotated Corpus of Emirati Arabic." In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018). 2018.
Javed, Talha, Nizar Habash, and Dima Taji. "Palmyra: A Platform Independent Dependency Annotation Tool for Morphologically Rich Languages." In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018). 2018.
Inoue, Go, Nizar Habash, Yuji Matsumoto, and Hiroyuki Aoyama. "A Parallel Corpus of Arabic-Japanese News Articles." In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018). 2018.
Obeid, Ossama, Salam Khalifa, Nizar Habash, Houda Bouamor, Wajdi Zaghouani, and Kemal Oflazer. "MADARi: A Web Interface for Joint Arabic Morphological Annotation and Spelling Correction." arXiv preprint arXiv:1808.08392 (2018).
Watson, Daniel, Nasser Zalmout, and Nizar Habash. "Utilizing Character and Word Embeddings for Text Normalization with Sequence-to-Sequence Models." arXiv preprint arXiv:1809.01534 (2018).
-
Onyibe, Chukwuyem, and Nizar Habash. "OMAM at SemEval-2017 Task 4: English Sentiment Analysis with Conditional Random Fields." In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), pp. 670-674. 2017.
Zhang, Lingliang, Nizar Habash, and Godfried Toussaint. "Robust Dictionary Lookup in Multiple Noisy Orthographies." WANLP 2017 (co-located with EACL 2017) (2017): 119.
Baly, Ramy, Gilbert Badaro, Georges El-Khoury, Rawan Moukalled, Rita Aoun, Hazem Hajj, Wassim El-Hajj, Nizar Habash, and Khaled Bashir Shaban. "A Characterization Study of Arabic Twitter Data with a Benchmarking for State-of-the-Art Opinion Mining Models." WANLP 2017 (co-located with EACL 2017) (2017): 110.
Baly, Ramy, Hazem Hajj, Nizar Habash, Khaled Bashir Shaban, and Wassim El-Hajj. "A Sentiment Treebank and Morphologically Enriched Recursive Deep Models for Effective Sentiment Analysis in Arabic." ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) 16, no. 4 (2017): 23.
Nizar Habash, Mona Diab, Kareem Darwish, Wassim El-Hajj, Hend Al-Khalifa, Houda Bouamor, Nadi Tomeh, Mahmoud El-Haj. Proceedings of the Third Arabic Natural Language Processing Workshop, 2017.
Zalmout, Nasser and Nizar Habash. Optimizing Tokenization Choice for Machine Translation Across Multiple Target Languages. The Prague Bulletin of Mathematical Linguistics, 108(1), pp. 257-269, 2017.
Zeman, Daniel, Martin Popel, Milan Straka, Jan Hajic, Joakim Nivre, Filip Ginter, Juhani Luotolahti, ...Nizar Habash et al. "CoNLL 2017 shared task: multilingual parsing from raw text to universal dependencies." Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies (2017): 1-19.
Taji, Dima, Nizar Habash, and Daniel Zeman. "Universal Dependencies for Arabic." WANLP 2017 (co-located with EACL 2017) (2017): 166.
Khalifa, Salam, Sara Hassan, and Nizar Habash. "A Morphological Analyzer for Gulf Arabic Verbs." WANLP 2017 (co-located with EACL 2017) (2017): 35.
Habash, Nizar, Nasser Zalmout, Dima Taji, Hieu Hoang and Maverick Alzate. A Parallel Corpus for Evaluating Machine Translation between Arabic and European Languages. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017), Valencia, Spain, 2017.
Zalmout, Nasser and Nizar Habash. Don't Throw Those Morphological Analyzers Away Just Yet: Neural Morphological Disambiguation for Arabic. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP 2017), Copenhagen, Denmark, 2017.
Mustafa Jarrar, Nizar Habash, Faeq Alrimawi, Diyam Akra, Nasser Zalmout. Curras: an annotated corpus for the Palestinian Arabic dialect. Language Resources and Evaluation, Springer, Netherlands, 2017.
Erdmann, Alexander, Nizar Habash, Dima Taji, and Houda Bouamor. Low Resourced Machine Translation via Morpho-syntactic Modeling: The Case of Dialectal Arabic. MT Summit XVI, Nagoya, Japan, 2017.
Al Khalil, Muhamed, Nizar Habash and Hind Saddiki. Simplification of Arabic Masterpieces for Extensive Reading: A Project Overview. ACLing 2017, Dubai, UAE, 2017.
-
Habash, Nizar, Anas Shahrour and Muhamed Al-Khalil. Exploiting Arabic Diacritization for High Quality Automatic Annotation. In Proceedings of the Language Resources and Evaluation Conference (LREC), Portorož, Slovenia, 2016.
Guzmán, Francisco, Houda Bouamor, Ramy Baly, and Nizar Habash. Machine Translation Evaluation for Arabic using Morphologically-Enriched Embeddings. In Proceedings of COLING, Osaka, Japan, 2016.
Eskander, Ramy, Nizar Habash, Owen Rambow, and Arfath Pasha. Creating Resources for Dialectal Arabic from a Single Annotation: A Case Study on Egyptian and Levantine. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, Osaka, Japan, 2016.
Kaplan, Aidan, Faisal Al Shargi, Ramy Eskander, Nizar Habash and Owen Rambow. A Morphologically Annotated Corpus and a Morphological Analyzer for Moroccan and San'ani Yemeni Arabic. In Proceedings of the Language Resources and Evaluation Conference (LREC), Portorož, Slovenia, 2016.
Wajdi Zaghouani, Nizar Habash, Ossama Obeid, Behrang Mohit, Houda Bouamor, Kemal Oflazer. Annotation Guidelines and Framework for Arabic Machine Translation Post-Edited Corpus. Qatar Foundation Annual Research Conference Proceedings, 2016.
Ahmed El Kholy, Nizar Habash. Morphological Constraints for Phrase Pivot Statistical Machine Translation. arXiv preprint arXiv:1609.03376, 2016.
Hassan Sajjad, Nadir Durrani, Francisco Guzman, Preslav Nakov, Ahmed Abdelali, Stephan Vogel, Wael Salloum, Ahmed El Kholy, Nizar Habash. Egyptian Arabic to English Statistical Machine Translation System for NIST OpenMT'2015. arXiv preprint arXiv:1606.05759, 2016.
Amjad Almahairi, Kyunghyun Cho, Nizar Habash, Aaron Courville. First result on Arabic neural machine translation. arXiv preprint arXiv:1606.02680, 2016.
Al Zaatari, Ayman, Reem El Ballouli, Shady Elbassuoni, Wassim El-Hajj, Hazem Hajj, Khaled Shaban and Nizar Habash. Arabic Corpora for Credibility Analysis. In Proceedings of the Language Resources and Evaluation Conference (LREC), Portorož, Slovenia, 2016.
Mohamed Al-Badrashiny, Arfath Pasha, Mona T Diab, Nizar Habash, Owen Rambow, Wael Salloum, Ramy Eskander. SPLIT: Smart Preprocessing (Quasi) Language Independent Tool. LREC, 2016.
Nizar Habash, Mustafa Jarrar, Faeq Alrimawi, Diyam Fuad Akra, Nasser Zalmout, Eric Bartolotti, Mahdi Arar. Palestinian Arabic conventional orthography guidelines. Technical Report, 2016.
Ali, Dana Abu, and Nizar Habash. Botta: An Arabic Dialect Chatbot. In Proceedings of COLING, Osaka, Japan, 2016.
Khalifa, Salam, Nasser Zalmout, and Nizar Habash. YAMAMA: Yet Another Multi-Dialect Arabic Morphological Analyzer. In Proceedings of COLING, Osaka, Japan, 2016.
Khalifa, Salam, Houda Bouamor and Nizar Habash. DALILA: The Dialectal Arabic Linguistic Learning Assistant. In Proceedings of the Language Resources and Evaluation Conference (LREC), Portorož, Slovenia, 2016.
Irina P Temnikova, Wajdi Zaghouani, Stephan Vogel, Nizar Habash. Applying the Cognitive Machine Translation Evaluation Approach to Arabic. LREC, 2016.
Khalifa, Salam, Nizar Habash and Dana Abdulrahim. A Large Scale Corpus of Gulf Arabic. In Proceedings of the Language Resources and Evaluation Conference (LREC), Portorož, Slovenia, 2016.
Oard, Douglas W., Jerome White, Rashmi Sankepally, Craig Harman. Vapor Engine: Demonstrating an Early Prototype of a Language-Independent Search Engine for Speech. In Proceedings of CHIIR, Chapel Hill, North Carolina, USA, 2016.
Shahrour, Anas, Salam Khalifa, Dima Taji, and Nizar Habash. CamelParser: A System for Arabic Syntactic Analysis and Morphological Disambiguation. In Proceedings of COLING, Osaka, Japan, 2016.
Temnikova, Irina, Wajdi Zaghouani, Stephan Vogel and Nizar Habash. Adapting the Cognitive MT Evaluation Approach to Arabic. In Proceedings of the Language Resources and Evaluation Conference (LREC), Portorož, Slovenia, 2016.
Zaghouani, Wajdi, Nizar Habash, Ossama Obeid, Behrang Mohit and Kemal Oflazer. Building an Arabic Machine Translation Post-Edited Corpus: Guidelines and Annotation. In Proceedings of the Language Resources and Evaluation Conference (LREC), Portorož, Slovenia, 2016.
Zalmout, Nasser, Hind Saddiki, and Nizar Habash. Analysis of Foreign Language Teaching Methods: An Automatic Readability Approach. NLPTEA 2016, Osaka, Japan, 2016.
Dima Taji, Ramy Eskander, Nizar Habash, Owen Rambow. The Columbia University-New York University Abu Dhabi Sigmorphon 2016 Morphological Reinflection Shared Task Submission. Proceedings of the 14th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, 2016.
-
Farra, Noura, Kathleen McKeown, Nizar Habash. "Annotating Targets of Opinions in Arabic using Crowdsourcing." In Proceedings of the Second Arabic Natural Language Processing Workshop, ACL, Beijing, 2015.
Hassan Sajjad, Nadir Durrani, Francisco Guzman, Preslav Nakov, Ahmed Abdelali, Stephan Vogel, Wael Salloum, Ahmed El Kholy, Nizar Habash. "The QCN Egyptian Arabic to English Statistical Machine Translation System." NIST OpenMT’2015, 2015.
Hamdi, Ahmed, Alexis Nasr, Nizar Habash, and Núria Gala. POS-tagging of Tunisian Dialect Using Standard Arabic Resources and Tools. In Proceedings of the Second Arabic Natural Language Processing Workshop, ACL, Beijing, 2015.
Jerome White, Douglas W. Oard, Jiaul Paik, Rashmi Sankepally, Aren Jansen. Using Zero-Resource Spoken Term Discovery for Ranked Retrieval. In Proceedings of NAACL, Denver, Colorado, USA, 2015.
Jermsurawong, Jermsak and Nizar Habash. Predicting the Structure of Cooking Recipes. In Proceedings of EMNLP, Lisbon, 2015.
Oard, Douglas W., Rashmi Sankepally, Jerome White, Aren Jansen, Craig Harman. A Test Collection for Spoken Gujarati Queries. In Proceedings of SIGIR, Santiago, Chile, 2015.
Masmoudi, Abir, Nizar Habash, Mariem Ellouze, Yannick Estève, Lamia Hadrich Belguith. Arabic Transliteration of Romanized Tunisian Dialect Text: A Preliminary Investigation. In Computational Linguistics and Intelligent Text Processing. Lecture Notes in Computer Science Volume 9041, 2015, pp 608-619.
Arfath Pasha, Mohamed Al-Badrashiny, Mona Diab, Nizar Habash, Manoj Pooleery, Owen Rambow, Ryan Roth. "MADAMIRA 2.1." Center for Computational Learning Systems, Columbia University, 2015.
Rozovskaya, Alla, Houda Bouamor, Nizar Habash, Wajdi Zaghouani, Ossama Obeid, Behrang Mohit. The Second QALB Shared Task on Automatic Text Correction for Arabic. In Proceedings of the Second Arabic Natural Language Processing Workshop, ACL, Beijing, 2015.
Saadane, Houda, and Nizar Habash. A Conventional Orthography for Algerian Arabic. In Proceedings of the Second Arabic Natural Language Processing Workshop, ACL, Beijing, 2015.
Sajjad, Hassan, Nadir Durrani, Francisco Guzman, Preslav Nakov, Ahmed Abdelali, Stephan Vogel, Wael Salloum, Ahmed El Kholy, Nizar Habash. QCN System Description for NIST OpenMT15. In Proceedings of the Open Machine Translation Evaluation Workshop, Washington DC, 2015.
Shahrour, Anas, Salam Khalifa and Nizar Habash. Improving Arabic Diacritization through Syntactic Analysis. In Proceedings of EMNLP, Lisbon, 2015.
Zaghouani, Wajdi, Nizar Habash, Houda Bouamor, Alla Rozovskaya, Behrang Mohit, Abeer Heider and Kemal Oflazer. Correction Annotation for Non-Native Arabic Texts: Guidelines and Corpus. In Proceedings of The 9th Linguistic Annotation Workshop, NAACL, pp. 129-139. 2015.
-
Arts, Tressy, Yonatan Belinkov, Nizar Habash, Adam Kilgarriff, and Vit Suchomel. arTenTen: Arabic Corpus and Word Sketches. Journal of King Saud University - Computer and Information Sciences. Volume 26, Issue 4, December 2014, Pages 357–371.
Reem El-Ballouli, Wassim El Hajj, Shady Elbassuoni, Hazem Hajj, Nizar Habash, Khaled Bashir Shaban. Credibility Models For Arabic Content On Twitter. Qatar Foundation Annual Research Conference, 2014.
Gilbert Badaro, Ramy Baly, Hazem Hajj, Nizar Habash, Wassim El-Hajj, Khaled Shaban. An Efficient Model for Sentiment Classification of Arabic Tweets on Mobiles. Qatar Foundation Annual Research Conference, 2014.
Stephen Helmreich, Bonnie Dorr, Nizar Habash, Florence Reeder, Keith Miller, Lori Levin, Teruko Mitamura, Eduard Hovy, Owen Rambow, Advaith Siddharthan. "David Farwell." Routledge Encyclopedia of Translation Technology, 2014.
Arfath Pasha, Mohamed Al-Badrashiny, Mona T Diab, Ahmed El Kholy, Ramy Eskander, Nizar Habash, Manoj Pooleery, Owen Rambow, Ryan Roth. "MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic." LREC, 2014.
Wajdi Zaghouani, Nizar Habash, Behrang Mohit, Abeer Heider, Alla Rozovskaya, Kemal Oflazer. Annotation Guidelines For Non-native Arabic Text In The Qatar Arabic Language Bank. Qatar Foundation Annual Research Conference, 2014.
Ramy Georges Baly, Gilbert Badaro, Hazem Hajj, Nizar Habash, Wassim El Hajj, Khaled Shaban. Semantic Model Representation For Human's Pre-conceived Notions In Arabic Text With Applications To Sentiment Mining. Qatar Foundation Annual Research Conference, 2014.
M Al-Badrashiny, M Diab, N Habash, M Pooleery, O Rambow, R Roth. "MADAMIRA v1.0 User Guide." Center for Computational Learning Systems, Columbia University, 2014.
Nizar Habash. "Machine Translation for Arabic." Winter School on Arabic Language Processing, Princess Sumaya University for Technology, January 27-29, 2014.
Ahmed El Kholy, Nizar Habash. "Alignment Symmetrization Optimization Targeting Phrase Pivot Statistical Machine Translation." Center for Computational Learning Systems, Columbia University, 2014.
Mona Diab, Nizar Habash. Natural Language Processing of Arabic and its Dialects. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). Doha, Qatar, 2014.
Badaro, Gilbert, Ramy Baly, Hazem Hajj, Nizar Habash and Wassim El-Hajj. A Large Scale Arabic Sentiment Lexicon for Arabic Opinion Mining. In Proceedings of the Arabic Natural Language Processing Workshop, EMNLP, Doha, 2014.
Bies, Ann, Zhiyi Song, Mohamed Maamouri, Stephen Grimes, Haejoong Lee, Jonathan Wright, Stephanie Strassel, Nizar Habash, Ramy Eskander, and Owen Rambow. Transliteration of Arabizi into Arabic Orthography: Developing a Parallel Annotated Arabizi-Arabic Script SMS/Chat Corpus. In Proceedings of the Arabic Natural Language Processing Workshop, EMNLP, Doha, 2014.
Eskander, Ramy, Mohamed Al-Badrashiny, Nizar Habash and Owen Rambow. Foreign Words and the Automatic Processing of Arabic Social Media Text Written in Roman Script. In Proceedings of the First Workshop on Computational Approaches to Code Switching, EMNLP, Doha, 2014.
Nizar Habash. Computational Processing of Arabic Dialects (invited talk). Proceedings of the EMNLP 2014 Workshop on Language Technology for Closely Related Languages and Language Variants (LT4CloseLang), 2014.
Jarrar, Mustafa, Nizar Habash, Diyam Akra and Nasser Zalmout. Building a Corpus for Palestinian Arabic: a Preliminary Study. In Proceedings of the Arabic Natural Language Processing Workshop, EMNLP, Doha, 2014.
Jeblee, Serena, Weston Feely, Houda Bouamor, Alon Lavie, Nizar Habash and Kemal Oflazer. Domain and Dialect Adaptation for Machine Translation into Egyptian Arabic. In Proceedings of the Arabic Natural Language Processing Workshop, EMNLP, Doha, 2014.
Mohit, Behrang, Alla Rozovskaya, Nizar Habash, Wajdi Zaghouani and Ossama Obeid. The First QALB Shared Task on Automatic Text Correction for Arabic. In Proceedings of the Arabic Natural Language Processing Workshop, EMNLP, Doha, 2014.
Rozovskaya, Alla, Nizar Habash, Ramy Eskander, Noura Farra and Wael Salloum. The Columbia System in the QALB-2014 Shared Task on Arabic Error Correction. In Proceedings of the Arabic Natural Language Processing Workshop, EMNLP, Doha, 2014.
Xiaodong Cui, Brian Kingsbury, Jia Cui, Bhuvana Ramabhadran, Andrew Rosenberg, Mohammad Sadegh Rasooli, Owen Rambow, Nizar Habash, Vaibhava Goel. "Improving deep neural network acoustic modeling for audio corpus indexing under the IARPA Babel program." Fifteenth Annual Conference of the International Speech Communication Association, 2014.
Wael Salloum, Heba Elfardy, Linda Alamir-Salloum, Nizar Habash, Mona Diab. "Sentence level dialect identification for machine translation system selection." Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2014.
Noura Farra, Nadi Tomeh, Alla Rozovskaya, Nizar Habash. "Generalized character-level spelling error correction." Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2014.
Mohamed Al-Badrashiny, Ramy Eskander, Nizar Habash, Owen Rambow. "Automatic transliteration of romanized dialectal Arabic." Proceedings of the Eighteenth Conference on Computational Natural Language Learning, 2014.
Houda Bouamor, Nizar Habash, Kemal Oflazer. "A Multidialectal Parallel Corpus of Arabic." Center for Computational Learning Systems, Columbia University, 2014.
Mohamed Maamouri, Ann Bies, Seth Kulick, Michael Ciul, Nizar Habash, Ramy Eskander. "Developing an Egyptian Arabic Treebank: Impact of Dialectal Morphology on Annotation and Tool Development." Linguistic Data Consortium University of Pennsylvania; Center for Computational Learning Systems, Columbia University, 2014.
Alla Rozovskaya, Kai-Wei Chang, Mark Sammons, Dan Roth, Nizar Habash. "The Illinois-Columbia system in the CoNLL-2014 shared task." Proceedings of the Eighteenth Conference on Computational Natural Language Learning: Shared Task, 2014.
Mona Diab, Mohamed Al-Badrashiny, Maryam Aminian, Mohammed Attia, Pradeep Dasigi, Heba Elfardy, Ramy Eskander, Nizar Habash, Abdelati Hawwari, Wael Salloum. "Tharwa: A Large Scale Dialectal Arabic-Standard Arabic-English Lexicon." Department of Computer Science, The George Washington University; Center for Computational Learning Systems, Columbia University, 2014.
Abir Masmoudi, Mariem Ellouze Khemakhem, Yannick Estève, Lamia Hadrich Belguith, Nizar Habash. "A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition." ANLP Research group, MIRACL Lab., University of Sfax, Tunisia; LIUM, University of Maine, France; Center for Computational Learning Systems, Columbia University, 2014.
Inès Zribi, Rahma Boujelbane, Abir Masmoudi, Mariem Ellouze, Lamia Hadrich Belguith, Nizar Habash. "A Conventional Orthography for Tunisian Arabic." LREC, 2014.
Salloum, Wael and Nizar Habash. ADAM: Analyzer for Dialectal Arabic Morphology. Journal of King Saud University - Computer and Information Sciences. Volume 26, Issue 4, December 2014, Pages 372–378.
Wajdi Zaghouani, Behrang Mohit, Nizar Habash, Ossama Obeid, Nadi Tomeh, Alla Rozovskaya, Noura Farra, Sarah Alkuhlani, Kemal Oflazer. "Large Scale Arabic Error Annotation: Guidelines and Framework." LREC, 2014.
Ray Fabri, Michael Gasser, Nizar Habash, George Kiraz, Shuly Wintner. "Linguistic introduction: The orthography, morphology and syntax of Semitic languages." Natural Language Processing of Semitic Languages, pp. 3-41, 2014.
Mohammad Sadegh Rasooli, Thomas Lippincott, Nizar Habash, Owen Rambow. "Unsupervised morphology-based vocabulary expansion." Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2014.
Tomeh, Nadi, Nizar Habash, Ramy Eskander and Joseph Le Roux. A Pipeline Approach to Supervised Automatic Error Correction. In Proceedings of the Arabic Natural Language Processing Workshop, EMNLP, Doha, 2014.
Jennifer Sikos, Peter David, Nizar Habash, Reem Faraj. "Authorship Analysis of Inspire Magazine through Stylometric and Psychological Features." Intelligence and Security Informatics Conference (JISIC), 2014 IEEE Joint, 2014.