Mona Diab

Mona Diab

SEH 5800 | Office Hours: On sabbatical, Academic Year 2017-2018
Phone: 202-994-8109

Areas of Expertise

  • Artificial intelligence
  • Computer science
  • Search and data mining

Professor Diab conducts research in Statistical Natural Language Processing (NLP) is a rapidly growing, exciting field of research in artificial intelligence and computer science. Interdisciplinarity is inherent to NLP, drawing on the fields of computer algorithms, software engineering, statistics, machine learning, linguistics, pragmatics, information technology, etc. In NLP, we model language and its use. We build both analytical models and predictive ones. In Professor Mona Diab's NLP lab, we address problems in social media processing, building robust enabling technologies such as syntactic and semantic processing tools for written texts in different languages, information extraction tools for large data, multilingual processing, machine translation, and computational sociolinguistic processing. Professor Diab has a special interest in Arabic NLP, where the emphasis has been on investigating Arabic dialect processing where there are very few available automated resources.


  • Ph.D, University of Maryland, 2003
  • M.S., The George Washington University, 1997
  • B.S., The American University in Cairo, 1992
  • B.S., Helwan University, 1990


  • Abdul-Mageed, M., Diab, M.,& Kuebler, S. (To appear, 2013). SAMAR: A System for Subjectivity and Sentiment Analysis for Arabic Social Media. Computer Speech and Language.
  • BenAjiba, Yassine, Mona Diab, Paolo Rosso. (2009) Arabic Named Entity Recognition: A Feature-driven Study. In the special issue on Processing Morphologically Rich Languages In IEEE Transactions on Audio, Speech and Language Processing. ‘17(5), 926-934, Jul ‘09.
  • Diab, Mona and Yuval Marton. (2013) Semitic Semantics. Book Chapter in Natural Language Processing for Semitic Languages, Editor Imed Zitouni, Springer Publishers, forthcoming.
  • Abu Jabara, Amjad, Pradeep Dasigi, Mona Diab, Dragomir Radev. (2012) Subgroup Detection in Ideological Discussions.Proceedings of ACL 2012, July, Jeju Island, Korea.
  • Agirre, Eneko; Daniel Cer; Mona Diab; Aitor Gonzalez-Agirre. (2012) SemEval-2012 Task 6: A Pilot on Semantic Textual Similarity. In Proceedings of SEMEVAL Workshop, held in conjunction with *SEM and NAACL 2012, Montreal, Canada, June 2012.
  • Dasigi, P, Guo, W. & Diab, M. (2012). Cross-genre subgroup detection in online discussion threads: A study of the implicit attitude using textual latent semantics. InProceedings of ACL 2012, July, Jeju Island, Korea.
  • Elfardy, Heba and Mona Diab (2012). Token Level Identification of Linguistic Code Switching. Proceedings of COLING, Mumbai, India, December 2012.
  • Guo, Weiwei and Mona Diab. (2012) Modeling Semantic Textual Similarity in the Latent Space. Proceedings of ACL 2012, Jeju Island, Korea, July 2012
  • Guo, W. & Diab, M. (2012). A simple unsupervised latent semantics based approach for sentence similarity. In Proceedings of the 6th International Workshop on Semantic Evaluation (SEMEVAL 2012), in conjunction with *SEM and NAACL 2012, Montreal, Canada, June 2012.
  • Hawwari, Abdelati; Kfir Bar; Mona Diab (2012). Building an Arabic Multiword Expressions Repository. Proceedings of the NAACL-HLT 2012 Workshop on Computational Linguistics for Literature,Montreal, Canada, June 2012.
  • Prabhakaran, Vinodkumar, Owen Rambow, Mona Diab (2012) Who’s (Really) the Boss? Perception of Situational Power in Written Interactions. Proceedings of COLING 2012, Mumbai, India, December 2012.
  • Prabhakaran, Vinodkumar, Owen Rambow, Mona Diab. (2012) Predicting Overt Display of Power in Written Dialogs. Proceedings of NAACL 2012, Short Papers, Montreal, Canada, June 2012.
  • Prabhakaran, Vinodkumar, Michael Bloodgood, Mona Diab, Bonnie J. Dorr, Lori Levin, Christine Piatko, Owen Rambow and Benjamin Van Durme. (2012) Statistical Modality Tagging from Rule-based Annotations and Crowdsourcing. In Proceedings of ACL Workshop on Extra-propositional aspects of meaning in computational linguistics (ExProM). Jeju, South Korea, July, 2012.
  • Zaghouani, Wajdi, Abdelati Hawwari and Mona Diab. (2012) A Pilot PropBank Annotation for Quranic Arabic. In Proceedings of the computational linguistics for literature workshop, NAACL 2012, Montreal, Canada, June 2012.
  • Dasigi, Pradeep and Mona Diab. (2011) Noisy Channel Modeling for Named Entity Transliteration. Proceedings of NEWS 2011 Workshop, IJCNLP 2011, November, Chiang Mai, Thailand.