Marcos Zampieri

Assistant Professor
School of Computing
George Mason University
Fairfax, VA, USA

email   linkedin  Google Scholar

About

I am an Assistant Professor at the School of Computing at George Mason University.

My research interests are in computational linguistics and natural language processing. My research aims to enhance our understanding of human language and communication while, in turn, developing more robust and safer NLP systems in various domains such as education, engineering, and healthcare.

Below is a list of selected recent publications. For a full list of publications, please check Google Scholar.


Recent Selected Publications

TigerLLM - A Family of Bangla Large Language Models
Nishat Raihan, Marcos Zampieri
ACL (2025) pdf

Tracing L1 Interference in English Learner Writing: A Longitudinal Corpus with Error Annotations
Poorvi Acharya, J. Elizabeth Liebl, Dhiman Goswami, Kai North, Marcos Zampieri, Antonios Anastasopoulos
EMNLP (2025) pdf

mHumanEval - A Multilingual Benchmark to Evaluate Large Language Models for Code Generation
Nishat Raihan, Antonios Anastasopoulos, Marcos Zampieri
NAACL (2025) pdf

Bayelemabaga: Creating Resources for Bambara NLP
Allahsera Auguste Tapo, Kevin Assogba, Christopher M Homan, M. Mustafa Rafique, Marcos Zampieri
NAACL (2025) pdf

Large Language Models in Computer Science Education: A Systematic Literature Review
Nishat Raihan, Mohammed Latif Siddiq, Joanna CS Santos, Marcos Zampieri
SIGCSE (2025) pdf

Annotator Reliability Through In-Context Learning
Sujan Dutta, Deepak Pandita, Tharindu Weerasooriya, Marcos Zampieri, Christopher Homan, Ashiqur KhudaBukhsh
AAAI (2025) pdf

A Survey of Multimodal Sarcasm Detection
Shafkat Farabi, Tharindu Ranasinghe, Diptesh Kanojia, Yu Kong, Marcos Zampieri
IJCAI (2024) pdf

Language Variety Identification with True Labels
Marcos Zampieri, Kai North, Tommi Jauhiainen, Mariano Felice, Neha Kumari, Nishant Nair, Yash Bangera
LREC-COLING (2024) pdf

Native Language Identification in Texts: A Survey
Dhiman Goswami, Sharanya Thilagan, Kai North, Shervin Malmasi, Marcos Zampieri
NAACL (2024) pdf

Features of Lexical Complexity: Insights from L1 and L2 Speakers
Kai North, Marcos Zampieri
Frontiers in Artificial Intelligence (2023) url

Lexical Complexity Prediction: An Overview
Kai North, Matthew Shardlow, Marcos Zampieri
ACM Computing Surveys (2023) url

ALEXSIS-PT: A New Resource for Portuguese Lexical Simplification
Kai North, Marcos Zampieri, Tharindu Ranasinghe
COLING (2022) pdf

Handling Extreme Class Imbalance in Technical Logbook Datasets
Farhad Akhbardeh, Cecilia O. Alm, Marcos Zampieri, Travis Desell
ACL (2021) pdf


Books

Automatic Language Identification in Texts

Tommi Jauhiainen, Marcos Zampieri, Timothy Baldwin, Krister Lindén
Synthesis Lectures on Human Language Technologies
Springer (2024)


Similar Languages, Varieties, and Dialects: A Computational Perspective

Marcos Zampieri, Preslav Nakov (Editors)
Studies in Natural Language Processing
Cambridge University Press (2021)


Last Updated: December 2025