My primary research interests lie in Explainable Artificial Intelligence and Natural Language Processing, focusing on Harmful Speech and Language Model Interpretability. I develop methods to enhance the intrinsic interpretability and reasoning capabilities of language models, ensuring faithful explanations and alignment with human values in high-stakes domains. I hold a Ph.D. and an M.Sc. in Natural Language Processing, both in Computer Science and Computational Mathematics from the University of São Paulo. I have an interdisciplinary academic background in computer science, linguistics, and computational mathematics. During my doctoral studies, I was a visiting researcher at the University of Southern California, and an invited speaker at the Leibniz Institute for the Social Sciences. My work has received national and international recognition, including the Google Latin America Research Award, the Maria Carolina Monard Award for the Best Thesis in Artificial Intelligence, the PROPOR Best Thesis Award in Natural Language Processing, and the Trevisan Prize for Students “AI for Good,” awarded by the Computer Science Department of Bocconi University and EDGE. I actively contribute to the international artificial intelligence and natural language processing research community, serving as a program committee member and area chair for venues such as ACL, AAAI, IJCNN, and ACM. I have also co-organized the International AAAI Conference on Web and Social Media, Workshop on Online Abuse and Harms, and Explainable Deep Neural Networks for Responsible AI, as well as shared tasks on hate speech detection [i] [ii]. With contributions to addressing hate speech and misinformation in underrepresented communities in the Global South, my research advances the interpretability and alignment of Large Language Models (LLMs) in real-world scenarios that promote accountability, fairness, inclusion, and the trustworthy deployment of NLP systems.
Research Projects
-
Digital Trust Council Standards & Trustworthy Artificial Intelligence Research Project
2026: Digital Trust Council -
Robust Augmented Retrieval for Natural Language Inference over Transformer-based Models
2025: São Paulo State University & Idiap Research Institute -
Building Benchmarks for Hate Speech Detection with Moral Rationales
2024: University of Southern California -
Responsible and Explainable Fact-Checking through Fine-Grained Factual Reasoning
2024: Google LARA -
Benchmarking Hate Speech Detection in Hausa Indigenous African Language
2023: University of São Paulo -
Expanding Evaluation Data for Multilingual Protest News Detection
2022: Koç University -
Socially Responsible and Explainable Hate Speech Detection in Brazilian Portuguese
2020: University of São Paulo
Selected Awards & Fellowships
-
Best Thesis Award in Natural Language Processing
2026: PROPOR
-
International Trevisan Prize for Students “AI for Good”
2025: Bocconi University and EDGE -
Maria Carolina Monard Best Thesis Award in Artificial Intelligence
2025: University of São Paulo -
Nominated for the Thesis Award in Computer Science
2025: Brazilian Computer Society -
Nominated for the Thesis Award in Multimedia, Hypermedia and Web
2025: Brazilian Computer Society -
Postdoctoral Research Fellowship
2025: São Paulo Research Foundation -
Google Latin America Research Award
2024: Google LARA PhD Fellowship