I obtained a Ph.D. and M.Sc. in Natural Language Processing, both earned in Computer Science and Computational Mathematics from the University of São Paulo (USP). I also hold a BS in Information Systems and a BA in Linguistics. During my Ph.D., I was a visiting researcher at the University of Southern California (USC) in the USA and an invited researcher and speaker at the Leibniz Institute for the Social Sciences (GESIS) in Germany. I also received the prestigious Google Latin America Research Award (LARA) fellowship, and my thesis was recognized as one of the best in the country among all defended that year. In addition, I have actively contributed to ACL and AAAI conferences and workshops, serving as a program committee member and co-organizing ICWSM , WOAH, and DeepXplain, along with shared tasks on hate speech. My research focuses on enhancing the interpretability of Language Models (LM), while ensuring their safety, factuality, and fairness. I have been actively contributing to tasks related to hate speech and misinformation, and my current work is dedicated to developing methods and evaluation benchmarks to improve transparency and accountability in AI systems.