Building Hate Speech Data Resources for the Hausa African Indigenous Language

In African countries, the hate speech phenomenon is especially serious due to a historical problem regarding ethnic conflicts. Specifically, the Western region still lacks more research on hate speech focusing on its indigenous languages. Moreover, as most of the existing hate speech data resources are developed for the English language, the research and development of hate speech technologies for African indigenous languages are less developed. Finally, in African countries, there is a constant concern related to language policies in order to recommend the adoption of indigenous languages (e.g. Hausa, Yoruba, Igbo, etc.) as national lingua franca towards obtaining emancipation from colonial legacy. In Nigeria, this would mean the promotion of Hausa over English, hence highlighting the importance of developing specific NLP data resources, methods and tools for the Hausa language. To fill this relevant gap, this research project aims to investigate and build data resources for Hausa African Idigenous language.

Leader
Team

Publications

Resources
Dataset
  • HausaHate: A Benchmark Dataset for Hausa Hate Speech Detection.

Sponsorship