AI moderation misses most African languagesCEFR B1
20 Apr 2026
Adapted from Guest Contributor, Global Voices • CC BY 3.0
Photo by Zulfugar Karimov, Unsplash
Moderators and researchers describe a gap between the languages people use and the languages AI moderation tools can process. Bereket Tsegay, who worked at TikTok’s Kenya hub, said he often watched videos he did not understand because they were in Luo, Dholuo, Kikuyu and Dinka.
A 2025 study found that only 42 African languages appear meaningfully in major language models, and just four are handled with any consistency: Amharic, Swahili, Afrikaans and Malagasy. That leaves more than 98 percent of Africa’s languages largely invisible to moderation systems. The platforms’ dependence on English-language data causes both false positives and false negatives.
Concrete cases underline the problem. A Kenyan creator had an account removed in February 2025 and later restored. Between January and March 2025 TikTok removed more than 450,000 videos from Kenya and banned over 43,000 accounts; by Q2 removals had climbed to 592,000. In Ethiopia false claims about troops seizing Eritrea’s Red Sea port spread on Facebook before fact-checkers debunked them.
Experts warn the burden falls most heavily on creators and journalists who use local languages. Research groups such as AfricaNLP and academic teams in Pretoria, Nairobi and Addis Ababa are building datasets, and partnerships like Cohere with HausaNLP add data to models. Governments and regulators are also creating strategies to address the gap.
Difficult words
- moderator — person who checks and controls online contentModerators
- language model — computer program that processes and generates textlanguage models
- consistency — regular, reliable way something works over time
- false positive — incorrect detection that something is a problemfalse positives
- false negative — failure to detect real problem or contentfalse negatives
- dataset — organized collections of data for training modelsdatasets
- creator — person who makes videos or other content
Tip: hover, focus or tap highlighted words in the article to see quick definitions while you read or listen.
Discussion questions
- How could social media platforms improve moderation for local African languages?
- What problems can happen when moderators do not understand the language in a video?
- Do you know an example where content was removed and later restored? How did that affect the creator?
Related articles
When Basic Needs Fail: How Local Shocks Become Wider Emergencies
From close experience in Turkey, the author shows how shortages in food, health and services turn into medical and social crises. Simple support like school meals, cash and cross‑city cooperation can reduce risks and costs.
Data centers and mining could raise US power costs by 2030
A new study warns that growing electricity use by data centers and cryptocurrency mining could raise power prices and CO2 emissions in the United States by 2030. Researchers modelled hourly power needs across 26 regions to find likely impacts.
People with AMD Judge Car Arrival Times Like Others
A virtual reality study compared adults with age-related macular degeneration (AMD) and adults with normal vision. Both groups judged vehicle arrival times similarly; vision and sound were used together, and a multimodal benefit did not appear.
EU AI rules do not cover exports to West Asia and North Africa
Research by 7amleh finds that European AI rules and funding often leave the EU without binding safeguards. Money, technology and products reach governments and militaries across West Asia and North Africa with limited human rights accountability.