LingVo.club

AI leaves many non-English speakers behind

CEFR B1

8 Apr 2026

Adapted from Aaron Spitler, Global Voices CC BY 3.0

Photo by Ling App, Unsplash

Level B1 – Intermediate
4 min
219 words

A 2025 paper from the Stanford Institute for Human-Centered Artificial Intelligence found that many popular large language models (LLMs) perform poorly in languages other than English. Researchers warned that public models, including some developed in part by Google and Meta, can produce responses that do not meet the needs of the global majority.

The concentration of AI firms and data in wealthier areas such as Silicon Valley has widened the divide. News outlets reported that millions who speak languages like Kurdish and Swahili are effectively deprioritized, and users who ask for help in other languages often receive unhelpful or error-filled outputs.

Practical problems have appeared in everyday tasks. Wired explained that asking an LLM such as ChatGPT to write an email in Tamil may yield a muddled draft in English. MIT Technology Review found that many low-resource language texts scraped from the web contain machine-translation mistakes, and well-meaning contributors often lack the skills to check accuracy. Faulty content can then become training data and reinforce errors.

Observers also note cultural effects: AI outputs tend to reflect the norms and values of English speakers in well-resourced countries, which can make non-English perspectives invisible. Experts recommend working with sidelined communities, incorporating local input, reviewing outputs for accuracy and authenticity, and forming partnerships that respect cultural differences.

Difficult words

  • concentration: the gathering of people or data together
  • deprioritize (deprioritized): to treat something as less important
  • muddle (muddled): to make something unclear or confused
  • scrape (scraped): to collect information from websites automatically
  • accuracy: how correct or exact information is
  • reinforce: to make an idea or problem stronger
  • norm (norms): usual rules or expected behavior in society
  • authenticity: the quality of being real and true


Discussion questions

  • Which languages or communities near you might be deprioritized by current AI models, and why?
  • How could companies and researchers include local input and respect cultural differences when they build AI systems?
  • How can faulty web texts become part of training data and then reinforce errors in future AI outputs?
