A 2025 paper from the Stanford Institute for Human-Centered Artificial Intelligence found that many popular large language models do not work well in languages other than English. The online world is dominated by English, and many developers rely on English-language data. This leaves many non-English speakers with weaker AI support.
Practical problems appear when AI helps with daily tasks. For example, asking ChatGPT to write an email in Tamil can give a muddled draft in English. Efforts to add more multilingual data sometimes make matters worse because scraped web texts can include machine-translation mistakes. Experts say companies should work with local communities, check multilingual data, and partner with grassroots developers to make AI more useful for everyone.
Difficult words
- dominate — Be stronger or control a place or areadominated
- rely — Need something or depend on it
- multilingual — Using or including many different languages
- scrape — Collect text or data from websites automaticallyscraped
- muddle — Be unclear or mixed up in meaningmuddled
- grassroots — Local people or small community groups
Tip: hover, focus or tap highlighted words in the article to see quick definitions while you read or listen.
Discussion questions
- Have you used AI tools in your own language? What problem did you see?
- How could local communities help improve AI for their language?
Related articles
Sudan turns to AI as health system struggles
Sudan’s health system is under severe strain after an almost two‑year civil war. A senior health official says the country is using artificial intelligence to help provide care where normal services no longer reach, while shortages and attacks worsen the crisis.
Indonesia tightens rules for digital platforms
Indonesia is increasing regulation of global digital platforms to curb misinformation and protect public safety. Officials inspected a major company's office, require platform registration, and use takedown systems, which has drawn criticism over unclear rules and rights.