New training method helps models do long multiplication — English Level B1

A team led by Xiaoyan Bai and Chenhao Tan at the University of Chicago, with collaborators from MIT, Harvard, the University of Waterloo and Google DeepMind, studied why state-of-the-art language models fail at long multiplication. They focused on long-range dependencies: the need to hold partial products and running sums to reach a correct final answer.

Under standard fine-tuning, models with two to 12 layers achieved less than 1% accuracy on four-digit multiplication; the researchers concluded these models fell into a local optimum by learning surface patterns rather than storing intermediate values. In contrast, a model trained with Implicit Chain of Thought (ICoT) reached 100% accuracy. Probing the ICoT model showed that its hidden states encoded intermediate values and that running sums could be decoded.

The team also tested a simple training objective that teaches a model to track running sums at each step. Adding that objective to a two-layer model raised accuracy to 99% and produced attention patterns similar to ICoT. The study argues that architectural guidance and targeted objectives can enable multi-step reasoning.

Difficult words

long-range dependency — need to keep information across many steps

long-range dependencies

partial product — a number from one multiplication step

partial products

running sum — a total that updates after each step

running sums

fine-tuning — training a model on new task data

local optimum — a solution that is not best overall

implicit chain of thought — training method that encourages stepwise reasoning

Implicit Chain of Thought (ICoT)

Tip: hover, focus or tap highlighted words in the article to see quick definitions while you read or listen.

Discussion questions

Why is it helpful for a model to store intermediate values when doing long multiplication?

Do you think the same training objective (tracking running sums) could help models in other multi-step tasks? Why or why not?

Which is more important for multi-step reasoning: model architecture or specific training objectives? Explain with simple reasons.

21 Jan 2026

AI helps detect melanoma from skin images

Researchers at the University of Missouri tested artificial intelligence to help detect melanoma from images of skin. They trained models on many pictures and found combined models improved accuracy, aiming to support faster care.

Level

Read

8 Oct 2024

Dementia rising in Africa as researchers seek answers

Dementia is increasing in Africa as populations age. Research and evidence in the region are limited, so scientists study genetics, new detection tools and community measures while working with traditional healers to reduce stigma.

Level

Read

24 Nov 2025

Why some gas-rich volcanoes erupt gently

New research shows gas bubbles can form in magma because of shear inside volcanic conduits. Bubbles may join to make channels that let gas escape early, producing calm flows in some volcanoes.

Level

Read

5 Jan 2026

Egyptian university and pharma join to create Africa’s first biotechnology academy

The American University in Cairo and Minapharm have formed a partnership to set up what the university calls the first African academy for biotechnology. The initiative starts early this year to strengthen education, research and industry links.

Level

Read

17 Feb 2026

Speed training may lower dementia risk in older adults

A long-term randomized study found that adults 65 and older who completed speed-of-processing training, with later booster sessions, were less likely to be diagnosed with dementia up to twenty years later.

Level

Read

New training method helps models do long multiplication^{CEFR B1}

Difficult words

Discussion questions

Related articles

AI helps detect melanoma from skin images

Dementia rising in Africa as researchers seek answers

Why some gas-rich volcanoes erupt gently

Egyptian university and pharma join to create Africa’s first biotechnology academy

Speed training may lower dementia risk in older adults

New training method helps models do long multiplication CEFR B1

Difficult words

Discussion questions

Related articles

AI helps detect melanoma from skin images

Dementia rising in Africa as researchers seek answers

Why some gas-rich volcanoes erupt gently

Egyptian university and pharma join to create Africa’s first biotechnology academy

Speed training may lower dementia risk in older adults

New training method helps models do long multiplication^{CEFR B1}