Benchmark

Visual Reasoning Benchmark

3 September 2025

AI-for-Education.org’s data science team

Visual Reasoning Benchmark hero

We took our work on visual maths further, using questions from Zambia and India to test AI models on non-verbal reasoning tasks. We made the Visual Reasoning Benchmark to test if AI models can answer genuine visual questions faced by end-of-primary students in LMICs.

Our Visual Reasoning Benchmark incorporates multiple-choice questions from end of primary non-verbal, cognitive aptitude assessments used in Zambia and India. These questions differ from those in our Visual Maths Benchmark which consider maths concepts represented with images, as they don't rely on language or numerals. Instead, they focus on visual tasks such as pattern recognition, matching and spatial reasoning, like the well-known Raven's Progressive Matrices.

We're using this benchmark to better understand how well AI models can tackle non-verbal visual tasks. This helps pinpoint where they can be relied on for educational support, where they're likely to fail and what still needs work.

See the leaderboard

Can AI do basic visual reasoning?

Visual Reasoning Question 1 puzzle

Related resources

Find out more about how we made our benchmarks and our thinking about AI benchmarks for education.

Research Paper

We made The Visual Reasoning Benchmark to test whether AI models can help with primary school visual maths. This paper details how we built it.

10 February 2026
Back to top