Benchmark
Visual Reasoning Benchmark
06 November 2025
AI-for-Education.org's data science team.

We took our work on visual maths further, using questions from Zambia and India to test AI models on non-verbal reasoning tasks. We made The Visual Reasoning Benchmark to test if AI models can answer genuine visual questions faced by end-of-primary students in LMICs.
Our Visual Reasoning Benchmark incorporates multiple choice questions from end of primary non-verbal, cognitive aptitude assessments used in Zambia and India. These questions differ from those in our Visual Maths Benchmark which consider maths concepts represented with images, as they don't rely on language or numerals. Instead, they focus on visual tasks such as pattern recognition, matching and spatial reasoning, like the well-known Raven's Progressive Matrices.
We're using this benchmark to better understand how well AI models can tackle non-verbal visual tasks. This helps pinpoint where they can be relied on for educational support, where they're likely to fail and what still needs work.
Learn More
Can AI do basic visual reasoning?

Related resources
Find out more about how we made our benchmarks and our thinking about AI benchmarks for education.
Research Paper
We made The Visual Reasoning Benchmark to test whether AI models can help with primary school visual maths. This paper details how we built it.