MATHEMATICAL REASONING DATASETS FOR AI MODELLING: AN OVERVIEW

Parneet Kaur

doi:10.21474/JNCS01/110

15 Mar 2026

MATHEMATICAL REASONING DATASETS FOR AI MODELLING: AN OVERVIEW

Parneet Kaur

Assistant Professor, Central University of Punjab, Bathinda.

Abstract
Keywords
How to Cite This Article
Corresponding Author

Mathematical reasoning is a growing area of interest for defining the cognitive abilities of Large Language Models (LLMs). Unlike conventional Natural Language Processing (NLP) tasks, mathematical reasoning uses logical structure and symbolic manipulation and involves multi-stepreasoning. The rapidly evolving LLMs have prompted researchers to create various datasets to assess and improve their reasoning. In this paper, we review datasets for mathematical reasoning, ranging from basic rule-oriented to sophisticated LLM-oriented. This paper analyses datasets such as GSM8K, MATH, MathQA, Geometry3K, FinQA, and the newly created Math Odyssey dataset. The paper presents an overview of these datasets along with search strategies, inclusion and exclusion criteria, and the selection of review literature databases. The reviewed literature indicates insufficient coverageof reasoning, biases in the datasets, and limited domains for reasoning. The findings of the reviewpoint towards the need for curated datasets, topology-based evaluations, and more cross-disciplinary reasoning domains.

Parneet Kaur (2026); MATHEMATICAL REASONING DATASETS FOR AI MODELLING: AN OVERVIEW, Jana Nexus: Journal of Computer Science, 2 (03), 17-21, ISSN (O): 3108-1916. DOI URL: https://dx.doi.org/10.21474/JNCS01/110

Parneet Kaur
Assistant Professor, Central University of Punjab, Bathinda.
India

DOI:

Article DOI: 10.21474/JNCS01/110
DOI URL: https://dx.doi.org/10.21474/JNCS01/110

Download Full Paper

Download PDF No. of Downloads: 43 | No. of Views: 79

This work is licensed under a Creative Commons Attribution 4.0 International License.

MATHEMATICAL REASONING DATASETS FOR AI MODELLING: AN OVERVIEW

DOI:

Download Full Paper

Share this article