Can AI Dream Up New Math Problems?

Author: Denis Avetisyan


Researchers are exploring whether large language models can move beyond solving existing mathematical challenges to actually generate novel research questions.

This paper introduces DeepMath-generate, an agent that leverages large language models to propose original problems in differential and Riemannian geometry, specifically focusing on nonnegative curvature.

While traditionally viewed as computational tools, the potential for Large Language Models (LLMs) to contribute to creative mathematical inquiry remains largely unexplored. This paper, building on the work presented in ‘Can LLM generate interesting mathematical research problems?’, investigates whether LLMs can autonomously formulate novel and valuable research questions. We demonstrate this capacity through DeepMath-generate, an agent capable of producing 665 previously unknown problems in differential geometry, many of which were verified by experts as possessing unique research value. Could this represent a paradigm shift, enabling LLMs to not only solve, but actively drive mathematical discovery?


The Geometry of Inquiry: Identifying the Unaskable

Differential geometry, a cornerstone of modern mathematics and physics, frequently encounters a peculiar obstacle: the scarcity of well-defined, genuinely new research questions. While the field boasts a rich history of solved problems, identifying worthwhile avenues for further investigation proves increasingly challenging. This isn’t a matter of lacking tools or techniques, but rather a fundamental difficulty in knowing what to ask. The formulation of a meaningful problem requires not only technical proficiency but also a deep, intuitive grasp of the landscape – a capacity that is often the limiting factor in advancing the field. Progress, therefore, hinges less on solving existing puzzles and more on the creative ability to unearth, and rigorously define, the questions that will shape future inquiry. The very act of problem creation, it seems, has become a significant bottleneck in mathematical discovery.

The advancement of differential geometry, like many mathematical disciplines, has historically depended on the capacity of researchers to conceive of both solvable and insightful problems – a process deeply rooted in human intuition and specialized expertise. This reliance, however, presents a significant bottleneck to discovery. Formulating truly novel and challenging questions demands years of dedicated study and a nuanced understanding of the field, limiting the rate at which new avenues of research can be explored. The sheer cognitive load involved in problem creation often overshadows the time dedicated to actual problem-solving, hindering the overall pace of progress. Consequently, breakthroughs frequently arise from the work of a relatively small number of highly experienced mathematicians, while potentially fruitful areas may remain unexplored simply due to a lack of adequately formulated questions – a situation that automated problem generation seeks to address.

The advancement of differential geometry, like many mathematical disciplines, is fundamentally constrained not by a lack of tools, but by a scarcity of compelling research questions. Currently, the formulation of these problems relies heavily on the ingenuity of individual mathematicians, a process that is both time-consuming and susceptible to inherent biases. Automating this process-generating novel and rigorously challenging problems-offers a potential solution to this bottleneck. Such a system could explore vast mathematical landscapes, identifying areas ripe for investigation that might be overlooked by human intuition. By systematically creating and evaluating problems, researchers envision a future where the pace of discovery is dramatically accelerated, pushing the boundaries of geometric understanding and potentially revealing unexpected connections within the field. This isn’t merely about increasing the quantity of research, but about fostering a more diverse and comprehensive exploration of mathematical possibilities, potentially unlocking breakthroughs currently beyond reach.

DeepMath-generate: An Agent for Geometric Innovation

DeepMath-generate is an autonomous agent designed for the creation of novel research problems within the field of differential geometry. The system functions by utilizing Large Language Models (LLMs) as its core reasoning engine, eliminating the need for human intervention in the initial problem formulation stage. This agent is not simply a random problem generator; it is engineered to produce mathematically sound and potentially valuable problems for researchers. The architecture centers on leveraging the LLM’s capacity for complex symbolic manipulation and logical deduction to construct problems involving concepts such as manifolds, curvature tensors, and [latex]n[/latex]-dimensional geometry. The resulting problems are intended to be suitable for publication and further investigation by the mathematical community.

DeepMath-generate employs a two-prompt system to facilitate autonomous problem generation. The ‘Generator Prompt’ initiates the process by requesting a novel mathematical problem within the specified domain of differential geometry, providing initial constraints and desired characteristics. Following problem creation, an ‘Evaluator Prompt’ is utilized to assess the generated problem based on criteria of novelty and quality; this prompt analyzes the problem statement to determine if it presents a genuinely new challenge and adheres to mathematical rigor. The output of the Evaluator Prompt – a score reflecting the problem’s suitability – informs subsequent iterations of the generation process, allowing the system to refine and improve the quality of the generated problems.

DeepMath-generate’s functionality is fundamentally dependent on the reasoning and generative capabilities of the GPT-5 large language model. GPT-5 is utilized to process instructions, formulate mathematical concepts within the domain of differential geometry, and construct complete problem statements. This includes generating the necessary definitions, conditions, and questions required for a well-defined mathematical problem. The model’s capacity for complex reasoning allows it to move beyond simple keyword combinations and create problems exhibiting non-trivial solutions, and to express these problems using standard mathematical notation, including ∇ operators and tensor fields.

Extending the Boundaries: Novel Concepts in Geometric Exploration

DeepMath-generate demonstrates the capability to generate novel mathematical problems extending beyond traditional Riemannian geometry by incorporating concepts from synthetic curvature. This involves formulating problems related to [latex] \text{Synthetic Curvature} [/latex] and [latex] \text{Synthetic Ricci Curvature} [/latex], which are approaches to defining curvature properties on metric spaces that may not admit a smooth Riemannian structure. These generated problems explore the geometric implications of these synthetic definitions, allowing for investigations into spaces where standard curvature calculations are inapplicable and requiring alternative analytical techniques. The system’s output, therefore, facilitates research into broader geometric frameworks and provides test cases for validating new theorems and computational methods in non-smooth settings.

DeepMath-generate is capable of constructing mathematical problems centered on advanced geometric structures, specifically including Kähler manifolds and exotic spheres. A Kähler manifold is a complex manifold equipped with a Riemannian metric compatible with the complex structure and a symplectic form. These manifolds possess properties linking complex, Riemannian, and symplectic geometry, making them significant in areas like string theory and algebraic geometry. Exotic spheres, denoted as [latex]S^n[/latex] where [latex]n \neq 4[/latex], are smooth manifolds that are homeomorphic but not diffeomorphic to the standard [latex]n[/latex]-dimensional sphere; their existence relies on results in surgery theory and provides insight into the classification of manifolds. The system’s ability to generate problems involving these structures demonstrates its capacity to move beyond standard Euclidean or simpler Riemannian geometries.

DeepMath-generate is capable of formulating problems pertaining to the Soul Theorem, a result in Riemannian geometry concerning the structure of complete manifolds. Specifically, the system generates problems related to the existence and properties of souls – closed, totally geodesic subsets of a complete manifold. Additionally, the system explores problems concerning manifolds with nonnegative curvature, investigating their geometric and topological properties, including the behavior of geodesics and the existence of certain types of sectional curvature. These problems often involve analyzing the fundamental group and the relationship between curvature and topological invariants on such manifolds.

Automated Discovery: Reshaping the Landscape of Mathematical Inquiry

The creation of novel mathematical problems is traditionally a human endeavor, requiring significant expertise and intuition. However, automated problem generation, as demonstrated by systems like DeepMath-generate, offers a pathway to dramatically accelerate research, particularly in complex fields like differential geometry. By systematically exploring the space of possible mathematical statements, these systems can produce challenging problems that push the boundaries of current knowledge. This isn’t simply about increasing the quantity of problems; the automated approach can uncover unexpected connections and formulations that might be overlooked by human researchers, potentially leading to breakthroughs in understanding fundamental mathematical structures and opening new avenues of inquiry into areas such as [latex] \text{Riemannian manifolds} [/latex] and related geometric domains. The capacity to consistently generate such problems promises to reshape the research landscape, fostering a more dynamic and efficient process of mathematical discovery.

The automated system DeepMath-generate has successfully produced a substantial collection of 665 original research problems within the complex field of differential geometry. This achievement signifies a pivotal step towards computational creativity in mathematics, moving beyond simply solving existing problems to actively generating new ones. These aren’t trivial variations; each problem represents a unique challenge, formulated by the system’s algorithms and designed to potentially expand the boundaries of current mathematical understanding. The sheer volume of generated problems allows researchers to explore a significantly broader landscape of possibilities than would be feasible through manual creation, potentially accelerating discovery and uncovering previously inaccessible areas of mathematical inquiry. The system’s output isn’t merely a demonstration of algorithmic capability, but a valuable resource for the mathematical community, offering a wealth of new avenues for investigation into [latex]\text{manifolds}[/latex], [latex]\text{curvature}[/latex], and related concepts.

The capacity of DeepMath-generate extends beyond simple problem creation; it offers a unique avenue for mathematical discovery by systematically exploring a vast landscape of possibilities. Unlike human intuition, which is often limited by pre-conceived notions and established patterns, the system can traverse unexplored territories within differential geometry, potentially revealing unexpected relationships between seemingly disparate concepts. This exhaustive search isn’t merely about quantity; by generating and analyzing numerous variations on existing mathematical structures, DeepMath-generate increases the probability of identifying subtle connections that might otherwise remain hidden. The system’s ability to probe these uncharted areas could lead to a deeper understanding of fundamental mathematical principles and inspire new lines of inquiry, ultimately reshaping the field’s theoretical foundations and opening doors to novel applications – for example, in areas related to [latex]\text{Riemannian manifolds}[/latex] or complex geometry.

Researchers are poised to enhance DeepMath-generate’s capabilities by linking it with automated theorem proving systems, creating a powerful synergistic loop where the system not only poses novel conjectures but also attempts to validate them. This integration promises to significantly accelerate the rate of mathematical discovery, moving beyond problem generation to automated verification. Furthermore, the scope of inquiry is expanding beyond the initial focus on differential geometry; investigations are underway to apply DeepMath-generate’s problem-creation abilities to more complex mathematical landscapes, including the study of harmonic maps – functions that preserve distances – and the intricacies of the Heisenberg Group, a non-commutative group with significant applications in physics and signal processing. These explorations aim to reveal whether the principles underlying successful problem generation in one area of mathematics can be generalized to unlock new challenges and insights in seemingly disparate fields.

The pursuit of novel research problems, as demonstrated by DeepMath-generate, necessitates a ruthless pruning of complexity. It’s not about generating endless variations, but identifying the core essence of a challenge. As Donald Davies observed, “Simplicity is prerequisite for reliability.” This aligns perfectly with the agent’s function; the system doesn’t aim to solve existing problems, but to formulate genuinely new ones in differential geometry. The value lies not in the quantity of problems produced, but in their conceptual clarity and potential to advance the field, embodying the principle that true understanding is revealed through subtraction, not addition. The agent seeks to distill mathematical inquiry to its most fundamental form.

Where Do We Go From Here?

The demonstration that a Large Language Model can formulate, rather than merely solve, problems in differential geometry is, predictably, not a revelation about mathematics. It is a revelation about the architecture of information. The agent, DeepMath-generate, does not understand nonnegative curvature; it manipulates symbols according to probabilistic rules. Yet, novelty emerges. This suggests that the appearance of creativity is not a prerequisite for its existence, merely a consequence of sufficient structural complexity. The relevant question is not whether the machine is ‘thinking’, but whether its outputs are, to a human observer, interesting.

Limitations remain stark. The agent’s capacity for genuine abstraction-for identifying genuinely new lines of inquiry, rather than remixing existing tropes-is, at present, limited by the corpus upon which it was trained. Future work must address this, perhaps through mechanisms for self-directed exploration and the incorporation of external validation. More subtly, the very definition of ‘interesting’ is subject to human bias. A truly autonomous system would require a metric for novelty independent of human evaluation-a chilling prospect, and one that raises questions about the ultimate purpose of such inquiry.

The path forward is not toward artificial intelligence, but toward artificial amplification. The tool does not replace the mathematician; it extends their reach. The challenge now is to refine the interface between human intuition and algorithmic exploration, and to accept that the most profound discoveries may arise not from insight, but from the systematic exhaustion of possibility.


Original article: https://arxiv.org/pdf/2603.18813.pdf

Contact the author: https://www.linkedin.com/in/avetisyan/

See also:

2026-03-21 03:13