Mathematical Reasoning One Shot

MathEval: a comprehensive benchmark for evaluating large language models on mathematical reasoning capabilities

This study introduces MathEval, a comprehensive benchmarking framework designed to systematically evaluate the mathematical reasoning capabilities of large language models (LLMs). Addressing key ...

VentureBeat

AI’s math problem: FrontierMath benchmark shows how far technology still has to go

Artificial intelligence systems may be good at generating text, recognizing images, and even solving basic math problems—but when it comes to advanced mathematical reasoning, they are hitting a wall.

Geeky Gadgets

Google DeepMind AlphaProof AI solves advanced reasoning problems in mathematics

At the heart of this breakthrough lies AlphaProof, a sophisticated formal reasoning AI model developed by the brilliant minds at Google DeepMind. This innovative system has demonstrated an ...

National Academies of Sciences, Engineering, and Medicine

AI to Assist Mathematical Reasoning: A Workshop

A National Academies of Sciences, Engineering, and Medicine-appointed ad hoc committee will plan and organize a workshop that will bring together academic, industry, and government stakeholders to ...

SiliconANGLE

Harmonic AI raises $120M at $1.45B valuation to advance mathematical reasoning

Artificial intelligence for formal mathematical reasoning startup Harmonic AI Inc. announced today that it has raised $120 million in new funding on a $1.45 billion valuation. The funding is intended ...

Morningstar

APTO Releases Training Dataset to Enhance the Mathematical Reasoning Capabilities of Large Language Models (LLMs)

TOKYO, Sept. 30, 2025 /PRNewswire/ -- As generative AI use continues to increase, accuracy has become the most important metric and a key factor in decisions around adoption and utilization. APTO is ...

National Academies of Sciences, Engineering, and Medicine

Artificial Intelligence to Assist Mathematical Reasoning: Proceedings of a Workshop

Suggested Citation: "3 Case Studies." National Academies of Sciences, Engineering, and Medicine. 2023. Artificial Intelligence to Assist Mathematical Reasoning ...
