Inductive Reasoning Math

These Mathematicians Are Putting A.I. to the Test

Large language models struggle to solve research-level math questions. It takes a human to assess just how poorly they ...

EurekAlert!

Achieving >97% on GSM8K: Deeply understanding the problems makes LLMs better solvers for math word problems

Chain-of-Thought (CoT) prompting has enhanced the performance of Large Language Models (LLMs) across various reasoning tasks.

A New AI Math Startup Just Cracked 4 Previously Unsolved Problems

Axiom says its AI found solutions to several long-standing math problems, a sign of the technology’s steadily advancing reasoning capabilities.

A digital game improves the mathematical performance of children with dyscalculia

Dyscalculia, characterized by deficits in number sense and calculation skills, affects approximately 5%–7% of the population ...

Geeky Gadgets

DeepSeek V4 Leaked : Coding-First Model Aims at Devs with New Memory & Reasoning AI

What if the next leap in AI wasn’t just about generating code but about truly understanding it? Below, Universe of AI takes you through how the leaked details of DeepSeek V4 suggest a bold ...

EurekAlert!

MathEval: a comprehensive benchmark for evaluating large language models on mathematical reasoning capabilities

This study introduces MathEval, a comprehensive benchmarking framework designed to systematically evaluate the mathematical reasoning capabilities of large language models (LLMs). Addressing key ...

SiliconANGLE

Harmonic AI raises $120M at $1.45B valuation to advance mathematical reasoning

Artificial intelligence for formal mathematical reasoning startup Harmonic AI Inc. announced today that it has raised $120 million in new funding on a $1.45 billion valuation. The funding is intended ...

Business Wire

Harmonic Builds Momentum Towards Mathematical Superintelligence with $120 Million Series C

Ribbit Capital Leads Round at $1.45B Valuation of Math-Based AI Venture; Emerson Collective Joins Existing Backers Including Sequoia & Kleiner Perkins PALO ALTO, Calif.--(BUSINESS WIRE)--Harmonic, the ...

Morningstar

APTO Releases Training Dataset to Enhance the Mathematical Reasoning Capabilities of Large Language Models (LLMs)

TOKYO, Sept. 30, 2025 /PRNewswire/ -- As generative AI use continues to increase, accuracy has become the most important metric and a key factor in decisions around adoption and utilization. APTO is ...

Nasdaq

Baidu Unveils Reasoning Model ERNIE X1.1 with Upgrades in Key Capabilities

ERNIE X1.1 shows major advancements in factuality, instruction following, and agentic capabilities; it surpasses DeepSeek R1-0528 in overall performance while performing on par with top-tier models ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results