Last week, OpenAI shocked the mathematical community by revealing that one of its internal artificial intelligence (AI) ...
The result is correct but challenges core norms of mathematics: checking proofs, crediting ideas and keeping research open to everyone.
Mathematician Will Sawin discusses his experience reviewing and refining a mathematical proof devised by OpenAI's internal model—and what that could mean for mathematics.
The second batch of “First Proof” problems is meant to evaluate AI’s usefulness for research-level math. The best model got six or seven of the ten questions right.
Researchers have introduced Light-R1-32B, a new open-source AI model optimized to solve advanced math problems. It is now available on Hugging Face under a permissive Apache 2.0 license — free for ...
DeepSeek made waves in early 2025, launching one of the world's first free-to-access thinking models. Now, the Chinese firm has just released DeepSeekMath-V2 with the objective of achieving ...
Large language models have been especially weak at math. There are now several papers from Google DeepMind, Alibaba and various universities where large language models are at Math Olympiad ...
Microsoft found that small language models can exceed the performance of much larger ones when trained to specialize in a single area. Researchers fine-tuned the Mistral 7B model to create Orca-Math, ...
If you haven’t heard of “Qwen2” it’s ...