LIVORNO, Italy – Livorno Elementary Middle School students gathered at the LEMS Media Center on March 13 to lead an immersive math modeling activity.
Researchers have introduced Light-R1-32B, a new open-source AI model optimized to solve advanced math problems. It is now available on Hugging Face under a permissive Apache 2.0 license — free for ...
The GSMM Camp is a weeklong workshop that builds interdisciplinary problem-solving skills for graduate and advanced undergraduate students. Participants work in teams on mathematically rich problems ...
Microsoft found that small language models can exceed the performance of much larger ones when trained to specialize in a single area. Researchers fine-tuned the Mistral 7B model to create Orca-Math, ...
New benchmark study results show leading AI models, including ChatGPT, Claude, and Gemini, still lag humans in visual math reasoning.
VUB's Data Analytics Lab has published new results showing that it is possible to develop original mathematical proofs using commercial language models. In a paper posted to the arXiv preprint server, ...
Chinese AI lab DeepSeek has quietly updated Prover, its AI model that’s designed to solve math-related proofs and theorems. According to South China Morning Post, DeepSeek uploaded the latest version ...
Carina Hong, the 24-year-old founder and CEO of Axiom Math, created the company in March 2025 and has recruited a team of ten employees, most of them from Meta, to build a math-focused AI model. Last fall, Carina Hong ...
The Register on MSN

AI models still suck at math

Just less than before, according to the ORCA test. Exclusive: Current-day LLMs are prediction engines and, as such, they can only find the most likely solution to problems, which is not necessarily the ...
Alan Veliz-Cuba has received funding from the Simons Foundation and the American Mathematical Society for some of his research. You can probably think of a time when you’ve used math to solve an ...