“I was curious to establish a baseline for when LLMs are effectively able to solve open math problems compared to where they ...
On Friday, research organization Epoch AI released FrontierMath, a new mathematics benchmark that has been turning heads in the AI world because it contains hundreds of expert-level problems that ...
A research team affiliated with UNIST has unveiled a novel AI system capable of grading and providing detailed feedback on even the most untidy handwritten math answers—much like a human instructor.
Baidu's ERNIE-5.0-0110 ranks #8 globally on LMArena, becoming the only Chinese model in the top 10 while outperforming ...
Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now If you haven’t heard of “Qwen2” it’s ...
Researchers have introduced Light-R1-32B, a new open-source AI model optimized to solve advanced math problems. It is now available on Hugging Face under a permissive Apache 2.0 license — free for ...
Google DeepMind, Google LLC’s artificial intelligence research unit, today unveiled two new AI models that are capable of advanced mathematical reasoning for solving complex math problems, which ...
Microsoft has potentially made a breakthrough with small language models (SLMs) after the recent development of a new reasoning technique dubbed rStar-Math. For context, the technique enhances the ...