Google DeepMind has released a new AI co mathematician designed to help solve complex problems, and it reportedly set a fresh record on the FrontierMath benchmark. The result signals faster progress toward reliable machine-assisted reasoning in coding and mathematics, as leading labs increasingly test models on tougher evaluation tasks.
Anthropic released results from BioMysteryBench, a new evaluation set for bioinformatics tasks. Its models reportedly solved about 30% of problems that stumped human scientists, suggesting AI is moving beyond math and coding into biological research workflows. The announcement highlights growing confidence in model-driven discovery, while raising questions about real-world usefulness and benchmarks.
Your news, in seconds
Get the Beige app — every story in 60 words, updated hourly. Free on iOS & Android.
Passive mutual funds may be collecting more assets, but the real challenge is investor expectations. Passive investing tracks benchmarks, meaning you forgo the opportunity to earn alpha through outperformance, even after costs. Active managers can still generate alpha despite expenses, while passive funds mainly protect investors from falling behind—reducing the appeal for those seeking upside.
DeepSeek has previewed a new AI model, arguing it “closes the gap” with today’s leading frontier systems. The company says efficiency and performance improve over DeepSeek V3.2 thanks to architectural changes, and that the model nearly matches current top reasoning results across both open and closed models. The preview sets up what could be a serious shift in the race toward stronger reasoning.
Thirteen equity mutual funds failed to beat their respective benchmarks across three years, according to the analysis. The standout concern is downside behavior: several funds recorded down capture ratios above 120, suggesting they fall more than their benchmarks during market declines, pointing to weaker risk management when conditions turn bearish.
Swipe through stories, personalise your feed, and save articles for later — all on the app.