A Harvard-backed study tests large language models across multiple medical scenarios, including real emergency room cases. The researchers report that at least one AI model performed more accurately than two human doctors, raising questions about how these systems could support urgent care workflows and where their limits may still be.
Swipe through stories, personalise your feed, and save articles for later — all on the app.