Microsoft researchers warn that “delegated work” with frontier LLMs can quietly degrade documents across long, iterative workflows. Using the DELEGATE-52 benchmark across 52 domains, they found top models corrupt about 25% of document content after 20 rounds. Worse, agentic tools and realistic distractor files increase errors, often via rare but massive distortions humans can miss.
Swipe through stories, personalise your feed, and save articles for later — all on the app.