A startup behind AI IQ is converting dozens of frontier language models into an estimated human-style IQ, complete with an added “emotional intelligence” score and cost-performance views. The charts are praised for clarity, but slammed for implying precision from uneven, “jagged” capabilities and for methodological choices critics say may skew results. Meanwhile, enterprises are using the framework to route models by task and price.
DeepSeek V4 has arrived in two versions: a powerful Pro model with 1.6 trillion parameters and an efficient Flash variant. The headline feature is a one-million-token context window, enabling far longer and more complex prompts. With aggressive performance gains and pricing momentum, the question is whether the rapid push can be sustained against fast-moving competition.
Your news, in seconds
Get the Beige app — every story in 60 words, updated hourly. Free on iOS & Android.
Swipe through stories, personalise your feed, and save articles for later — all on the app.