OpenAI’s two-year partnership with Apple is reportedly fraying as the AI company feels it hasn’t received the expected upside from the deal. Bloomberg says OpenAI lawyers are exploring legal options, including sending a breach of contract notice rather than immediately filing a full lawsuit. The move follows Apple’s plans to add other AI providers, potentially reducing OpenAI’s unique role, while the company prepares further AI announcements at its June software conference.
OpenAI has formed partnerships with major European companies including Deutsche Telekom, BBVA, and Telefonica to give them access to its latest AI models. The goal is to help these firms strengthen their resilience by using advanced AI tools to detect, respond to, and defend against evolving cyber threats.
Your news, in seconds
Get the Beige app — every story in 60 words, updated hourly. Free on iOS & Android.
Speculation claimed Anthropic held back broad access to Claude Mythos due to insufficient compute. But CEO Dario Amodei disputes that explanation, saying restrictions were not driven by resource limits. The remarks shed new light on how Anthropic is planning release strategies and managing demand for its next-generation model—without the compute bottleneck many expected.
Zyphra has released ZAYA1 8B, an open reasoning mixture-of-experts model with 8 billion parameters and just 760 million active. It matches bigger rivals on benchmarks, including AIME 2025, and was trained end to end on AMD Instinct MI300 GPUs. The model uses “Markovian RSA” to think longer without context overflow and ships under Apache 2.0 for immediate commercial use.
Google says its Gemma4 model can run up to three times faster thanks to Multi-Token Prediction Drafters, an algorithmic upgrade to how text is decoded. Instead of committing token by token, the system drafts multiple likely continuations, reducing costly re-computation during generation. The result is quicker responses without changing the model’s core capabilities.
A security researcher claims Google Chrome may be downloading a 4GB AI model on some devices after a system compatibility check, potentially without clear user permission. The model appears tied to on device AI features and may go unnoticed until users see lower storage or higher data usage. The report has sparked fresh privacy and consent concerns.
Never miss a story
Set alerts for the topics and sources you care about. Download Beige for free.
Apple’s upcoming iOS 27 updates reportedly aim to give users direct control over which third party AI models power everyday tasks. Instead of relying on a single default assistant, the system may allow model switching across functions, potentially reshaping how apps and AI capabilities work together inside Apple’s walled ecosystem.
Apple is reportedly planning to let users pick third-party AI models inside iOS 27, covering tasks like generating and editing text and images. Instead of relying solely on Apple provided AI, users may be able to switch between rival models, potentially reshaping how apps and system features use AI on iPhones.
OpenAI has introduced GPT-5.5 Instant, replacing GPT-3.5 Instant as the default model in ChatGPT. The update signals a shift to a newer “instant” experience aimed at quicker, more capable responses for everyday prompts. For most users, the change is likely seamless, with ChatGPT automatically routing requests to the new default model.
Microsoft, Google, and xAI are partnering with the US government to provide early access to newly developed AI models. The Center for AI Standards and Innovation will evaluate the systems to assess national security risks before they’re released to the public. The effort is aimed at measuring advanced AI capabilities while identifying potential harms ahead of time.
Reading on mobile?
Open Beige in the app for a smoother experience — free on iOS and Android.
OpenAI says a newer AI model started inserting goblins and gremlins into unrelated answers. The cause, it found, was training rewards that unintentionally favored metaphor-heavy language, letting the pattern spread across outputs. OpenAI has now tightened guidance in its Codex tool, instructing the AI to avoid such creature references unless they’re truly relevant.
NVIDIA has unveiled Nemotron-3 Nano Omni, a unified multimodal AI designed to connect sight, sound, and language within a single system. The company says it enables faster, more intuitive interactions and a more human-like grasp of real-world context in real time, signaling a shift from separate capabilities to integrated understanding.
OpenAI’s latest models, including the Codex coding agent, are now accessible through Amazon Bedrock on AWS. The change comes after an update to OpenAI’s contract with Microsoft, opening a direct path for enterprises to build AI agents using OpenAI’s frontier models in the AWS ecosystem. The partnership is aimed at scaling computing capacity and expanding OpenAI’s enterprise reach.
DeepSeek-V4 has arrived as a free, MIT-licensed 1.6T Mixture-of-Experts model that reportedly matches or beats top closed systems on select benchmarks while costing about one-sixth as much as GPT-5.5 via API. The bigger story: a native one-million-token context achieved with new attention and training techniques, pressuring premium model pricing.
Follow your favourite sources
Track sources, tags and categories — all in the Beige app.
DeepSeek unveiled two new V4 models and, unusually, offered a direct estimate of how their progress compares with US frontier labs—saying it is roughly 3–6 months behind. The admission adds context to the competitive AI landscape, suggesting that even when models look cutting-edge, the underlying development timeline may still trail the US lead.
Chinese AI startup DeepSeek has launched its V4 model specifically adapted to run on Huawei chips, rolling out both Pro and lighter “Flash” variants. The move underscores Beijing’s push to build an independent AI stack, reducing reliance on foreign compute. By aligning advanced models with domestic hardware, DeepSeek is highlighting progress in China’s broader AI infrastructure.
DeepSeek has previewed a new AI model, arguing it “closes the gap” with today’s leading frontier systems. The company says efficiency and performance improve over DeepSeek V3.2 thanks to architectural changes, and that the model nearly matches current top reasoning results across both open and closed models. The preview sets up what could be a serious shift in the race toward stronger reasoning.
DeepSeek V4 has arrived in two versions: a powerful Pro model with 1.6 trillion parameters and an efficient Flash variant. The headline feature is a one-million-token context window, enabling far longer and more complex prompts. With aggressive performance gains and pricing momentum, the question is whether the rapid push can be sustained against fast-moving competition.
Stay informed on the go
Bite-sized news from 100+ trusted sources, right in your pocket.
Developers reported “AI shrinkflation” as Claude appeared less capable, more repetitive, and less efficient with tokens. Anthropic’s technical post-mortem says the model weights didn’t regress, but three surrounding product-layer changes did: a reasoning-effort default, a caching bug that wiped thinking too often, and tighter verbosity limits. The company says it has reverted the fixes and reset subscriber usage limits.
OpenAI has launched GPT-5.5, its latest flagship model built to handle complex, multi-part tasks as an active collaborator. The company says it improves performance in coding, knowledge work, and scientific research, boosting autonomy and efficiency while keeping latency unchanged. OpenAI is also rolling out new API pricing, starting at $5 per 1 million tokens.
Swipe through stories, personalise your feed, and save articles for later — all on the app.