mixture of experts

Zyphra launches ZAYA1 8B open reasoning model trained entirely on AMD Instinct MI300 GPUs

Zyphra has released ZAYA1 8B, an open reasoning mixture-of-experts model with 8 billion parameters and just 760 million active. It matches bigger rivals on benchmarks, including AIME 2025, and was trained end to end on AMD Instinct MI300 GPUs. The model uses “Markovian RSA” to think longer without context overflow and ships under Apache 2.0 for immediate commercial use.

Venture Beat

·Published by Beige· on 8 May 2026

Summarised by Beize from a story on Venture Beat on 8 May 2026

DeepSeek V4 slashes AI costs to one sixth of GPT 5.5 yet nears frontier intelligence

DeepSeek-V4 has arrived as a free, MIT-licensed 1.6T Mixture-of-Experts model that reportedly matches or beats top closed systems on select benchmarks while costing about one-sixth as much as GPT-5.5 via API. The bigger story: a native one-million-token context achieved with new attention and training techniques, pressuring premium model pricing.

Venture Beat

·Published by Beige· on 24 Apr 2026

Summarised by Beize from a story on Venture Beat on 24 Apr 2026

Your news, in seconds

Get the Beige app — every story in 60 words, updated hourly. Free on iOS & Android.

App Store Play Store

Page 1

mixture of experts

Zyphra launches ZAYA1 8B open reasoning model trained entirely on AMD Instinct MI300 GPUs

DeepSeek V4 slashes AI costs to one sixth of GPT 5.5 yet nears frontier intelligence

The full experience is on mobile.