Download the app
← Latest news

Sakana’s RL Conductor trains a tiny 7B model to orchestrate GPT Claude and Gemini at low cost

Technology
Published on 8 May 2026
Sakana’s RL Conductor trains a tiny 7B model to orchestrate GPT Claude and Gemini at low cost

It writes the workflow in plain language

Sakana AI says its RL Conductor turns a small 7B model into an “orchestra conductor” that dynamically routes tasks across GPT-5, Claude Sonnet 4, Gemini 2.5 Pro, and open-source workers. Instead of hardcoded pipelines like LangChain, it learns coordination via reinforcement learning—cutting tokens and API calls while boosting reasoning and coding benchmark scores. The tech now powers Sakana Fugu’s enterprise API.

  • RL Conductor replaces rigid hardcoded agent pipelines with dynamic orchestration
  • A 7B model coordinates multiple frontier and open-source LLMs
  • It cuts token usage dramatically versus prior multi-agent routers
  • Sakana Fugu productizes the approach via an OpenAI-compatible API
Read the full story at Venture Beat

This summarization was done by Beige for a story published on Venture BeatVenture Beat

The full experience is on mobile.

Swipe through stories, personalise your feed, and save articles for later — all on the app.