Open Source

Styx: Open-Source AI Gateway Auto-Picks the Cheapest Model for Each Request

March 9, 2026 2 min read

What happens when you stop paying frontier-model prices for questions a cheap model handles just fine?

That is the pitch behind Styx, a new open-source AI gateway that sits between your application and providers like OpenAI, Anthropic, Google, and Mistral. You send requests to a single endpoint, and Styx figures out which model should handle them.

The core feature is styx:auto. Set your model parameter to that value, and a 9-signal classifier analyzes each prompt's complexity before routing it. Simple factual questions go to budget models running around $0.15 per million tokens. Complex code generation or multi-step reasoning gets forwarded to the expensive frontier models. The idea is straightforward: most production AI traffic does not need GPT-4-class intelligence, so stop paying for it on every call.

Styx also claims to be the first gateway with native MCP (Model Context Protocol) support built in, which matters if you are building agent workflows that need to connect to external tools and data sources through the emerging MCP standard.

How It Compares to Existing Options

The closest alternatives are LiteLLM (open-source, model translation layer) and OpenRouter (hosted routing service). LiteLLM gives you a unified API across providers but does not auto-select models. OpenRouter offers routing but runs as a hosted service, not self-hosted. Styx tries to combine both: unified API, intelligent routing, and full self-hosting.

The project is early-stage and available on GitHub. There is no pricing because it is entirely self-hosted - you bring your own API keys for each provider. That also means you are responsible for running and maintaining it, which adds operational overhead compared to a hosted service like OpenRouter.

For teams running high-volume AI workloads where a significant chunk of requests are simple lookups or classifications, auto-routing could meaningfully reduce costs. The 9-signal classifier approach is more sophisticated than naive keyword matching, though real-world accuracy will depend heavily on your specific use cases. If your traffic is almost entirely complex reasoning, the routing overhead adds latency for little savings.

How It Compares to Existing Options

Related Tools

More from today

CodeGraph Cuts Claude Code Token Usage by 30% With Local Code Indexing

Andrew Ng's Context Hub Gives AI Coding Agents Persistent Memory for APIs

IBM's Granite 4.0 Speech Model Fits 6 Languages in 1 Billion Parameters

Cookie Preferences