Public beta · governed AI routing

The governed control plane
for autonomous AI.

BrainstormRouter routes every completion through a learning engine with verifiable agent identity, budget enforcement, and memory-aware fallback. One API, 31 curated models, 8 providers — with receipts.

Start routing — free tier See the architecture

Models

curated, ranked

Providers

routed on merit

Intelligence

wired systems

Overhead

<1ms

p95 guardian

Why BrainstormRouter

Most gateways are pass-through.
This one learns.

Every request feeds Thompson sampling, validity scoring, and a cost/quality frontier that re-ranks providers in real time. Every response ships with a receipt: which model, which tier, what it cost, what it would have cost.

Learned routing

Thompson sampling chooses per-request, not per-config. Winners get more traffic; losers get fewer. No static rules to maintain.

Verifiable identity

Short-lived mTLS certs via the Cryptographic Agent Framework. Every agent signs in; every call carries a provable origin.

Memory-aware context

SOUL, MEMORY, SKILL, HEARTBEAT, WORKSPACE, REFERENCE — six attributes, 4000 tokens, injected consistently across providers.

Counterfactual receipts

The shadow comparator logs what static routing would have spent. The savings widget shows the delta, live.

Budget enforcement

Per-agent, per-tenant daily caps. Budget forecaster predicts burn and degrades to cheaper tiers before you hit a ceiling.

Semantic cache

pgvector + in-memory hybrid at a 90% similarity threshold. Streaming responses get reconstructed from cache, not just JSON.

Five governance pillars

The control plane, not the crayon box.

Thirteen intelligence systems across five pillars. Every pillar is wired into the hot path — no dashboards without runtime, no runtime without receipts.

Identity

SOUL-first bootstrap

Upload identity → profile, HR record, budget, workspace, JWT. Zero admin ceremony.

Auth

Cryptographic Agent Framework

5-minute mTLS certificates signed by BR's CA. Agent JWTs with tenant scope.

Runtime

Thompson sampling + cascade

Bandit chooses, cascade recovers, validity scores, frontier re-ranks.

Economics

Budget forecaster & guardian

Per-tenant usage, daily caps, pre-request anomaly, counterfactual savings.

Evidence

Receipts & shadow comparator

Every response headers cost, savings, phase prediction, efficiency score.

Counterfactual proof

Every request knows
what static routing would’ve cost.

The shadow comparator records BR’s Thompson decision alongside what a fixed price-only or quality-only router would have picked. Welch’s t-test, Cohen’s d, win-rate confidence interval — exposed over /v1/intelligence/benchmark.

Savings surface on the dashboard and via the X-BR-Routing-Savings header on every response.

POST /v1/chat/completions

# request headers returned
X-BR-Model:              claude-sonnet-4-6
X-BR-Estimated-Cost:      $0.00412
X-BR-Actual-Cost:         $0.00389
X-BR-Routing-Savings:     $0.00124  # vs. static price-only
X-BR-Guardian-Status:     ok
X-BR-Phases:              planning,execution
X-BR-Efficiency:          0.94
X-BR-Guardian-Overhead-Ms: 0.7

# link headers for AI discoverability
Link: </openapi.yaml>; rel="describedby"; type="application/yaml"
Link: </llms.txt>; rel="alternate"; type="text/plain"

Stop paying for pass-through.
Route with evidence.

Drop-in compatible with the OpenAI SDK. Free tier includes 100k tokens, all 13 intelligence systems, full receipts.

Start free Read the SDK docs

The governed control plane for autonomous AI.

Most gateways are pass-through. This one learns.