Learned routing
Thompson sampling chooses per-request, not per-config. Winners get more traffic; losers get less. No static rules to maintain.
BrainstormRouter routes every completion through a learning engine with verifiable agent identity, budget enforcement, and memory-aware fallback. One API, 31 curated models, 8 providers — with receipts.
Every request feeds Thompson sampling, validity scoring, and a cost/quality frontier that re-ranks providers in real time. Every response ships with a receipt: which model, which tier, what it cost, what it would have cost.
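To make the per-request behaviour concrete, here is a minimal sketch of Thompson sampling over two candidate models. It assumes a simple success/failure reward and a Beta(1, 1) prior; the model names, the reward definition, and the `simulate` helper are illustrative assumptions, not BrainstormRouter's actual engine.

```python
import random


class BetaArm:
    """Beta-Bernoulli posterior over one model's success rate (hypothetical reward)."""

    def __init__(self, name):
        self.name = name
        self.successes = 1  # Beta(1, 1) uniform prior
        self.failures = 1

    def sample(self, rng):
        # Draw a plausible success rate from the current posterior.
        return rng.betavariate(self.successes, self.failures)

    def update(self, ok):
        if ok:
            self.successes += 1
        else:
            self.failures += 1


def choose(arms, rng):
    """Per-request choice: sample every arm once, route to the highest draw."""
    return max(arms, key=lambda arm: arm.sample(rng))


def simulate(true_rates, requests=2000, seed=0):
    """Route simulated requests and count how much traffic each model receives."""
    rng = random.Random(seed)
    arms = [BetaArm(name) for name in true_rates]
    traffic = {name: 0 for name in true_rates}
    for _ in range(requests):
        arm = choose(arms, rng)
        traffic[arm.name] += 1
        # Simulated outcome; a real router would score the actual response.
        arm.update(rng.random() < true_rates[arm.name])
    return traffic


# "model-a" and "model-b" are placeholder names with made-up success rates.
traffic = simulate({"model-a": 0.9, "model-b": 0.6})
```

Because each request draws fresh samples, the weaker model still receives occasional traffic, which is how the router can notice if it later improves.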
Short-lived mTLS certs via the Cryptographic Agent Framework. Every agent signs in; every call carries a provable origin.
SOUL, MEMORY, SKILL, HEARTBEAT, WORKSPACE, REFERENCE — six attributes, 4000 tokens, injected consistently across providers.
The shadow comparator logs what static routing would have spent. The savings widget shows the delta, live.
Per-agent, per-tenant daily caps. Budget forecaster predicts burn and degrades to cheaper tiers before you hit a ceiling.
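One way to picture the forecaster is a linear burn projection that picks a cheaper tier before the cap is reached. The sketch below is an assumption-laden illustration: the tier names, the 1.5x threshold, and the linear projection are hypothetical and are not taken from the product.

```python
# Hypothetical tier names, ordered from most to least expensive.
TIERS = ["premium", "standard", "economy"]


def forecast_daily_spend(spent_usd, hours_elapsed):
    """Project spend to the end of a 24-hour day, assuming a constant burn rate."""
    if hours_elapsed <= 0:
        return 0.0
    return spent_usd / hours_elapsed * 24.0


def pick_tier(spent_usd, hours_elapsed, daily_cap_usd):
    """Return a tier name, or None once the daily cap is already exhausted."""
    if spent_usd >= daily_cap_usd:
        return None  # hard cap reached: the caller would reject the request
    ratio = forecast_daily_spend(spent_usd, hours_elapsed) / daily_cap_usd
    if ratio <= 1.0:
        return TIERS[0]  # on track to stay under the cap
    if ratio <= 1.5:     # assumed threshold for a mild overshoot
        return TIERS[1]
    return TIERS[2]      # heavy projected overshoot: cheapest tier
```

For example, an agent that has spent $6 of a $10 cap by midday projects to $12 and would be moved to the middle tier under these assumed thresholds.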
pgvector + in-memory hybrid at a 90% similarity threshold. Streaming responses get reconstructed from cache, not just JSON.
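The in-memory half of such a cache can be sketched as a cosine-similarity lookup with a 0.90 threshold that stores the streamed chunks so a hit can be replayed as a stream. This is only an illustration: the `SemanticCache` class and its methods are hypothetical, embeddings are assumed to be supplied by the caller, and the pgvector tier is omitted.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class SemanticCache:
    """Hypothetical in-memory cache keyed by prompt embedding."""

    def __init__(self, threshold=0.90):
        self.threshold = threshold
        self.entries = []  # list of (embedding, streamed chunks)

    def put(self, embedding, chunks):
        self.entries.append((list(embedding), list(chunks)))

    def get(self, embedding):
        """Return the chunks of the most similar entry at or above the threshold."""
        best, best_score = None, self.threshold
        for stored, chunks in self.entries:
            score = cosine(embedding, stored)
            if score >= best_score:
                best, best_score = chunks, score
        return best

    def replay(self, embedding):
        """Yield cached chunks one by one to mimic a stream; yields nothing on a miss."""
        for chunk in self.get(embedding) or []:
            yield chunk


cache = SemanticCache()
cache.put([1.0, 0.0], ["Hel", "lo"])
```

Storing chunks rather than the joined text is what lets a cache hit be served to a streaming client in the same shape as a live response.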
Thirteen intelligence systems across five pillars. Every pillar is wired into the hot path — no dashboards without runtime, no runtime without receipts.
Upload identity → profile, HR record, budget, workspace, JWT. Zero admin ceremony.
5-minute mTLS certificates signed by BR's CA. Agent JWTs with tenant scope.
Bandit chooses, cascade recovers, validity scores, frontier re-ranks.
Per-tenant usage, daily caps, pre-request anomaly, counterfactual savings.
Every response carries headers for cost, savings, phase prediction, and efficiency score.
The shadow comparator records BR’s Thompson decision alongside what a fixed price-only or quality-only router would have picked.
Welch’s t-test, Cohen’s d, and a win-rate confidence interval are exposed over /v1/intelligence/benchmark.
An X-BR-Routing-Savings header ships on every response.
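For readers unfamiliar with the statistics named above, the following sketch computes them from two lists of per-request costs using only the standard library. The function names are hypothetical, and the choice of a Wilson score interval for the win rate is an assumption; the source does not say which interval the benchmark endpoint uses.

```python
import math
from statistics import mean, variance


def welch_t(a, b):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom."""
    va, vb = variance(a) / len(a), variance(b) / len(b)
    t = (mean(a) - mean(b)) / math.sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


def cohens_d(a, b):
    """Cohen's d using the pooled sample standard deviation."""
    na, nb = len(a), len(b)
    pooled = math.sqrt(
        ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    )
    return (mean(a) - mean(b)) / pooled


def win_rate_interval(wins, trials, z=1.96):
    """Wilson score interval for a win rate (assumed interval type, ~95% at z=1.96)."""
    p = wins / trials
    denom = 1 + z * z / trials
    centre = (p + z * z / (2 * trials)) / denom
    half = z * math.sqrt(p * (1 - p) / trials + z * z / (4 * trials * trials)) / denom
    return centre - half, centre + half


# Made-up per-request costs in USD, for illustration only.
static_costs = [0.0050, 0.0052, 0.0049, 0.0051]
learned_costs = [0.0040, 0.0041, 0.0039, 0.0040]
t_stat, dof = welch_t(static_costs, learned_costs)
effect = cohens_d(static_costs, learned_costs)
low, high = win_rate_interval(80, 100)
```

A positive t statistic and effect size here mean the static baseline cost more than the learned router on the sample data.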
# response headers returned
X-BR-Model: claude-sonnet-4-6
X-BR-Estimated-Cost: $0.00412
X-BR-Actual-Cost: $0.00389
X-BR-Routing-Savings: $0.00124   # vs. static price-only
X-BR-Guardian-Status: ok
X-BR-Phases: planning,execution
X-BR-Efficiency: 0.94
X-BR-Guardian-Overhead-Ms: 0.7

# link headers for AI discoverability
Link: </openapi.yaml>; rel="describedby"; type="application/yaml"
Link: </llms.txt>; rel="alternate"; type="text/plain"
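A client could turn these headers into a structured receipt along the following lines. The `parse_receipt` helper is hypothetical, and the value formats (a leading `$` on costs, comma-separated phases) are inferred from the sample above rather than from a published schema.

```python
def parse_receipt(headers):
    """Convert X-BR-* response headers into typed fields (formats assumed from the sample)."""

    def usd(key):
        # Costs appear as "$0.00389" in the sample; strip the currency sign.
        return float(headers[key].lstrip("$"))

    return {
        "model": headers["X-BR-Model"],
        "estimated_cost": usd("X-BR-Estimated-Cost"),
        "actual_cost": usd("X-BR-Actual-Cost"),
        "routing_savings": usd("X-BR-Routing-Savings"),
        "phases": headers["X-BR-Phases"].split(","),
        "efficiency": float(headers["X-BR-Efficiency"]),
    }


# Header values copied from the sample response above.
sample_headers = {
    "X-BR-Model": "claude-sonnet-4-6",
    "X-BR-Estimated-Cost": "$0.00412",
    "X-BR-Actual-Cost": "$0.00389",
    "X-BR-Routing-Savings": "$0.00124",
    "X-BR-Phases": "planning,execution",
    "X-BR-Efficiency": "0.94",
}
receipt = parse_receipt(sample_headers)
```

With an HTTP client, the `headers` argument would typically be the response's header mapping; lookups there are usually case-insensitive, whereas the plain dict used here is not.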
Drop-in compatible with the OpenAI SDK. Free tier includes 100k tokens, all 13 intelligence systems, full receipts.