Self-hosted intelligent LLM gateway

Route to the right model.
Prove it with your own tests.
Keep the ledger forever.

E disputatione veritas — from disputation, truth.

Self-hosted LLM routing. OpenAI-compatible. PCI-safe. One endpoint, model:"auto" — and a council of models when the stakes are high. AgoraRouter learns from outcomes your team has already verified — never from vendor benchmarks.

45
MCP tools live in production
9
model families on the panel
0
prompts that leave your perimeter
100%
decisions on an append-only ledger
The market gap
Every smart router is someone else’s cloud.
Every self-hosted gateway is dumb.
AgoraRouter is the only one that is neither.

SaaS routers see every prompt you send and take a cut of your traffic. Open-source proxies stay in your perimeter — but route by static config, learning nothing. We built the missing quadrant: intelligent routing, your infrastructure, regulator-grade audit.

Cui bono — who gains

Three offices. One gateway.

For the CFO

See what you’re overpaying for — before changing anything

The Shadow Routing Report is read-only: it watches your existing spend log and shows which calls burned frontier-model money on trivial work. Nothing is re-routed, nothing changes in production.

“Your own numbers, not vendor promises. Read-only, risk-gated, yours to keep.”
For engineers

Drop-in. OpenAI-compatible. Forget model names.

Point Claude Code, Cursor, Cline or any OpenAI SDK at one base URL and write model:"auto". Eval-driven routing with the CI you already run as the judge — no new evaluation infrastructure to build or maintain.

“Tests passed, not thumbs up. Your existing test suite is the ground truth.”
For compliance

An audit trail that survives the auditor

Every decision lands on an append-only, hash-chained ledger, signed with your Ed25519 key — client-owned, so we can’t touch it either — and verifiable by open-source tooling without trusting us at all.

“Logs can be edited. A ledger cannot. Bring it to the audit as-is.”
Concilium — for decisions that cost money

One mind can be confident.
A council can be right.

For high-stakes calls — a security review, an architecture fork, a payment-flow change — AgoraRouter convenes models from rival families to argue it out, and a facilitator distils the verdict. Disagreement is recorded, not averaged away.

ARCHITECTUS
the architect

Frames the system: boundaries, trade-offs, what breaks at scale.

FABER
the builder

Argues for the shippable path: cost, simplicity, time-to-value.

SCEPTICUS
the protected dissenter

Paid to attack the consensus. Holds a reserved seat the optimizer can never vote off — your built-in challenger function.

MODERATOR
the facilitator

Synthesises: where the panel agrees, where it split, and why — onto the ledger.

Different model families fail differently — that is the point. And when the dissenter’s warning later proves right, the system rehabilitates it: scored on retrospective rightness, never on popularity. Risk teams know this pattern as the challenger function in three lines of defense.

Probatio — named, not generic

Features your auditor can point at

learning

Verified-outcome routing

A Thompson-sampling bandit per task-class, moved only by verified evidence — CI passed, probe confirmed, no broken returns. Satisfaction is stored, but can never touch the weights: enforced in the schema itself.

Others learn offline from benchmarks — or don’t learn at all.
safety of learning

Shadow-first, gated go-live

Every task-class learns in shadow until a calibration gate (κ ≥ 0.7 over 50 verdicts) proves the judge can be trusted. A 5% canary and drift alarms watch the champion after go-live.

No silent drift towards a cheap model writing pretty garbage.
compliance

The PAN tripwire

Luhn + IIN detection on every request and response. A card number in a prompt means the call is blocked and an alarm is raised — because your upstream tokenization just failed.

Not generic “PII detection”. Payments-specific, fail-closed.
audit

Client-owned ledger

Hash-chained, append-only, signed with your key. Export and verify with open-source agora-verify — zero trust in us required. Anchor the chain hash anywhere you like.

EU AI Act & DORA ask for exactly this artifact.
finance

Fail-closed budgets

Per-consumer keys with hard budgets. Over the line → HTTP 429, not a surprise invoice. Spend attribution per key, per model, per day.

Cost control as a property of the gateway, not a dashboard afterthought.
sovereignty

Your perimeter, full stop

Self-hosted EU node, deterministic egress allow-list, secrets via files (never env), and no payment system reachable from the deliberation plane — by network, not by promise.

SaaS routers see every prompt and charge ~5% for the privilege.
Comparatio

The quadrant, in one table

GatewaySmart routingLearns from your verified outcomes Self-hostedPCI tripwireClient-owned ledgerCouncil mode
OpenRoutervia 3rd partynonononono
Not Diamondyesoffline eval-setnononono
LiteLLM / Bifrostrulesnoyesnologsno
Portkey / Kongconfignopartialgeneric PIIlogsno
AgoraRouteryesonline, closed-loopyesnamed featureEd25519, verifiablewith protected dissenter

Sourced from vendor docs and pricing pages, June 2026. “Logs” = editable, retention-billed logging — not a cryptographic ledger.

Iter — the adoption path

Start with zero risk. Stop wherever you like.

I

Silent Audit

Auditum

Log-only. We read your spend, you get the CFO report and a sized prize. Nothing changes in production.

risk: zero
II

The Router

Iter

Point one agent at model:"auto". Watch win-rates per task-class on the dashboard. Learning stays in shadow until proven.

risk: one base-url
III

The Council

Concilium

High-stakes decisions get a multi-model debate with a protected dissenter — and a signed Decision Audit Report.

risk: opt-in per call
IV

The Flywheel

Rota

Verified outcomes keep re-weighting the router on your reality. The longer it runs, the harder it is to replace — because it knows your work.

your data moat
Initium — two minutes

Keep your SDK.
Change one line.

  • OpenAI-compatible — works with the SDKs, Cline, Continue, Aider, Cursor configs you already have.
  • model:"auto" — task-class detected per request; high-stakes classes fan out to an ensemble.
  • Every response annotated — which model, which task-class, why.
  • No new eval infrastructure — learning is shadow-first and judged by the CI you already run.
# pip install openai — that's it
from openai import OpenAI

client = OpenAI(
    base_url="https://agora-mcp.legate.bot/v1",
    api_key="agk_live_…",
)

r = client.chat.completions.create(
    model="auto",  # the router decides
    messages=[{"role":"user",
      "content":"refactor this function…"}],
)
# → routed to the model that wins
#   at refactoring — in YOUR repos
Pretium

Flat. Yours. No tolls.

  • Self-hosted license, flat monthly fee — between an OSS proxy you babysit and an enterprise platform you rent.
  • No percentage of your traffic. SaaS routers take ~5% of every token forever.
  • No fees on your own audit trail. Others bill you for log volume and retention; your ledger is yours.
  • Start with the Silent Audit — free, log-only, and it tells you exactly what the router would save.
Flat fee
per month · self-hosted · all features
never a % of traffic · never per-log fees
Acta non verba

Let the strongest argument win —
and let us record why.

E disputatione veritas.