Multi-model adversarial deliberation

The room where AI argues with itself.

Routers optimize cost. Agents execute tasks. Think Council stress-tests decisions — by making four AI personas debate yours, then synthesizing the recommendation with the dissent attached.

~400 on the waitlist invites Fridays no card
The council
Claude GPT Gemini Llama
TOPIC Turn 1 — opening arguments
Proposer
claude-sonnet-4-5
Challenger
gpt-5
Steelmanner
gemini-2.5-pro
Pre-Mortem
llama-3.3-70b
Synthesis
0%

Recommendation. Use Auth0 now, plan to migrate at 5k MAU. Time savings justify the cost at your current stage.

The cast · always these four

Four roles. Forced.
Hue is the only thing that varies.

Every deliberation runs the same shape. The personas read each other across turns and rebut — they don't vote in parallel. That's the difference.

Proposer
For

Builds the strongest case.

Frames the upside, names the trade-offs, anchors the decision.

Challenger
Against

Attacks every assumption.

Surfaces blind spots, hidden costs, lock-in. "What am I not seeing?"

Steelmanner
Both

Argues the best version.

No straw men. Reconciles where reconcilable, names the real fork where not.

Pre-Mortem
Future

Writes the obituary.

Narrates how this fails 18 months from now. Validation gates fall out of it.

Why a council, not a router

Routers optimize cost.
Agents execute tasks.
Councils stress-test decisions.

Single model

One opinion, fast.

You ask, it answers, you ship. Great for low-stakes calls. Bad for the ones that compound.

No disagreement
Router · ensemble

Disagreement, averaged out.

Routes to the cheapest model, or votes across many. Optimizes cost or consensus — both wash out the dissent that mattered.

Disagreement discarded
Think Council

Disagreement, preserved + cited.

Four personas argue across turns. The synthesis tells you what to do — and shows you exactly which objection it overruled to get there.

Decision quality
Decisions that earn a council

Built for the calls that compound.

Build vs buy

"Auth0, or roll our own?"

Council surfaces lock-in, cost-curve break points, migration pain.

Architecture

"Postgres or DynamoDB for events?"

Pre-Mortem narrates the migration story you'd rather not write.

Product

"Ship the AI feature this quarter?"

Steelmanner reconciles velocity vs. retention risk into a defensible call.

Hiring

"Senior IC, or two mid-levels?"

Challenger names the second-order effects. You bring it to the room.

Sebastien Tang
From the founder

"I kept catching myself trusting whichever model answered first. The fix wasn't a smarter model — it was making them argue."

Frequently disputed

Questions worth answering.

Won't the four models just agree with each other?

They're prompted into adversarial roles with different cognitive stances. Agreement is rare — and when it happens, that's a signal, not noise.

How is this different from a multi-agent framework?

Agents execute. Councils deliberate. The output is a single recommendation with the dissent attached, not a chain of tool calls.

What does it cost?

Free: 3 deliberations/day with 2 models. Pro at $29/mo: unlimited, all four models, custom personas, BYOK, full history.

When do invites go out?

Weekly waves. We rate-shape so the streaming stays fast for everyone in the council.

Is the library actually open source?

Yes. MIT licensed on npm and GitHub. The hosted product is built on top.

Waitlist · invites Fridays

Show me the dissent,
not just the answer.

Join the waitlist. We'll send your invite the next Friday.

~400 ahead of you · no card · no sales call

Think Council · debating, not voting.