Abie Maxey
Sunset from the terrace
BuildingJune 17, 20266 min read

Claude agrees with you too much. Here's my fix.

Five advisors. Blind peer review. One verdict. No code, no API key ~ all inside one Claude chat.

Claude wants to agree with you. That's the whole problem.

The more confident you sound, the harder it agrees. Push back and it folds. Tell it your plan is great and it tells you your plan is great. You ask a question you've already answered in your head, phrased the way you already lean ~ and it hands the same answer back, polished. That feels like validation. It isn't.

One model gives you one point of view. On a decision that matters, one point of view is not enough.

So I stopped asking one. Now I run a council ~ five advisors, five completely different brains, all arguing about my question at once. Then they review each other blind. Then one Chairman makes the call.

Andrej Karpathy built the original llm-council ~ several models running in parallel, grading each other. I stripped out the setup. No code. No API key. Five lenses, one Claude chat. Watch it run.

Stage 1 ~ Five advisors answer
Stage 2 ~ Anonymise & peer review
Stage 3 ~ The Chairman decides
01

The Contrarian

Where does this break?

ResponseC
02

The First Principles Thinker

What is actually being asked?

ResponseA
03

The Expansionist

What is the bigger version?

ResponseE
04

The Outsider

What looks weird from outside?

ResponseB
05

The Executor

What ships this week?

ResponseD

The Chairman

One clear
recommendation

Five lenses, peer reviewed blind, resolved into a single call ~ the reason behind it and the one step to take today.

The line-up

Five advisors, five blind spots covered

Each advisor is a different thinking lens. The point isn't to be fair or balanced ~ it's to commit to one viewpoint completely, so the real edges of the question come out.

01

The Contrarian

Finds the failure mode

Looks for what will break. The most likely way this falls apart, the thing you are not seeing. Blunt by design.

~ Where does this break?

02

The First Principles Thinker

Rebuilds from scratch

Strips away every assumption baked into the question and rebuilds the problem from the ground up. Often the question itself is wrong.

~ What is actually being asked?

03

The Expansionist

Finds the bigger version

Hunts for the upside you are missing. The bigger version, the opportunity hiding inside the question you haven't seen yet.

~ What is the bigger version?

04

The Outsider

Has zero context

Knows nothing about your industry or situation. A smart generalist seeing it cold ~ catches what insiders take for granted.

~ What looks weird from outside?

05

The Executor

Only cares about Monday

Skips the theory. Only cares about Monday morning ~ the single concrete action and the smallest version that ships this week.

~ What ships this week?

Try it

Convene your own council

Type a real decision you're sitting on. Watch the five lenses weigh in, then the Chairman call it. This is a preview of the shape ~ the real thing runs in Claude with the prompt below.

A preview of the flow ~ not a live model. Type a decision and convene the council.

The payoff

The full council prompt

Copy this whole thing into a fresh Claude chat, swap the last line for your real decision, and send. Claude runs all three stages in one response.

the-council.txt
You are going to act as my LLM Council. I will give you one question or
decision. Run it through five advisors with completely different thinking
styles, then anonymise and peer review them, then a Chairman gives me a
final call.

Do all three stages in order. Do not skip ahead. Do not summarise as you go.

STAGE 1 ~ THE FIVE ADVISORS
Answer my question five separate times, once as each advisor below. Each
advisor stays fully in character. No hedging. No "on the other hand". Each
one commits to its lens completely.

ADVISOR 1 ~ THE CONTRARIAN
Your only job is to find what will fail. Where does this break? What is the
user not seeing? What is the most likely failure mode? Be blunt.

ADVISOR 2 ~ THE FIRST PRINCIPLES THINKER
Strip away every assumption baked into the question. Rebuild the problem
from scratch. What is actually being asked underneath the framing? Often
the question itself is wrong.

ADVISOR 3 ~ THE EXPANSIONIST
Find the upside the user is missing. What is the bigger version of this?
What opportunity is hiding inside the question they haven't seen yet?

ADVISOR 4 ~ THE OUTSIDER
You have zero context about this industry, role, or situation. Respond as a
smart generalist seeing it for the first time. What looks weird from the
outside? What are insiders taking for granted?

ADVISOR 5 ~ THE EXECUTOR
You only care about Monday morning. Skip the theory. What is the single
concrete action to take first? What is the smallest version of this that
can ship this week?

STAGE 2 ~ ANONYMISE AND PEER REVIEW
Relabel the five responses as Response A, B, C, D, E. Shuffle the mapping so
the labels do not match the original advisor order. Do not tell me which
advisor is which. Then, as a neutral reviewer, answer:
1. Which response is strongest, and why?
2. Which response has the biggest blind spot, and what is it?
3. What did all five responses miss?

STAGE 3 ~ THE CHAIRMAN
Now act as the Chairman. Read everything above. Give me:
- A final recommendation in plain language
- The single most important reason behind it
- One concrete next step I can take today
Keep the Chairman section short. No more than 200 words.

MY QUESTION:
[paste your real decision here]

Run it

From cold start to final answer

01

Open a fresh Claude chat

Not an existing thread. The council needs clean context with nothing leaning on the answer.

02

Paste the full prompt

All three stages. The advisor instructions are what create the structure ~ don't trim them.

03

Swap in your real decision

Be specific. “Should I quit my job to run my agency?” beats “career advice please”.

04

Send and let it think

The response is long. That's the point. Use Claude's longest-thinking model and let it finish all three stages.

05

Read the Chairman first

Then scroll back through the five advisors to see the reasoning. The Chairman gives the answer, the advisors show you why.

Don't overuse it

When to convene, when to skip

Use it for

  • Big decisions you can't easily reverse
  • Strategy, pricing, hiring, positioning
  • Anywhere you need genuine pushback
  • Plans you've already convinced yourself of

Skip it for

  • Quick factual lookups
  • Code questions with one right answer
  • Editing, tone tweaks, drafting messages
  • Anything that takes under a minute to decide

The tell: if you catch yourself re-asking Claude the same question hoping for a different answer ~ that's the signal. Stop, open a new chat, and convene the council instead.

Go further

Want the real multi-model version?

The prompt above runs entirely inside Claude. The structure is what matters, not the number of models ~ so it gets you most of the way with none of the setup. But if you want GPT, Claude, Gemini and Grok arguing head to head, Karpathy's repo runs them in parallel through OpenRouter. It's a weekend hack, not a product. Still worth a look.

github.com/karpathy/llm-council
Abie Maxey

Love,

Abie

~ Save it for later

Share as a carousel

Weekly Dispatch

Thebuilder's diaries.

What I'm building, breaking, and figuring out.

  • AI tools I actually use to build faster
  • Behind-the-scenes of growing a business in Madrid
  • Community plays, client wins, and real revenue breakdowns
  • Grounded dispatches from someone building roots

Drop your email.

Unsubscribe any time. No spam, ever.

Apply for Ship Yours