Autoreason · tournament harness

Autoreason

Self-refinement that knows when to stop.

Initializing tournament harness
Autoreason
AUTOREASON · DOSSIER 01LIVE

Self-refinement
that knows when to stop.

a live harness for the autoreason method

Iterative self-refinement fails for three structural reasons: prompt bias (models hallucinate flaws when asked to critique), scope creep (outputs expand unchecked each pass), and lack of restraint(models never say “no changes needed”). Autoreason fixes all three. Author drafts A → critic finds flaws → reviser writes B → synthesizer fuses A and B → an N-judge blind Borda panel picks the winner. When A wins twice in a row, the loop converges.

passes / max
≤ 5
judge panel
3–7
convergence
A wins 2×
latency
streamed
PIPELINE · ONE PASS
fig. I
tournament loop
fresh agents, no shared context
INCUMBENT Akept verbatim · “do nothing” ballotCRITICflaws only · no fixesAUTHOR-Brevises against critiqueSYNTHESIZERA + B → ABJUDGE PANEL × Nfresh agentsno shared contextBLIND BORDAA WINS 2× → CONVERGEDA := WINNER
RUN CONTROLS · II
fig. II
task definition
0 / 4000
presets
configuration
max passes3
judges3