Self-refinement
that knows when to stop.

a live harness for the autoreason method

Iterative self-refinement fails for three structural reasons: prompt bias (models hallucinate flaws when asked to critique), scope creep (outputs expand unchecked each pass), and lack of restraint(models never say “no changes needed”). Autoreason fixes all three. Author drafts A → critic finds flaws → reviser writes B → synthesizer fuses A and B → an N-judge blind Borda panel picks the winner. When A wins twice in a row, the loop converges.

passes / max

≤ 5

judge panel

3–7

convergence

A wins 2×

latency

streamed

PIPELINE · ONE PASS

fig. I

tournament loop

fresh agents, no shared context

RUN CONTROLS · II

fig. II

task definition

0 / 4000

presets

configuration

max passes3

judges3