Why Verification Matters
In visual mathematical reasoning, wrong answers become especially damaging at the RL stage: every rollout can repeatedly convert a noisy label into reward signal. VeriEvol moves answer reliability into the data-construction stage, before any policy update.