Start from business-critical questions, then generate variants by changing assumptions, missing evidence, and contradictory facts.

Track pass rate and explanation quality across model and prompt revisions.