It creates launch artifacts, not generic advice.

Builds a reusable AI-browser-agent evaluation from a real task instead of a vague prompt.
Exports a replay script, scoring rubric, fallback selector hints, SVG route map, and receipt.
Flags ambiguous success criteria, brittle selectors, overlays, and task blockers before agents fail silently.
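
As an illustration of the kind of pre-flight check described above, here is a minimal Python sketch. It is not the tool's actual implementation: the `Step` type, the `lint_step` function, the vague-word list, and the brittle-selector patterns are all hypothetical, and the heuristics are deliberately crude.

```python
import re
from dataclasses import dataclass


@dataclass
class Step:
    """One step of a hypothetical browser task (illustrative only)."""
    action: str            # e.g. "click" or "fill"
    selectors: list[str]   # primary selector first, fallbacks after it
    success: str = ""      # observable success criterion for this step


# Assumed heuristics, not rules taken from the tool itself.
VAGUE_WORDS = ("works", "correctly", "properly", "successfully")
BRITTLE_PATTERNS = [
    re.compile(r":nth-child\("),            # depends on element position
    re.compile(r"^/html/"),                 # absolute XPath
    re.compile(r"\.[a-z]+-[0-9a-f]{5,}"),   # looks like a hashed CSS class
]


def lint_step(step: Step) -> list[str]:
    """Return human-readable warnings for one step; empty list means clean."""
    warnings = []
    if not step.success.strip():
        warnings.append("missing success criterion")
    elif any(word in step.success.lower() for word in VAGUE_WORDS):
        warnings.append("ambiguous success criterion")
    if len(step.selectors) < 2:
        warnings.append("no fallback selector")
    for sel in step.selectors:
        # One warning per selector, even if several patterns match.
        if any(p.search(sel) for p in BRITTLE_PATTERNS):
            warnings.append(f"brittle selector: {sel}")
    return warnings


if __name__ == "__main__":
    risky = Step("click", ["div.css-1a2b3c4 > button:nth-child(3)"], "checkout works")
    for warning in lint_step(risky):
        print(warning)
```

A step with a testable criterion (for example, "URL path equals /cart/confirm") and two attribute-based selectors would pass this sketch with no warnings. Real checks would also need to look at the live page, since overlays and blockers cannot be detected from the task definition alone.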

Boundary: This is not a guarantee that an AI agent will pass, not a substitute for real QA, and not a way to bypass site access controls, rate limits, authentication, or terms of service.