Validating agentic behavior when “correct” isn’t deterministic | Pasteblog