LLMs are inherently statistical summarizers.
Which is why they pull to the centroid.
"What is the most average answer, conditioned on the input so far?"
Which is why they pull to the centroid.
"What is the most average answer, conditioned on the input so far?"
From other episodes