Hey, thanks for your comment! Of course, I’m biased. I was literally trying to explain how greedy decoding works. The LLM doesn’t 'want' to do better, as mentioned in the post, it just follows probability patterns based on its training data. That’s kind of the whole point. No reality, that’s simply pattern-matching at scale.
It's interesting that even when the LLM offers to do better, you actually still prefer the biased version of reality.
Hey, thanks for your comment! Of course, I’m biased. I was literally trying to explain how greedy decoding works. The LLM doesn’t 'want' to do better, as mentioned in the post, it just follows probability patterns based on its training data. That’s kind of the whole point. No reality, that’s simply pattern-matching at scale.
Ah, but my comment was that after you proved the greedy decoding and it offered to do better, you still prefered it to be biased, n'est-ce pas?
I was greedy too? :D
;-)