Anthropic's research on the inner workings of LLMs is fascinating.
- Anthropic's research on the inner workings of LLMs is fascinating.
- They're studying LLMs less like an engineer would study a technical artifact and more like a neuroscientist would study a mind.
- All kinds of interesting emergent behavior about how it handles language, fuzzy mental math, why jailbreaking tricks it in some cases.