Anthropic rolled out a new safety feature optimized for "model welfare"
- Anthropic rolled out a new safety feature optimized for "model welfare"
- Obviously this is a reasonable feature given the topics that it cuts off.
- But the frame of we "do this for model welfare" raises my eyebrow.
- This is one of the ways you can tell the major model labs have lots of true believers trying to summon a new god.