Anthropic rolled out a new safety feature optimized for "model welfare"

· Bits and Bobs 8/18/25
    • Obviously this is a reasonable feature given the topics that it cuts off.
    • But the framing of "we do this for model welfare" raises my eyebrows.
    • This is one of the ways you can tell the major model labs have lots of true believers trying to summon a new god.
