Agents get poisoned and keep confusing themselves.

  • Agents get poisoned and keep confusing themselves.
    • They see earlier mistakes and dead ends in their history and get confused by them.
    • A self-poisoning doom spiral.
    • The more they get off track, the more off-track material there is in the context to confuse them.
    • The word "not" is easy to miss, so every bit of "Don't do X" just makes them see X again.
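
A toy sketch of the last point, not a measurement of any real model: it only counts how often an unwanted term appears in the context before and after a "Don't do X" style correction. The history, the term `eval()`, and the helper names `count_mentions` and `add_correction` are all made up for illustration.

```python
def count_mentions(context: list[str], term: str) -> int:
    """Count case-insensitive occurrences of `term` across all messages."""
    needle = term.lower()
    return sum(message.lower().count(needle) for message in context)


def add_correction(context: list[str], term: str) -> list[str]:
    """Return a new context with a negative correction appended."""
    return context + [f"Don't use {term}."]


# Hypothetical history: the agent has gone off track once.
history = [
    "User: parse the config file.",
    "Agent: I'll use eval() to parse it.",
]

term = "eval()"
before = count_mentions(history, term)     # mentions before the correction
corrected = add_correction(history, term)  # append "Don't use eval()."
after = count_mentions(corrected, term)    # mentions after the correction

print(f"mentions of {term}: {before} -> {after}")
```

The correction is meant to steer away from the term, but it is also one more place where the term shows up in the context. If the "not" gets missed, all that is left is another mention of X.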