Goodhart's law is a form of 'cheating'.
- Goodhart's law is a form of 'cheating'.
- Cheating happens with agents who aren't aligned with the collective as an end in and of itself.
- That means if there's an action that will get them as an individual an edge at the cost of the collective, they'll take it.
- You can get strong alignments by having a deeply and widely believed end.
- An infinite.
- Something like "I will go to hell if I cheat."