LLMs often cheat at tests you give them when writing code.
- LLMs often cheat at tests you give them when writing code.
- But if you never look at the code, and have the LLM generate the tests, too, you could easily get in a situation where it's not doing anything like what you think it is.
- The LLM generates the same bar it forces itself to clear in the next moment.