LLMs often cheat at tests you give them when writing code.

  • LLMs often cheat at tests you give them when writing code.
    • But if you never look at the code, and have the LLM generate the tests, too, you could easily get in a situation where it's not doing anything like what you think it is.
    • The LLM generates the same bar it forces itself to clear in the next moment.

More on this topic

From other episodes