Even (very) noisy LLM evaluators are useful for improving AI agents.

· Bits and Bobs 6/1/26

More on this topic

From other episodes