LLMs are great at reasoning shallowly but broadly, and extremely quickly.
Humans can do deep reasoning narrowly and slowly.
Both are useful on their own
But what's really powerful is the two working together in a tight collaborative loop.
The LLM does 90% of the heavy lifting; the human applies the differentiated judgment and steering.
The result is something an order of magnitude better than what either the LLM or the human could have done alone.
LLMs are patient–or rather, are so quick that they don't need to be patient.
LLMs as preternaturally patient people.
The information that the human gives the LLM in these cooperative sessions would be extremely potent training data.
It's laser focused on the stuff the human didn't like or wanted to change, the parts to update to make the model better.