A pattern to dull the downside risk of agents: have them only write "drafts".

· Bits and Bobs 2/24/25
  • A pattern to dull the downside risk of agents: have them only write "drafts".
    • The drafts still need to be activated by a human before executing the action in the real world.
    • This provides a natural checkpoint to cap the most egregious downside risk.
    • But this now means that users have to constantly check back in when the agent has a proposed task to do.
    • If most of their useful actions (and research gathering ones) require actions that could plausibly be dangerous, the agent (and the user operating it) will alternate between being in twiddling-thumbs node while they wait for the other.
    • The amount of time the human spends actually doing the action gets smaller and smaller as the quality of recommendations and filters gets better.

More on this topic

From other episodes