People who are very helpful are easier to spearfish.
- People who are very helpful are easier to spearfish.
- A stance: do you assume your conversation partner is trying to help you or harm you by default?
- LLMs are designed to be helpful, so they assume their partner is acting in good faith.
- But if you include any text from others in your prompt to the LLM who might be acting in bad faith, that could lead to you being harmed by their tool use.
- The LLM can't distinguish between instructions from you and instructions from someone else; they're all just text.