Bits and Bobs 1/6/26

2026-01-06

1Lenny's podcast with Sander Schulhoff about Prompt Injection.

"You can patch a bug, you can't patch a brain."
About 11 minutes in they reference me!

2A tweet summary of a new paper:

"This paper from Stanford and Harvard explains why most "agentic AI" systems feel impressive in demos and then completely fall apart in real use.
The core argument is simple and uncomfortable: agents don't fail because they lack intelligence.
They fail because they don't adapt."

3A nice insight from a YouTube video:

"Making ai models is less like training an animal intelligence and more like summoning ghosts."

4A tweet analyzing what Google is trying to do to OpenAI:

"google is trying to do to openai what facebook ended up doing to snap which is to first decelerate growth substantially (which kills a lot of momentum & morale) & then unleashing integrations at scale rapidly by leveraging distribution advantages."

5OpenAI admits that prompt injection is a fundamentally unsolvable problem:

"Prompt injection, much like scams and social engineering on the web, is unlikely to ever be fully 'solved.'"

6I don't want Big Tech, I want My Tech.

Big Tech is owned by someone else.
My Tech is owned by me.

7Mike Masnick on How we can make the internet good again.

8Nice summary of agentic browsers from a HackerNews comment.

"Let's spend years plugging holes in V8, splitting browser components to separate processes and improving sandboxing and then just plug in LLM with debugging enabled into Chrome. Great idea. Last time we had such a great idea it was lead in gasoline."

9This week's Wild West roundup.

10A tweet: "factorio is unironically the perfect tutorial for agentic coding systems btw."

11I binge watched Pluribus on my flight to London.

Excellent show.
There's an obvious metaphor for people who use AI and simply do whatever it tells them to do.
Compare to people who use AI to multiply their agency.
Looks similar, but wildly different.

12LLMs add a token that is most coherent with what's in the context.

So if it makes an error it will tend to make it again.
Because that error is put into the context.
The most coherent token to append silently assumes that the error is right.
The more errors it makes the more deeply ingrained the errors get.

13We're missing the digital "third place."

14A new paper from MIT's Daniel Jackson: What You See Is What It Does: A Structural Pattern for Legible Software.

15Chris Loy argues we're about to see the industrial revolution of software.

On the one hand, the idea that software to date has been hand-crafted and high-friction does resonate.
But unlike in the industrial revolution where everything was made more similar, the industrial revolution of software will allow software that is fractally personalized.

16Facebook is testing a $14.99 monthly subscription fee to post links.

The aggregator endstate.

17Imagine a service that unlocks the resonant value of AI for individuals.

18The right mindset to use coding agents to their fullest potential: "I don't know if it's going to work, but let's try it!"

If you have the mindset of "That might not work, let's not try it," then you won't unlock their full power.

19Josh Marshall points out the Grand AI Disconnect.

AI is really, really unpopular with the American public.
- "Fewer than 20% of Americans think AI will have a positive impact on America over the next 20 years."
But everyone keeps on acting like most people love it.

20An "inductively knowable" UX works great with reasonable defaults.

It just does what you expect.
Then as you peel back layers of understanding there's no magic.
But you don't need to know the lower layers; they just make sense if you ever do peel them back.

21The appropriateness of data being used is tied to the context it was collected and used in.

So if it's collected in a context where only superficial things can be extracted and then it's put in a context where it can be deeply understood, it feels like a betrayal.
That's why Google's data is a blessing and a curse in an era of LLMs.
They're sitting on a trove of data for each user… but if they preprocessed everyone's decades of emails it would feel like a crazy betrayal, an invasion.

22The vast majority of consumer software in the past couple of decades has been algorithmically simple.

We've explored every nook and cranny of that simple software landscape.
If your architecture doesn't matter, just build as quickly as possible and get as many users as possible as quickly as possible.
But if your architecture does matter, getting users on the wrong architecture could be deadly.

23We've all developed a learned helplessness about data and its use within apps.

We can't imagine it being any other way.
So we just stopped imagining.
For the world to become better, you have to first imagine that it could be better.

24A demoed system doesn't necessarily increase in performance.

A used system however will tend to increase in performance.
Users simply won't tolerate their usage patterns being slow.
When it's just a demo, users have to remember to actively demand performance.
- If you're distracted you won't demand it.
When it's something you're using, every time you use it you want it to be faster.
- It's just the most obvious, inescapable thing.

25All the vibecoding platforms are climbing the wrong hill.

They're trying to get everyone to create their own software.
That will never happen… software requires thinking at least like a PM.
But everyone creating their own software is a necessity if software can't be safely shared by strangers.
To climb the right hill, you'd have to make software that could be safely shared by strangers.

26If the system assumes an LLM as the main loop then there's a floor of how cheap and performant it can be.

Whereas if it assumes normal compute sweetened with LLMs there's no floor or ceiling.
Also, if you assume an LLM in the loop, the only way to improve is better models or better tools.
- Whereas normal code can accrete functionality over time.

27The big companies are all on the chatbot train.

Big companies can't bet on two contradictory things.
What if the most powerful use of LLMs is not chatbots?

28If it's a hill climbing exercise, first, make sure you're on the right hill.

For example, performance work should come after required architectural changes.
When you have users is when you're forced to start climbing the hill.
If you have users prematurely, it forces you to optimize prematurely, meaning you likely climb the wrong hill.

29A useful slog is a moat.

But a useless slog is a waste of time.
The whole question becomes: is it useful or not?

30Some teams are great at execution.

But they can't do multi-month slogs to non-obvious endpoints.
Other teams might go slower on simple projects, but are capable of achieving more complex projects.
If they achieve something useful, that gives them a moat.
Anyone who wants to catch them would have to go through a similar slog.

31To resonate, stories must be both true and believable.

32American culture is about maximizing, about never being satisfied.

Always striving to go above and beyond.
- Innovative… but exhausting.
Other cultures are more willing to merely satisfice.
- "It's good enough."
- "Why bother improving it?"

33PMF always starts as a needlepoint.

34The push for frictionless is a push for hollowness.

35First, make the theory practical.

Then make the practical marketable.

36To be useful, it has to feel more like a tool than a toy.

37If you're staring at the ground in front of you and not at the horizon you'll fall off a cliff.

38Friends don't use friends as smoke tests.

39The system builder and product builder are different.

The system builder, when they try to build product, will see the problem and try to solve it properly.
- Because the product is a means to their end: pushing the system to be full featured.
The product builder doesn't care about the system, they just want the product to work.

40The Six Degrees of Kevin Bacon effect shows up in many networks.

It happens because the combinatorics makes even a small incidence rate quickly saturate the network.
That also means that statements like "Nearly every academic paper is less than 6 citations away from a retracted paper" are not as interesting as they sound.
- It's trivially true no matter the density of retracted papers.
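
A minimal sketch of the saturation effect, assuming a uniformly random graph as a stand-in for a real citation network (real networks are clumpier, and the names `build_random_graph` and `fraction_within`, the graph size, and the incidence rates are all made up for illustration). It marks a small fraction of nodes and measures how much of the network sits within six hops of any marked node.

```python
import random
from collections import deque


def build_random_graph(n, avg_degree, rng):
    """Undirected random graph where each node has roughly avg_degree neighbors."""
    neighbors = [set() for _ in range(n)]
    for _ in range(n * avg_degree // 2):
        a, b = rng.randrange(n), rng.randrange(n)
        if a != b:  # skip self-loops
            neighbors[a].add(b)
            neighbors[b].add(a)
    return neighbors


def fraction_within(neighbors, marked, max_hops):
    """Fraction of all nodes within max_hops of any marked node (multi-source BFS)."""
    dist = {m: 0 for m in marked}
    queue = deque(marked)
    while queue:
        node = queue.popleft()
        if dist[node] == max_hops:
            continue  # do not expand past the hop limit
        for nxt in neighbors[node]:
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    return len(dist) / len(neighbors)


if __name__ == "__main__":
    rng = random.Random(0)
    graph = build_random_graph(n=5000, avg_degree=10, rng=rng)
    # Hypothetical incidence rates, from very rare to fairly common.
    for rate in (0.0005, 0.005, 0.05):
        count = max(1, int(rate * len(graph)))
        marked = rng.sample(range(len(graph)), count)
        print(rate, round(fraction_within(graph, marked, 6), 3))
```

In runs like this, the within-six-hops fraction tends to sit close to 1 even at the lowest rate, which is the point: the statistic says more about the network than about the marked nodes.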

41If you give someone the smoking gun they can stop looking for it.

42I found service in restaurants in Europe to be notably slower than in the US.

I didn't think it was about tipping–if most tips are in the same narrow band (15-20%), that doesn't explain much difference in speed.
But the existence of tips–even with a narrow band–incentivizes turning as many tables as possible.
So even if the average "salary" comes out the same at the default tipping rate, the system with tipping will incentivize faster table service than the one without, which doesn't reward servers for how many tables they finish.

43The defining characteristic of the Saruman is selfishness.

"I got mine, screw you."
Not caring about the externalities is selfishness.

44When you're fighting for money, you'll only optimize superficially.

You'll cut corners.
When you're fighting for honor, you'll optimize resonantly.
You won't take shortcuts.

45Resonance is soul.

46When incentives align, you get resonance by default.

47Ambition, when attached to selfishness and competency, can quickly become net bad for the surrounding system.

Slytherin for example is ambitious.
Ambition unchecked by shame will take shortcuts with significant externalities.
Imagine a choice in front of you.
- One way is the right way.
  - You take a personal penalty.
- The other way is a shortcut, with externalities.
  - But no one will ever know you did it.
- How big must the externality be before you don't take it?
  - Everyone has a threshold.
  - Some people's thresholds are significantly lower than other people's.

48A popular authoritarian is much more dangerous than an unpopular one.

49Curious people are much less likely to get bored.

A question gives you something to do.
Curious people always generate another question.
A great question is one that
- 1) is interesting
- 2) has an answer you care about.

50Whether or not something is in extremistan: does it have a preferential attachment effect or not?

Preferential attachment is multiplicative vs additive.
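
A toy simulation of that difference, under stated assumptions: every participant starts at the same size and receives the same kind of random shock each round, and the only thing that varies is whether the gain is added flat or scaled by current size. The function names, the shock range, and the round counts are all illustrative, and this is a simplified stand-in for preferential attachment, not a model of any real network.

```python
import random
import statistics


def simulate(rounds, population, multiplicative, rng):
    """Grow `population` participants from size 1.0 over `rounds` rounds."""
    sizes = [1.0] * population
    for _ in range(rounds):
        for i in range(population):
            shock = rng.uniform(0.0, 1.0)
            if multiplicative:
                # Gain scales with current size: the big get bigger faster.
                sizes[i] += sizes[i] * shock
            else:
                # Gain is independent of current size.
                sizes[i] += shock
    return sizes


def max_to_median(sizes):
    """How far the single largest participant sits above the typical one."""
    return max(sizes) / statistics.median(sizes)


if __name__ == "__main__":
    rng = random.Random(0)
    additive = simulate(100, 1000, multiplicative=False, rng=rng)
    compounding = simulate(100, 1000, multiplicative=True, rng=rng)
    print("additive max/median:", round(max_to_median(additive), 2))
    print("multiplicative max/median:", round(max_to_median(compounding), 2))
```

With additive gains the largest participant stays close to the median; with multiplicative gains the same luck compounds and the largest participant ends up many times the median.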

51You're allowed to make your own choices if you can understand and be responsible for their implications.

That's why we don't yet let young children make substantive decisions.

52Who gets to decide the lore of the tooth fairy?

No one.
It emerges, bottom up.
Based on the stories people choose to repeat.
The ones that loom largest.

53It feels like most outcomes are decided top down instead of emerging bottom up.

But that's an illusion because only the top down outcomes can be thought of as "decisions".
We think naturally in narrative so top down decisions compress way easier and thus feel more obvious and tidy.
- The bottom up outcomes can't be compressed as easily with "this agent had this goal and made this decision".
Also, even bottom up outcomes can look top down.
- An agent makes a locally optimal decision which unbeknownst to them pushes the emergent outcome past a critical point.
- That would look like that one agent's decision changed the whole system, but really the system-level effects are externalities of it.

54The game of telephone shows how every act of comprehension is an act of interpretation.

Over time, over multiple iterations, it continuously morphs.

55When you're making a movie, everyone has to be making the same movie.

Otherwise you get a discordant mess.
The alignment of vision is critical.

56You're reading a romance book.

Will it have a happy ending?
- The best predictor: was it filed in the Romance section or the Drama section?

57When people have many devoted friends, it's a sign that they're generous and open hearted.

58Believing in something bigger than yourself is how you transcend.