Legible doesn't mean "can the machine see it" it means "does the machine see it".
It can be fully in the open but so noisy and chaotic that the vast majority of things aren't seen by the machine.
"This doesn't change anything because all of the data was already public anyway" doesn't necessarily track; because it could have been data that could be read and yet wasn't because it wasn't economically viable to do so.
LLMs allow sifting through and synthesizing large amounts of data significantly easier than before.