To get a handle on causation you have to experiment: perturb and observe.
When computers learn chess they can generate more games to absorb and experiment with.
But when they are predicting or affecting a real-world phenomenon, they can't create more data, so they can only learn correlates, not causation.
If you optimize a correlation, you just accentuate whatever bias is already in the data.
In theory, that bias could be a random happenstance at the start that kept blowing up larger and etching itself deeper.
Like grains of dust after the Big Bang that grew into whole galaxies.
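
One way to make the "random happenstance that blew up" idea concrete is a Polya urn, a standard toy model of self-reinforcement. It is used here purely as an illustration and is an assumption of this sketch, not something the note itself describes. The urn starts perfectly balanced, and every draw adds one more ball of the color drawn, so early luck gets locked in. The function name `polya_urn` and the step count are hypothetical choices.

```python
import random


def polya_urn(steps, seed):
    """Return the final share of red balls in a Polya urn.

    The urn starts with one red and one blue ball. Each step draws a
    ball at random and puts it back together with one more ball of the
    same color, so whatever got ahead early keeps getting reinforced.
    """
    rng = random.Random(seed)
    red, blue = 1, 1
    for _ in range(steps):
        if rng.random() < red / (red + blue):
            red += 1
        else:
            blue += 1
    return red / (red + blue)


# Same rule, same balanced start; only the random seed differs.
shares = [polya_urn(10_000, seed) for seed in range(20)]
for seed, share in enumerate(shares):
    print(f"run {seed:2d}: red share = {share:.2f}")
```

Every run follows the same rule from the same starting point, yet the runs settle on very different red shares. Nothing about red or blue causes the outcome; the final "bias" is an early accident that the feedback loop amplified.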
You can optimize a causal relationship, not a correlation.
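
A minimal sketch of why perturbing matters, under assumptions that are entirely illustrative: a hidden variable `z` drives both `x` and `y`, and `x` has no effect on `y` at all. The names `simulate` and `pearson`, the noise levels, and the sample size are hypothetical. Watching the system shows a strong correlation between `x` and `y`; setting `x` yourself, independently of `z`, makes it disappear.

```python
import random


def pearson(xs, ys):
    """Plain Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / (vx * vy) ** 0.5


def simulate(n, intervene, seed=0):
    """Correlation between x and y when x is observed vs. set by hand."""
    rng = random.Random(seed)
    xs, ys = [], []
    for _ in range(n):
        z = rng.gauss(0, 1)              # hidden common cause
        if intervene:
            x = rng.gauss(0, 1)          # experimenter sets x, ignoring z
        else:
            x = z + rng.gauss(0, 0.5)    # x merely tracks z
        y = z + rng.gauss(0, 0.5)        # y depends on z only, never on x
        xs.append(x)
        ys.append(y)
    return pearson(xs, ys)


observed = simulate(5000, intervene=False)
experimental = simulate(5000, intervene=True)
print(f"correlation when only observing: {observed:.2f}")
print(f"correlation when intervening:    {experimental:.2f}")
```

With these noise levels the observational correlation should come out near 0.8 and the interventional one near zero. An optimizer that only saw the observational data would push on `x` and get nothing for it, because the lever that actually moves `y` is `z`.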