As someone whose background is in biology and physics and who also does ML work, I find this an incredibly optimistic view of ML.
>Of course, it really only works if the scientists are able to understand data and how to use ML, which is why computing becomes just a tool for a scientist, nothing else.
Ideally, in science you would like to use literally anything other than ML if possible: fitted models come with their own challenges, and neural networks are even more of a nightmare. Understanding the world well enough to hard-code a rule is always preferable to fitting to some data and hoping the model comes up with one. While there have been some attempts to use ML for feature detection, it then takes a lot of experimenting to show whether the model detected signal or just some noise in your data.
Most of the things that would accelerate science either require AI far more complex than we currently have (basically replacing lab assistants with AI) or are formidable research undertakings in their own right, like AlphaFold, Deep Potential neural networks, etc.
While AlphaFold 2 is a tremendous achievement, to me the major drawback is the black-box approach. It means it is very difficult to know when the model is outputting garbage, and it also doesn't directly lead to new insights.
A much more interesting approach: "Discovery of Physics From Data: Universal Laws and Discrepancies" [1]
If ML did that, then it would be much more interesting.
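To make that concrete, here is a minimal, hypothetical sketch of the sparse-regression style of law discovery (in the spirit of SINDy-like methods, not the actual algorithm from the cited paper): propose a library of candidate terms, fit noisy data, and threshold away small coefficients so the surviving terms read as an interpretable formula. All names and the threshold value are assumptions for illustration.

```python
import numpy as np

# Hypothetical example: recover a free-fall law d(t) = (g/2) t^2 from noisy
# measurements by sparse regression over a library of candidate terms.
rng = np.random.default_rng(0)
t = np.linspace(0.1, 2.0, 200)
y = 4.9 * t**2 + rng.normal(0, 0.01, t.size)  # synthetic noisy data

# Candidate terms the "law" could be built from.
names = ["1", "t", "t^2", "t^3"]
library = np.column_stack([np.ones_like(t), t, t**2, t**3])

# Sequentially thresholded least squares: fit, zero out small
# coefficients, refit on the remaining terms, repeat.
coef = np.linalg.lstsq(library, y, rcond=None)[0]
for _ in range(10):
    small = np.abs(coef) < 0.1  # threshold is an assumed hyperparameter
    coef[small] = 0.0
    keep = ~small
    coef[keep] = np.linalg.lstsq(library[:, keep], y, rcond=None)[0]

law = " + ".join(f"{c:.2f}*{n}" for c, n in zip(coef, names) if c != 0)
print("discovered law: d(t) =", law)
```

The appeal, compared to a black-box network, is that the output is a human-readable equation: a scientist can inspect which terms survived and whether the recovered coefficients match known constants.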
Actually, there might not be a good way to model or describe the difference between causal inference, correlation and causality.
Causality involves a deep understanding of a phenomenon in science.
For example, the Standard Model of physics describes the real world well enough because we understand a lot of it. The difference between correlation and causality, in my view, is human, scientific understanding of what things are. Formulas, data, or drawings are not enough.
For example, there may never be a way to prove natural selection, even with a lot of data available, but broad scientific consensus is enough to establish causality.