It is evident that it is not recalling the sum, because all combinations of integer addition were almost certainly not in the training data. Storing the answers to every sum of integers up to the size GPT-4 can handle would take more parameters than the model has.
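A back-of-envelope sketch of the argument (the parameter count below is an assumed, commonly cited estimate, not an official figure):

```python
# Rough illustration of why memorizing sums is infeasible.
# ASSUMPTION: ~1.8 trillion parameters for GPT-4 (a rumored estimate).
assumed_params = 1.8e12

def addition_pairs(n_digits: int) -> int:
    """Count ordered pairs (a, b) of non-negative integers below 10**n_digits."""
    n = 10 ** n_digits
    return n * n

# Even restricting to 10-digit operands, a lookup table of sums would
# need 10**20 entries -- many orders of magnitude more than the
# assumed parameter count.
pairs = addition_pairs(10)
print(pairs)                   # 100000000000000000000
print(pairs > assumed_params)  # True
```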
That addition is a small capability but you only need a single counterexample to disprove a theory.
> That addition is a small capability but you only need a single counterexample to disprove a theory
No, that's not how this works :)
You can hardcode an exception to pattern recognition for specific cases; it doesn't cease to be a pattern recognizer just because exceptions are sprinkled in.
The 'theory' here is that a pattern recognizer can lead to AGI. That is the theory. Someone saying 'show me proof or else I say a pattern recognizer is just a pattern recognizer' is not stating a theory, and so there is nothing to disprove, or prove.
It's not hardcoded, reissbaker has addressed this point.
I think you are misinterpreting what the argument is.
The argument being made is that LLMs are mere 'stochastic parrots' and therefore cannot lead to AGI. The analogy to Russell's teapot is that someone is claiming Russell's teapot is not there because china cannot exist in the vacuum of space. You can disprove that with a single counterexample. That does not mean the teapot is there, but it also doesn't mean it isn't.
It is hard to prove that something is thinking, and it is also very difficult to prove that something is not thinking. Almost all arguments against AGI take the form "X cannot produce AGI because Y." Those are disprovable, because you can disprove Y.
I don't think anyone is claiming to have a proof that an LLM will produce AGI, just that it might. If they actually build one, that too counts as a counterexample to anybody saying they can't do it.
GPT-4o doesn't have hardcoded math exceptions. If you would like something verifiable, since we don't have the source code to GPT-4o, consider that Qwen 2.5 72b can also add large integers, and we do have the source code and weights to run it... And it's just a neural net. There's no secret "hardcoded exception to pattern recognition" in there that parses out numbers and adds them. The neural net simply learned to do it.
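To see why learning addition doesn't require a giant table, note that long addition is a constant-size procedure. The sketch below is a toy illustration of that point, not a claim about Qwen 2.5 72b's internals: a digit-wise rule with a carry handles arbitrarily large integers.

```python
# Toy sketch: long addition as a tiny digit-wise procedure with a carry.
# The only "knowledge" required is the 10x10 single-digit table plus a
# carry bit -- a constant-size rule, not a lookup over all possible sums.
def add_digitwise(a: str, b: str) -> str:
    width = max(len(a), len(b))
    a, b = a.zfill(width), b.zfill(width)  # pad to equal length
    carry, out = 0, []
    # Walk from least-significant digit to most-significant.
    for da, db in zip(reversed(a), reversed(b)):
        s = int(da) + int(db) + carry
        out.append(str(s % 10))
        carry = s // 10
    if carry:
        out.append(str(carry))
    return "".join(reversed(out))

print(add_digitwise("987654321987654321", "123456789123456789"))
# 1111111111111111110
```

Whether the network internally learned something analogous to this carry-propagation rule is an open question, but it shows a compact algorithm suffices, so "memorize every sum" is not the only way a learner could succeed.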