The thing about AI is that it doesn't work, you can't build on top of it, and it won't get better.

It doesn't work: even for the tiny slice of human work that is so well defined and easily assessed that it gets sent out to freelancers on sites like Fiverr, AI mostly can't do it. We've had years to try this now; the lack of any compelling AI work is proof that it can't be done with current technology.

You can't build on top of it: unlike foundational technologies like the internet, AI can only be used to build one product, a chatbot. The output of an AI is natural language, and it's not reliable. How are you going to meaningfully process that output? The only computer system that can process natural language is another AI, so all you can do is feed one AI into the next. And how do you assess accuracy? Again, your only tool is an AI, so your only option is to ask AI 2 whether AI 1 is hallucinating, and AI 2 will happily hallucinate its own answer. It's like The Cat in the Hat Comes Back: Cat E trying to clean up the mess Cat D made trying to clean up the mess Cat C made, and so on.

And it won't get any better. LLMs can't meaningfully assess their training data; they are statistical constructions. We've already squeezed about all we can from the training corpora we have, and more GPUs and parameters won't make a meaningful difference. We've succeeded at creating a near-perfect statistical model of Wikipedia and Reddit and so on; it's just not very useful, even if it is endlessly amusing for some people.



> [LLMs] won't get any better.

Can you pinpoint the date on which LLMs stagnated?

More broadly, it appears to me that LLMs have improved up to and including this year.

If you consider LLMs not to have improved in the last year, I can see your point. But then one must consider GPT-4.5, Claude 3.5, DeepSeek, and Gemini 2.5 not to be improvements.


September 2024 was when OpenAI announced its new model: not a plain LLM but a "chain of thought" model built on LLMs. This represented a turn away from the "scale is all you need to reach AGI" idea by its top proponent.


If September 2024 is when, in your mind, stagnation became obvious, then surely the last improvement must have come before that?

Whatever the case, there are open platforms that let users compare two anonymous LLMs side by side and rank the models based on the resulting votes [1].

What I observe when I look at these rankings is that none of the top-ranked models date from before your stagnation cutoff of September 2024 [2].
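For context on how arena-style leaderboards turn anonymous pairwise votes into a ranking: the paper at [1] fits a Bradley-Terry model, but the idea can be sketched with a simpler Elo-style update. This is a toy illustration with made-up model names and a fabricated battle log, not the arena's actual implementation:

```python
# Toy Elo-style ranking from anonymous pairwise votes (illustrative only;
# the real arena fits a Bradley-Terry model over all battles at once).

def expected_score(r_a: float, r_b: float) -> float:
    """Probability that model A beats model B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400))

def update(ratings: dict, winner: str, loser: str, k: float = 32.0) -> None:
    """Shift ratings toward the observed outcome of one vote."""
    e_w = expected_score(ratings[winner], ratings[loser])
    ratings[winner] += k * (1.0 - e_w)
    ratings[loser] -= k * (1.0 - e_w)

# Hypothetical battle log: (winner, loser) pairs from user votes.
battles = [("model_a", "model_b"), ("model_a", "model_c"), ("model_b", "model_c")]
ratings = {"model_a": 1000.0, "model_b": 1000.0, "model_c": 1000.0}
for w, l in battles:
    update(ratings, w, l)

leaderboard = sorted(ratings, key=ratings.get, reverse=True)
print(leaderboard)  # model_a ranks first in this toy log
```

The point being: the leaderboard at [2] is built from human preferences, not from one AI grading another.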

[1] https://arxiv.org/abs/2403.04132

[2] https://lmarena.ai/



