
Sure, but it's good to recognize that Meta never stopped publishing, even after OpenAI and, most notably, DeepMind stopped sharing the secret sauce. From CLIP to DINOv2 and the Llama series, it's a serious track record worth remembering.





But there is a big difference: Llama is still way behind ChatGPT, and one of the key reasons to open-source it could have been to use the open-source community to catch up with ChatGPT. DeepSeek, on the contrary, is already on par with ChatGPT.

Llama is worse than GPT-4 because they are releasing models 1/50th to 1/5th the size.

R1 is a 671B-parameter monster no one can run locally.
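
Back of the envelope, weights alone (a rough sketch that assumes 8-bit weights; the KV cache and activations would add more on top):

    params = 671e9          # R1's total parameter count
    bytes_per_param = 1     # int8 / fp8 quantization
    print(params * bytes_per_param / 1e9)  # ~671 GB of weights alone

That's multi-node territory, not a desktop GPU.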

This is like complaining that an electric bike only goes up to 80 km/h.


The R1 distills are still very, very good. I've used Llama 405B, and I would say dsr1-32b is about the same quality, or maybe a bit worse (subjectively within error), and the 70B distill is better.

What hardware do you need to be able to run them?

The distills run on the same hardware as the Llama (or Qwen) models they are based on anyway.
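
For anyone who wants to try one: a minimal sketch with Hugging Face transformers, assuming a CUDA GPU with roughly 24 GB of VRAM for the 32B distill at 4-bit, plus bitsandbytes and accelerate installed (the repo id is DeepSeek's published checkpoint):

    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

    model_id = "deepseek-ai/DeepSeek-R1-Distill-Qwen-32B"

    # 4-bit quantization cuts weight memory to roughly half a byte per parameter
    bnb = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.bfloat16)

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        quantization_config=bnb,
        device_map="auto",  # shards across GPUs / offloads to CPU as needed
    )

    inputs = tokenizer("Why is the sky blue?", return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=200)
    print(tokenizer.decode(out[0], skip_special_tokens=True))

The 70B distill (DeepSeek-R1-Distill-Llama-70B) is the same code with a different repo id and roughly twice the memory.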

The full version... if you have to ask, you can't afford it.


Yeah, no shit. That's because Meta is behind, and no one would care about them if it weren't open source.

Right, so it sounds like it's working then, given how much people are starting to care about them in this sphere.

We can laugh at that (as I like to do with everything from Facebook's React to Zuck's MMA training), or we can see how others (like DeepSeek and, to a lesser extent, Mistral, and to an even lesser extent, Claude) are doing the same thing to help themselves (and each other) catch up. What they're doing now, by opening these models, will be felt for years to come. It's draining OpenAI's moat.


How's that old chestnut go? "First they laugh at us..."?


