Having tried using it, it is much worse than r1. Both the standard and high effo...

CryptoBanker · 2025-02-01T16:24:37 1738427077

If it’s actually available, it can’t be that much worse than R1 which currently only completes a response about 50% of the time for me.

llm_trw · 2025-02-01T21:50:13 1738446613

There are multiple providers for it since it's open source.

egeozcan · 2025-02-02T05:53:41 1738475621

Are there any providers that have a chat interface (not just API access) with a fixed monthly cost? I couldn't find one.

llm_trw · 2025-02-02T14:01:46 1738504906

you.com when you disable their search the internet feature.

SkyPuncher · 2025-02-01T17:57:49 1738432669

Yea, o3-mini was a massive step down from Sonnet for coding tasks.

R1 is my cost effective programmer. Sonnet is my hard problem model still.

llm_trw · 2025-02-02T00:04:06 1738454646

R1 is interesting.

Since I have access to the thinking tokens I can see where it's going wrong and do prompt surgery. But left to it's own devices it gets thing _stupendously_ wrong about 20% of the time with a huge context blowout. So much so that seeing that happen now tells me I've fundamentally asked the wrong question.

Sonnet doesn't suffer from that and solves the task, but doesn't give you much if any, help in how to recover from doing the wrong task.

I'd say that for work work Sonnet 3.5 is still the best, for exploratory work with a human in the loop r1 is better.

Or as someone posted here a few days ago: R1 as the architect, Sonnet3.5 as the worker and critic.