My experience with cursor and sonnet is that it is relatively good at first trie...

delichon · 2025-02-01T01:27:39 1738373259

Claude makes a lot of crappy change suggestions, but when you ask "is that a good suggestion?" it's pretty good at judging when it isn't. So that's become standard operating procedure for me.

It's difficult to avoid Claude's strong bias for being agreeable. It needs more HAL 9000.

4b11b4 · 2025-02-01T02:21:58 1738376518

I'm always asking Claude to propose a variety of suggestions for the problem at hand and their trade-offs, then evaluating them for the top three proposals and why. Then I'll pick one of them and further vet the idea

kace91 · 2025-02-01T13:07:26 1738415246

>It's difficult to avoid Claude's strong bias for being agreeable. It needs more HAL 9000.

Absolutely, I find this a challenge as well. Every thought that crosses my mind is a great idea according to it. That's the opposite attitude to what I want from an engineer's copilot! Particularly from one who also advices junior devs.

esperent · 2025-02-01T03:16:01 1738379761

> when you ask "is that a good suggestion?" it's pretty good at judging when it isn't

Basically a poor man's COT.

jwpapi · 2025-02-01T00:10:27 1738368627

Yes it’s usually worth it to try to write a really good first prompt

earleybird · 2025-02-01T01:28:30 1738373310

More than once I've found myself going down this 'little maze of twisty passages, all alike'. At some point I stop, collect up the chain of prompts in the conversation, and curate them into a net new prompt that should be a bit better. Usually I make better progress - at least for a while.

SamPatt · 2025-02-01T04:52:16 1738385536

This becomes second nature after a while. I've developed an intuition about when a model loses the plot and when to start a new thread. I have a base prompt I keep for the current project I'm working on, and then I ask the model to summarize what we've done in the thread and combine them to start anew.

I can't wait until this is a solved problem because it does slow me down.

jwpapi · 2025-02-13T23:36:13 1739489773

Yes when new models come out it feels like breaking up.

dr_dshiv · 2025-02-01T01:36:13 1738373773

Why is it so hard to share/find prompts or distill my own damn prompts? There must be good solutions for this —

garfij · 2025-02-01T03:48:20 1738381700

What do you find difficult about distilling your own prompts?

After any back and forth session I have reasonably good results asking something like "Given this workflow, how could I have prompted this better from the start to get the same results?"

dr_dshiv · 2025-02-01T13:25:59 1738416359

Analysis of past chats in bulk.

whall6 · 2025-02-01T02:28:16 1738376896

Don’t outsource the only thing left for our brains to do themselves :/

sheepscreek · 2025-02-01T05:57:43 1738389463

For my advanced use case involving Python and knowledge of finance, Sonnet fared poorly. Contrary to what I am reading here, my favorite approach has been to use o1 in agent mode. It’s an absolute delight to work with. It is like I’m working with a capable peer, someone at my level.

Sadly there are some hard limits on o1 with Cursor and I cannot use it anymore. I do pay for their $20/month subscription.

electroly · 2025-02-01T06:56:26 1738392986

> o1 in agent mode

How? It specifically tells me this is unsupported: "Agent composer is currently only supported using Anthropic models or GPT-4o, please reselect the model and try again."

sheepscreek · 2025-02-01T12:05:16 1738411516

I think you’re right - I must have used it in regular mode, then got GPT-4o to fill in the gaps. It can fully automate a lot of menial work, such as refactors and writing tests. Though I’ll add, I had a roughly 50% success with GPT-4o bug fixing in agent mode, which is pretty great in my experience. When it did work, it felt glorious - 100% hands-free operation!

axkdev · 2025-02-01T11:39:54 1738409994

It seems like you could use aider in architecture mode. Basically, it will suggest the solution to your problem fist, and prompt you to start editing, you can say no to refine the solution and only start editing when you are satisfied with it.

mathieuh · 2025-02-01T03:24:53 1738380293

Hah, I was trying it the other day in a Go project and it did exactly the same thing. I couldn’t believe my eyes, it basically rewrote all the functions back out in the test file but modified slightly so the thing that was failing wouldn’t even run.

nprateem · 2025-02-01T07:13:55 1738394035

I've had it do similar nonsense.

I just don't understand all the people who honestly believe AGI just requires more GPUs and data when these models are so inherently stupid.

hahajk · 2025-02-01T01:02:15 1738371735

Can't you select Chatgpt as the model in cursor?

kace91 · 2025-02-01T01:10:22 1738372222

Yes, but for some reason it seems to perform worse there.

Perhaps whatever algorithms Cursor uses to prepare the context it feeds the model are a good fit for Claude but not so much for the others (?). It's a random guess, but whatever the reason, there's a weird worsening of performance vs pure chat.

electroly · 2025-02-01T02:40:04 1738377604

Yes but every model besides claude-3.5-sonnet sucks in Cursor, for whatever reason. They might as well not even offer the other models. The other models, even "smarter" models, perform vastly poorer or don't support agent capability or both.