Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

And what makes you so confident that all those people are using different prompt styles when comparing models? You think most people don’t even understand the bare basics of how to compare two products?


That's the point: maybe someone has a personal prompting style that works great with Claude but gives worse results with GPT-4.

They might complain that GPT-4 is rubbish in comparison to Claude, but someone with a different personal prompting style might experience the opposite.


Having a prompting style that works with a model but not quite with another is much different than "suffering from poor prompting" the previous person was accusing others.

And given that those are tools, it's more like "the model can work with the user's prompts" rather than "the user's prompts are adapted to the model".

Unless we're here for an ego trip.


Ah, I see. I’d be interested to see a study on that. I find it hard to believe it would make such a stark difference but it’s possible.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: