
The OpenAI scorecard (o) is mostly concerned with restrictions: "Disallowed content", "Hallucinations", and "Bias".

I propose the People's Scorecard, which is p=1-o. It measures how fun a model is. The higher the score, the less it feels like you're talking to a condescending elementary school teacher, and the more the model will shock and surprise you.




That's LMSYS.



