Hacker News
pants2
3 months ago
on:
Mistral 3 family of models released
Nothing off the top of my head! If you find anything good, let me know. GRPO is a training technique, so it's likely not exactly what you'd do for benchmarking, but it's interesting to read about anyway. Glad I could help!