Hacker News new | past | comments | ask | show | jobs | submit login

How much ELO on chatbot arena does 4 bit lose vs full fp16/bf16?



I think this is the only known benchmark that actually compares different quants: https://oobabooga.github.io/benchmark.html

It's not really all that consistent, but larger models can be compressed more without as much loss.




Consider applying for YC's Summer 2025 batch! Applications are open till May 13

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: