Hacker News
littlestymaar on July 24, 2023 | on: Attention Is Off By One
> I suppose only time will tell why it was ignored publicly before: maybe it doesn't do much, maybe it just fell through the cracks, maybe Google just didn't push it. Who knows.
Maybe quantization wasn't as hot back then as it is now?
jablongo on July 24, 2023
Yeah, the benefit is not going to come in terms of performance for a given model, but in terms of the model's ability to be efficiently quantized.