Hacker News new | past | comments | ask | show | jobs | submit login

Too bad there seem to be no pretrained models to download. This is not a quantization method to apply on existing models, so having the pretrained weights is needed if one wants to test it.



+1 On this, the real proof would have been testing both models side-by-side.

It seems that it may be published on GitHub [1] according to HuggingFace [2].

[1] https://github.com/microsoft/unilm/tree/master/bitnet

[2] https://huggingface.co/papers/2402.17764


Nothing there yet, but it's good to know they want to publish just did not get around to yet.


From [2]:

> We would definitely be happy to open-source the models for future research. Please stay tuned!


link #2 appears to be broken.


Tested earlier, still seems to be working fine. I can only suggest to try a VPN/alternative DNS?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: