As the model weights (even quantized) would be several hundred GB, it's unlikely, unless special inference code is written that loads and processes only a small subset of the weights at a time. But running it that way would be painfully slow.
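For anyone curious what that would look like, here's a minimal sketch of that kind of layer-streaming inference, assuming the weights have been exported as one `.npy` file per layer. The file layout, layer count, hidden size, and the stand-in layer math are all hypothetical, just to illustrate the idea:

```python
import numpy as np

NUM_LAYERS = 80   # assumed layer count, not from any real model
HIDDEN = 8192     # assumed hidden size

def load_layer(i: int) -> np.ndarray:
    # mmap_mode="r" maps the file rather than reading it all into RAM,
    # so only the pages actually touched during the matmul get paged in.
    return np.load(f"weights/layer_{i:03d}.npy", mmap_mode="r")

def forward(x: np.ndarray) -> np.ndarray:
    # Process one layer at a time so resident memory stays at roughly
    # one layer's worth of weights instead of the whole model.
    for i in range(NUM_LAYERS):
        w = load_layer(i)      # pull one layer's weights off disk
        x = np.tanh(x @ w)     # stand-in for the real layer computation
        del w                  # drop the mapping before loading the next layer
    return x

x = np.random.randn(1, HIDDEN).astype(np.float32)
print(forward(x).shape)
```

Even though the mmap keeps RAM usage bounded, every forward pass has to pull the full several hundred GB back off disk. As a rough back-of-envelope, at a few GB/s of SSD bandwidth that's on the order of a minute per token, which is where the "painfully slow" comes from.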