Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Kind of a ridiculous approach, especially for this model. Use together.ai, fireworks.ai, RunPod serverless, any serverless. Or use ollama with the default quantization, will work on many home computers, including my gaming laptop which is about 5 years old.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: