For people without a 24GB RAM video card, I've got an 8GB RAM one running this...

throwaway314155 · 2025-05-22T04:20:51 1747887651

> For people without a 24GB RAM video card, I've got an 8GB RAM one running

What're you using for this? llama.cpp? I have a 12GB card (RTX 4070) I'd like to try it on.

johnQdeveloper · 2025-05-22T04:23:12 1747887792

I believe it's just an HTTP wrapper and terminal wrapper around llama.cpp, built on a fork with some modifications.

throwaway314155 · 2025-05-22T04:38:19 1747888699

Does ollama have support for CPU offloading?

johnQdeveloper · 2025-05-22T05:49:12 1747892952

> Does ollama have support for CPU offloading?

Yes.

taneq · 2025-05-22T07:43:16 1747899796

A perfect blend of LMGTFY and helpfulness. :)

johnQdeveloper · 2025-05-22T08:29:56 1747902596

lol. I try not to be a total asshole, it sometimes even works! :)

Good luck to you mate with your life :)