You kind of can - projects like deepspeed (https://www.deepspeed.ai/) enable run...

		kernelsanderz on Sept 4, 2022 \| parent \| context \| favorite \| on: Running Stable Diffusion on Your GPU with Less Tha... You kind of can - projects like deepspeed (https://www.deepspeed.ai/) enable running a model that is larger than in VRAM through various tricks like moving weights from regular system RAM into VRAM between layers. Can come with a performance hit though depending on the model, of course.