- Versioning. Don't change the LLM model behind my application's back. Always provide access to older versions.
- Freedom. Allow me to take my business elsewhere, and run the same model at a different cloud provider.
- Determinism. When called with the same random seed, always provide the same output.
- Citation/attribution. Provide a list of sources on which the model was trained. I want to know what to expect, and I don't want to be part of an illegal operation.
- Benchmarking. Show me what the model can and cannot do, and allow me to compare with other services.
All of these come right out of the box with the HuggingFace toolset.
(Determinism does depend more on the exact software stack running the model. In general it works, but there are occasional exceptions, like PyTorch on M1 not being deterministic the first time you initialize it, or similar oddities.)
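For the versioning and determinism points, a minimal sketch with the transformers library; the model id, revision, and seed here are placeholders, not anything from this thread:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, set_seed

# Versioning: pin an exact revision (a git tag or commit hash on the Hub)
# so the model can't change behind your back. "gpt2" and "main" are placeholders;
# use a specific commit hash to fully pin.
model = AutoModelForCausalLM.from_pretrained("gpt2", revision="main")
tokenizer = AutoTokenizer.from_pretrained("gpt2", revision="main")

# Determinism: fix the Python, NumPy, and torch RNGs in one call.
set_seed(42)

inputs = tokenizer("The quick brown fox", return_tensors="pt")
outputs = model.generate(**inputs, do_sample=True, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```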
Is this true for LLMs but not for Stable Diffusion, at least? Stable Diffusion is largely deterministic, with issues arising mainly when switching between torch versions, GPU architectures, CUDA/cuDNN versions, etc.
I thought so too, but I run a Stable Diffusion service, and we see small differences between generations with the same seed and the same hardware class on different machines with the same CUDA drivers running in parallel. The outputs are really close, but there are subtle differences (which a downstream upscaler sometimes magnifies), and I haven't had the time to debug/understand this.
Ah okay, that makes sense. In my experience I've only noticed differences when the entire composition changes, so I'm guessing it's near pixel level or something?
I assume they're most noticeable with the ancestral samplers, like Euler a and DPM2 a (and variants)?
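For what it's worth, here's a rough sketch of how seeded generation and deterministic kernels are typically set up with diffusers; the checkpoint id, prompt, and seed are placeholders, and as discussed above this still doesn't guarantee bit-identical outputs across different GPUs or CUDA/cuDNN builds:

```python
import os

# CuBLAS needs this env var for deterministic matmuls on recent CUDA;
# it must be set before any CUDA work happens.
os.environ.setdefault("CUBLAS_WORKSPACE_CONFIG", ":4096:8")

import torch
from diffusers import StableDiffusionPipeline

# Ask PyTorch for deterministic kernels where available (some ops get
# slower; with warn_only=True, ops without a deterministic version warn
# instead of erroring).
torch.backends.cudnn.benchmark = False
torch.use_deterministic_algorithms(True, warn_only=True)

# Placeholder checkpoint id; any SD checkpoint on the Hub behaves the same way.
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# "Same seed" for Stable Diffusion means pinning the generator that draws
# the initial latent noise (and the extra noise used by ancestral samplers).
gen = torch.Generator(device="cuda").manual_seed(1234)
image = pipe("a photo of an astronaut riding a horse", generator=gen).images[0]
image.save("out.png")
```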
It is definitely possible. At any point, you can just take a snapshot of the weights. Together with a description of the architecture, this is a complete description of a model.
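A minimal sketch of what such a snapshot looks like with transformers (the model id and output path are just examples): save_pretrained writes the weight files plus a config.json describing the architecture, which together are a complete, portable description of the model.

```python
from transformers import AutoModelForCausalLM

# Placeholder model id and path.
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Writes the weights and a config.json (the architecture description)
# to a local directory.
model.save_pretrained("./gpt2-snapshot")

# Later, on any machine or provider, the snapshot alone is enough:
restored = AutoModelForCausalLM.from_pretrained("./gpt2-snapshot")
```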