I know you're half joking here but there are more consumer-affordable versions like the Geforce RTX 3090ti ($1600 for 24GB). It may not do CUDA work as fast as the A100 but it'll be able to run the model.
For the half-precision version at 7GB there are a ton more options (the RTX 3060 has 12GB for example at ~$450).
For the half-precision version at 7GB there are a ton more options (the RTX 3060 has 12GB for example at ~$450).