As an aside, since I was trying to install CuPy the other day and was having iss...

kmaehashi · 2024-09-20T17:40:31 1726854031

As a maintainer of CuPy and also as a user of several GPU-powered Python libraries, I empathize with the frustrations and difficulties here. Indeed, one thing CuPy values is to make the installation step as easy and universal as possible. We strive to keep the binary package footprint small (currently less than 100 MiB), keep dependencies to a minimum, support wide variety of platforms including Windows and aarch64, and do not require a specific CUDA Toolkit version.

If anyone reading this message has encountered a roadblock while installing CuPy, please reach out. I'd be glad to help you.

ttyprintk · 2024-09-20T21:24:19 1726867459

You can jam the argument cudatoolkit=1.2.3 when creating conda environments.

NB I’m using Miniforge.

mardifoufs · 2024-09-20T21:05:56 1726866356

One way to do it is to explicitly add the link to say, the pytorch+CUDA wheel from the pytorch repos in your requirements.txt instead of using the normal pypi package. Which also sucks because you then have to do some other tweaks to make your requirements.txt portable across different platforms...

(and you can't just add another index for pip to look for if you want to use python build so it has to be explicitly linked to the right wheel, which absolutely sucks especially since you cannot get the CUDA version from pypi)

welder · 2024-09-23T09:58:03 1727085483

Yes, you need to install the right version or Cupy hangs forever when installing via pip:

    pip install cupy-cuda12x

coeneedell · 2024-09-20T15:10:48 1726845048

Ugh… docker containers. I also wish there was a simpler way but I don’t think there is.

SubiculumCode · 2024-09-20T15:11:34 1726845094

this is not what I wanted to hear. NOT AT ALL. Please whisper sweet lies into my ears.

coeneedell · 2024-09-20T15:23:07 1726845787

At the moment I’m working on a system to quickly replicate academic deep learning repos (papers) at scale. At least Amazon has a catalogue of prebuilt containers with cuda/pytorch combos. I still occasionally have an issue where the container works on my 3090 test bench but not on the T4 cloud node…

m_d_ · 2024-09-20T15:28:31 1726846111

conda provides cudatoolkit and associated packages. Does this solve the situation?

SubiculumCode · 2024-09-20T21:22:47 1726867367

Actually yes it does....except I seem to remember that it doesn't go back that far in cuda versions. I can't seem to find it again right now.

nyrikki · 2024-09-20T15:46:33 1726847193

The condos 200-employee threshold licence change is problematic for some.

boldlybold · 2024-09-20T16:07:33 1726848453

As long as you stay out of the "defaults" and "anaconda" repos, you're not subject to that license. For my needs conda-forge and bioconda have everything. I'm not sure about the nvidia repo but I assume it's similar.

kmaehashi · 2024-09-20T16:47:35 1726850855

Actually all CUDA Toolkit libs are already available through the conda-forge channel: https://anaconda.org/conda-forge/cuda-cudart, https://anaconda.org/conda-forge/libcublas, etc.

whimsicalism · 2024-09-20T17:12:26 1726852346

in real life everyone just uses containers, might not be the answer you want to hear though

SubiculumCode · 2024-09-20T23:16:38 1726874198

I'm okay with containers generally, I think. Is this a situation where you put your code into the container and run it, or does the code make calls to the container's gpu?