I’ve found it to be pretty terrible compared to CUDA, especially with Huggingfac...

teaearlgraycold · 2024-05-07T23:17:26 1715123846

Yeah. It’s good with YOLO and Dino though. My M2 Max can compute Dino embeddings faster than a T4 (which is the GPU in AWS’s g4dn instance type).

ein0p · 2024-05-07T23:37:58 1715125078

MLX will probably be even faster than that, if the model is already ported. Faster startup time too. That’s my main pet peeve though: there’s no technical reason why PyTorch couldn’t be just as good. It’s just underfunding and neglect

whimsicalism · 2024-05-08T01:40:48 1715132448

t4's are like 6 years old