Training and inference on GPUs significantly underutilize… the GPUs. So tuning and various tricks need to be applied to achieve dramatic performance gains. If I am not good at cooking, giving me a larger kitchen will not make me faster or better.
For example, some stats from Whisper [0] (transcribing 30 seconds of audio) show the following for the medium model (see other models in the link):
---
Device  Model   Precision      Layer      Time
GPU     medium  fp32           Linear     1.7 s
CPU     medium  fp32           nn.Linear  60.7 s
CPU     medium  qint8 (quant)  nn.Linear  23.1 s
---
So the same model runs 35.7x faster on the GPU than on the CPU (60.7 s / 1.7 s), and is still 13.6x faster than the quantized, "optimized" CPU version (23.1 s / 1.7 s).
I was expecting around an order of magnitude of improvement.
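For context, the qint8 (quant) rows in that repo come from PyTorch's dynamic quantization of the Linear layers. A minimal sketch of the idea, using a toy stack of Linear layers rather than Whisper itself, so the timings are only illustrative:

    import time
    import torch
    import torch.nn as nn

    # Toy stand-in for the Linear-heavy part of a transformer.
    model = nn.Sequential(*[nn.Linear(1024, 1024) for _ in range(8)]).eval()

    # Dynamic quantization: nn.Linear weights are stored as int8 and
    # dequantized on the fly; activations stay in float.
    quantized = torch.quantization.quantize_dynamic(
        model, {nn.Linear}, dtype=torch.qint8
    )

    x = torch.randn(16, 1024)
    with torch.no_grad():
        for m, name in [(model, "fp32"), (quantized, "qint8")]:
            start = time.perf_counter()
            for _ in range(50):
                m(x)
            print(f"{name}: {time.perf_counter() - start:.2f}s")

Dynamic quantization mainly cuts memory traffic on the CPU; as far as I know it is CPU-only in PyTorch, which would be why the GPU row in the table stays fp32.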
Then again, I do not know whether, in the article's case, the entire model ran on the GPU or only a fraction of it (22 layers) with the remainder on the CPU, which might explain the result. Apparently the latter is the case, but I don't know much about this stuff.
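If that's right, the gap is less surprising: with a split model, every forward pass crosses the CPU/GPU boundary and the CPU layers become the bottleneck. A hypothetical sketch of such a split (the class, layer sizes, and total layer count are made up for illustration; only the 22-layer figure comes from the discussion above):

    import torch
    import torch.nn as nn

    device = "cuda" if torch.cuda.is_available() else "cpu"

    # Hypothetical split: the first n_gpu_layers of a layer stack run on
    # the GPU, the rest on the CPU, with activations moved in between.
    class SplitStack(nn.Module):
        def __init__(self, layers, n_gpu_layers):
            super().__init__()
            self.gpu_layers = nn.ModuleList(layers[:n_gpu_layers]).to(device)
            self.cpu_layers = nn.ModuleList(layers[n_gpu_layers:])

        def forward(self, x):
            x = x.to(device)
            for layer in self.gpu_layers:
                x = layer(x)
            x = x.to("cpu")  # transfer + CPU tail can dominate total runtime
            for layer in self.cpu_layers:
                x = layer(x)
            return x

    layers = [nn.Linear(1024, 1024) for _ in range(32)]
    model = SplitStack(layers, n_gpu_layers=22)
    out = model(torch.randn(16, 1024))

In a setup like that, Amdahl's law applies: even if the 22 GPU layers took zero time, the layers left on the CPU would bound the overall speedup.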
[0] https://github.com/MiscellaneousStuff/openai-whisper-cpu