Really great breakdown! Another important thing to think about when building out a production (or even dev) environment is the full set of associated costs (e.g. bandwidth, storage, egress, etc.). These costs can get pretty obscene in the public cloud, and we try to optimize as much as possible to keep the total cost down.
A few other hidden costs worth keeping in mind:
- How much time (= cost) it takes just to get up and running, especially in enterprise environments.
- The time spent getting into the cloud at all: starting up and shutting down VMs, and waiting on shared GPU resources.
- The cost of lock-in. Ecosystems are great, but sometimes you just need the basics to work.
Some providers are much better than others at this.
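To make that concrete, here's a minimal back-of-the-envelope sketch in Python of how those extra line items and idle hours add up. All rates below are made-up placeholders, not any particular provider's pricing; substitute your own numbers.

    # Rough monthly GPU spend estimate. All rates are hypothetical placeholders.
    GPU_HOURLY_RATE = 0.90      # $/hr for the instance (hypothetical)
    STORAGE_GB_MONTH = 0.10     # $/GB-month of attached storage (hypothetical)
    EGRESS_PER_GB = 0.09        # $/GB of outbound bandwidth (hypothetical)

    def monthly_cost(train_hours, idle_hours, storage_gb, egress_gb):
        """Total monthly spend, counting hours billed while the instance
        sits idle (provisioning, debugging, VMs left running)."""
        compute = (train_hours + idle_hours) * GPU_HOURLY_RATE
        storage = storage_gb * STORAGE_GB_MONTH
        egress = egress_gb * EGRESS_PER_GB
        return compute + storage + egress

    # 120 hrs of training, 40 idle hrs, 500 GB of datasets, 200 GB pulled back out:
    print(monthly_cost(120, 40, 500, 200))  # 212.0

Even in this toy example, storage, egress, and idle time are nearly half the bill, which is exactly the kind of thing a per-hour price comparison misses.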
“IBM Softlayer and LeaderGPU appear expensive, mainly due to under-utilisation of their multi-GPU instances. The benchmark was carried out using the Keras framework whose multi-GPU implementation was surprisingly inefficient, at times performing worse than a single GPU run on the same machine.” - This is unacceptable in a benchmark like this: an entire software stack is influencing the results, and much of the hardware being compared is dissimilar.
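For context, the multi-GPU path in Keras at the time was presumably the data-parallel multi_gpu_model wrapper, which splits each batch across replicas and then concatenates the sub-batch outputs back on a single device. A minimal sketch of that pattern, assuming Keras 2.0.9+ and a 4-GPU machine (the model and batch size here are just illustrative):

    # Sketch of the kind of data-parallel setup the benchmark presumably used.
    # multi_gpu_model replicates the model across GPUs, splits each batch into
    # sub-batches, and concatenates the results on one device, which is a
    # common bottleneck behind the poor scaling described above.
    from keras.applications import ResNet50
    from keras.utils import multi_gpu_model

    base_model = ResNet50(weights=None, classes=1000)
    parallel_model = multi_gpu_model(base_model, gpus=4)
    parallel_model.compile(optimizer='sgd', loss='categorical_crossentropy')
    # parallel_model.fit(x_train, y_train, batch_size=256)  # split 4 ways per step

Because that merge step serializes on one device, throughput can flatten out or even regress as GPUs are added, so the benchmark ends up measuring the framework's scaling behaviour at least as much as the providers' hardware.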
Disclosure: I'm one of the founders of Paperspace (https://www.paperspace.com) and we spend a lot of time thinking about GPU compute and pricing. Happy to answer any questions here