
Just tried the patch in Google Colab and results for the example code were actually about 20% slower than without the patch.

https://imgur.com/a/7EmlYJy

What am I missing?

edit: it seems my instance was using AMD EPYC.



You have to import KMeans again after patching.

Also, your example uses a tiny problem size, so the timings fluctuate a lot. Basically, your code is running the stock implementation both times.
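A rough sketch of why the re-import matters, assuming the patch works by rebinding attributes on the library's modules (an assumption about the mechanism, not something confirmed here). `fakelib`, `StockKMeans`, `FastKMeans` and `patch()` are made-up stand-ins, not real scikit-learn or extension names:

```python
import sys
import types

# Hypothetical stand-in for a library module; this is NOT the real scikit-learn.
lib = types.ModuleType("fakelib")


class StockKMeans:
    backend = "stock"


class FastKMeans:
    backend = "accelerated"


lib.KMeans = StockKMeans
sys.modules["fakelib"] = lib  # make `from fakelib import ...` resolvable


def patch():
    """Hypothetical patch: rebinds the module attribute to the fast class."""
    lib.KMeans = FastKMeans


from fakelib import KMeans  # name bound BEFORE patching

patch()
stale = KMeans  # the old binding still points at the stock class

from fakelib import KMeans  # re-import looks up the patched attribute

fresh = KMeans

print(stale.backend, fresh.backend)  # prints: stock accelerated
```

So if KMeans was imported before the patch call and not imported again afterwards, both timing runs would be measuring the same stock class.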

This extension would also bring a performance gain on AMD, although it would be better optimized for Intel.



