https://imgur.com/a/7EmlYJy
What am I missing?
edit: it seems my instance was using AMD EPYC.
Also in you example it's tiny problem size that have lot of fluctuations. Basically in you code you are running stock both times.
this extension also would bring perf to AMD, although Intel would be better optimized
https://imgur.com/a/7EmlYJy
What am I missing?
edit: it seems my instance was using AMD EPYC.