The FP_ARITH_INST_RETIRED.512B_PACKED_SINGLE performance counter _seems_ to be the hardware performance counter tracking AVX512-32-bit instructions.
You wouldn't even need a debugger. Just a reset: turn on hardware performance counters in the BIOS and run a profiler that can read the hardware performance counters. I don't know if that's the performance counter for sure that we're looking for. There's a few other 512B counters there.
Well, if we're optimizing a terrible idea anyway...
https://download.01.org/perfmon/index/skylake_server.html
The FP_ARITH_INST_RETIRED.512B_PACKED_SINGLE performance counter _seems_ to be the hardware performance counter tracking AVX512-32-bit instructions.
You wouldn't even need a debugger. Just a reset: turn on hardware performance counters in the BIOS and run a profiler that can read the hardware performance counters. I don't know if that's the performance counter for sure that we're looking for. There's a few other 512B counters there.