Just a side rant here... I'm really frustrated I can't monitor the Neural Engine's usage in the M1 in my MacBook Air. Apparently Apple did not build an API for extracting data points from the these 16 cores, so I can only watch what the CPU and GPU are doing when running and optimizing Tensorflow models while the NE remains a black box.