While the single benchmark provided probably isn't statistically significant, I noticed that performance on system with 4 cores of less is worse with this patch.
If this is actually symptom of the patch, would it be possible to toggle lock behavior depending on available cores?
Currently, the number of sublists created is equal to the number of cores on the system.
While it'd be fairly trivial to extend the code to make it easy for an admin to define the number of sublists, I don't have any reason to believe that'd be a useful knob to export.
If this is actually symptom of the patch, would it be possible to toggle lock behavior depending on available cores?