Extremely poor VK_EXT_device_generated_commands performance

I don’t think that is weird. Internally, in our implementation, the two methods should generate similar HW instructions, and I wouldn’t expect more than 2-3% variance across runs.