Cuda memory pool performance issue

I suggest filing a bug.