Memory allocation time

Community,

I am trying to measure memory allocation time to make sense of its impact upon the total execution time.

When I use pinned memory (cudaMallocHost) the memory allocation time is much less than when I use traditional malloc.

Does that make sense? my understanding is that when allocating pinned memory, the lower layer will spend more time to build the requested contiguous allocation space.

Cheers,
./abdullah