No cudaMemsetAsync?

It seems like there is no support for asynchronous device memory memset. Why not?

Programming guide: Page 32:

“In order to facilitate concurrent execution between host and device, some runtime
functions are asynchronous: Control is returned to the application before the device
has completed the requested task. These are:
Kernel launches through global functions or cuLaunchGrid() and
cuLaunchGridAsync();
The functions that perform memory copies and are suffixed with Async;
The functions that perform device ↔ device memory copies;
The functions that set memory.