Dynamic Parallelism and cuDNN

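I would like to run a computation entirely on the device using the cuDNN library, for example by calling cuDNN routines from device code launched via dynamic parallelism rather than from the host. Is this possible? If so, how would one go about doing it?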