Libcu++ monotonically increasing clock in host code

Where is that defined? Did you mean std::chrono::steady_clock?

If you are using std::chrono in host code, there should be no issue. The restriction on std:: facilities applies to device code only. When you say “time CUDA kernels”, I assume you are referring to host-based timing methods, where the clock is read on the CPU before and after the GPU work.

You can find plenty of examples scattered around the web of people using std::chrono in host code that is compiled by nvcc. Here is one example. Here is another. Here is an example that uses std::chrono::steady_clock.
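For what it's worth, here is a minimal sketch of the host-side pattern I have in mind. It is plain C++ so it builds with or without nvcc. The names elapsed_ms and fake_workload are hypothetical, made up for illustration, and fake_workload is only a stand-in for GPU work. In a real .cu file I would assume you replace it with your kernel launch followed by cudaDeviceSynchronize(), since kernel launches are asynchronous and the stop time would otherwise be read before the kernel finishes.

```cpp
#include <cassert>
#include <chrono>
#include <thread>

// steady_clock is required by the standard to be monotonic.
static_assert(std::chrono::steady_clock::is_steady,
              "steady_clock must be monotonic");

// Hypothetical helper (not part of any library): runs `work` once and
// returns the elapsed wall-clock time in milliseconds, measured on the host.
template <typename F>
double elapsed_ms(F&& work) {
    using clock = std::chrono::steady_clock;
    const auto start = clock::now();
    work();  // in CUDA code: kernel<<<...>>>(...); cudaDeviceSynchronize();
    const auto stop = clock::now();
    assert(stop >= start);  // a steady clock never goes backwards
    return std::chrono::duration<double, std::milli>(stop - start).count();
}

// Stand-in for the GPU work, so the sketch runs without a GPU.
inline void fake_workload() {
    std::this_thread::sleep_for(std::chrono::milliseconds(20));
}
```

You would call it as double ms = elapsed_ms(fake_workload); and print or log the result. Note that this measures launch overhead plus execution as seen from the host, which is usually what people mean by host-based timing.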

(I guess if it were me, I would just give it a try.)