Division is missing

Looking through the documentation, there doesn’t seem to be any way to do pointwise division of two tensors. This is needed for example, in layer normalization where you divide by the standard deviation of each channel. Is there really no way to divide two tensors in cudnn other than copying to the CPU, doing the division, and copying back to the GPU?