I noticed that calling cudnnSetTensor4dDescriptor with the CUDNN_TENSOR_NHWC format returns 9 (“not supported”). However,
int ns = c*w*h; int cs = 1; int hs = c*w; int ws = c; status = cudnnSetTensor4dDescriptorEx(d, dataType, n, c, h, w, ns, cs, hs, ws);
which should be equivalent, returns “success”. Why the discrepancy, and is the NHWC format supported at all?