I rewrote YOLOv4 directly on top of the CUDA libraries, and it works on Windows. However, when I compile on Linux (Ubuntu 18.04), I get runtime errors when executing the function cutensorElementwiseBinary. The error code is 15, “CUTENSOR_STATUS_NOT_SUPPORTED”. Upon investigation, it seems that cutensorInitTensorDescriptor fills out the descriptor differently on the two operating systems. I am using CUDA 11 and cuTENSOR version 1.3.3 on both.
The code can be found here: GitHub - TKGgunter/yolov4_tiny_rs: A Rust implementation of yolov4_tiny algorithm.
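One way to check the suspicion that the descriptor differs between the two systems would be to dump its raw bytes right after cutensorInitTensorDescriptor returns on each OS and diff the two dumps. The sketch below is only an illustration: `hex_dump` and `StandInDescriptor` are hypothetical names, and the stand-in struct does not claim to match the real layout of the cuTENSOR descriptor type.

```rust
use std::mem::size_of;

/// Hypothetical stand-in for an opaque, C-layout descriptor.
/// This is NOT the real cuTENSOR layout; substitute the binding's own type.
#[repr(C)]
#[derive(Clone, Copy)]
struct StandInDescriptor {
    fields: [i64; 2],
}

/// Return the raw bytes of a plain-old-data value as space-separated hex.
/// Intended for types without padding, such as arrays of integers.
fn hex_dump<T: Copy>(value: &T) -> String {
    // View the value as a byte slice of exactly size_of::<T>() bytes.
    let bytes = unsafe {
        std::slice::from_raw_parts(value as *const T as *const u8, size_of::<T>())
    };
    bytes
        .iter()
        .map(|b| format!("{:02x}", b))
        .collect::<Vec<_>>()
        .join(" ")
}
```

Printing `hex_dump(&descriptor)` for the same tensor on Windows and on Linux would show whether, and at which offsets, the two descriptors differ.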
Would you be able to isolate the failing call so that we can take a closer look?
cuTENSOR’s logging capabilities (see User Guide — cuTENSOR 1.3.3 documentation) might be useful to you; please see if CUTENSOR_LOG_LEVEL=1 gives you more insights. If not, could you please report the output of CUTENSOR_LOG_LEVEL=5?
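Since the project is written in Rust, the logging variable could also be set from a small launcher rather than from the shell. This is a minimal sketch, assuming the program is started as a child process; `with_cutensor_logging`, `configured_log_level`, and the binary path in the usage note are hypothetical names, and only the CUTENSOR_LOG_LEVEL variable itself comes from the advice above.

```rust
use std::ffi::OsStr;
use std::process::Command;

/// Build a command that runs `binary` with cuTENSOR logging enabled.
/// `level` is passed through the CUTENSOR_LOG_LEVEL environment variable.
fn with_cutensor_logging(binary: &str, level: u8) -> Command {
    let mut cmd = Command::new(binary);
    cmd.env("CUTENSOR_LOG_LEVEL", level.to_string());
    cmd
}

/// Read back the log level configured on a command, if any.
fn configured_log_level(cmd: &Command) -> Option<String> {
    cmd.get_envs()
        .find(|(key, _)| *key == OsStr::new("CUTENSOR_LOG_LEVEL"))
        .and_then(|(_, value)| value.map(|v| v.to_string_lossy().into_owned()))
}
```

For example, `with_cutensor_logging("./yolov4_tiny", 5).status()` would run a (hypothetically named) binary with the most verbose level requested here, with the child's output going to the launcher's terminal.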
I’ll write a smaller standalone program later, but the relevant code links are below. The first link is to the first instance in main where the error is returned. The second is the source code of that function. The output of CUTENSOR_LOG_LEVEL=5 is given below.