cudnn verify failed - Cublas failure

I use nvidia driver 410 + cuda 9.0 + cudnn 7.3, and when I run cudnn sample mnistCUDNN, I get error:

:~/PerformanceTest/cudnn_samples_v7/mnistCUDNN make clean
rm -rf *o
rm -rf mnistCUDNN

:~/PerformanceTest/cudnn_samples_v7/mnistCUDNN make
/usr/local/cuda/bin/nvcc -ccbin g++ -I/usr/local/cuda/include -IFreeImage/include -m64 -gencode arch=compute_30,code=sm_30 -gencode arch=compute_35,code=sm_35 -gencode arch=compute_50,code=sm_50 -gencode arch=compute_53,code=sm_53 -gencode arch=compute_53,code=compute_53 -o fp16_dev.o -c fp16_dev.cu
g++ -I/usr/local/cuda/include -IFreeImage/include -o fp16_emu.o -c fp16_emu.cpp
g++ -I/usr/local/cuda/include -IFreeImage/include -o mnistCUDNN.o -c mnistCUDNN.cpp
/usr/local/cuda/bin/nvcc -ccbin g++ -m64 -gencode arch=compute_30,code=sm_30 -gencode arch=compute_35,code=sm_35 -gencode arch=compute_50,code=sm_50 -gencode arch=compute_53,code=sm_53 -gencode arch=compute_53,code=compute_53 -o mnistCUDNN fp16_dev.o fp16_emu.o mnistCUDNN.o -I/usr/local/cuda/include -IFreeImage/include -LFreeImage/lib/linux/x86_64 -LFreeImage/lib/linux -lcudart -lcublas -lcudnn -lfreeimage -lstdc++ -lm

:~/PerformanceTest/cudnn_samples_v7/mnistCUDNN ./mnistCUDNN
cudnnGetVersion() : 7301 , CUDNN_VERSION from cudnn.h : 7301 (7.3.1)
Host compiler version : GCC 5.4.0
There are 4 CUDA capable devices on your machine :
device 0 : sms 46 Capabilities 7.5, SmClock 1815.0 Mhz, MemSize (Mb) 7952, MemClock 7000.0 Mhz, Ecc=0, boardGroupID=0
device 1 : sms 46 Capabilities 7.5, SmClock 1815.0 Mhz, MemSize (Mb) 7952, MemClock 7000.0 Mhz, Ecc=0, boardGroupID=1
device 2 : sms 46 Capabilities 7.5, SmClock 1815.0 Mhz, MemSize (Mb) 7952, MemClock 7000.0 Mhz, Ecc=0, boardGroupID=2
device 3 : sms 46 Capabilities 7.5, SmClock 1815.0 Mhz, MemSize (Mb) 7952, MemClock 7000.0 Mhz, Ecc=0, boardGroupID=3
Using device 0

Testing single precision
Loading image data/one_28x28.pgm
Performing forward propagation …
Testing cudnnGetConvolutionForwardAlgorithm …
Fastest algorithm is Algo 0
Testing cudnnFindConvolutionForwardAlgorithm …
^^^^ CUDNN_STATUS_SUCCESS for Algo 0: 0.012128 time requiring 0 memory
^^^^ CUDNN_STATUS_SUCCESS for Algo 2: 0.194656 time requiring 57600 memory
^^^^ CUDNN_STATUS_SUCCESS for Algo 7: 0.223072 time requiring 2057744 memory
^^^^ CUDNN_STATUS_SUCCESS for Algo 5: 0.336416 time requiring 203008 memory
^^^^ CUDNN_STATUS_SUCCESS for Algo 4: 0.394272 time requiring 207360 memory
Cublas failure
Error code 0
gemv.h:77
Aborting…

How to solve it?

same problem
RTX 2080 Ti
Driver 410
CUDA 9.0
CUDNN 7.1.4

This will help you out.

https://devtalk.nvidia.com/default/topic/1042638/is-nvidia-driver-410-57-incompatible-with-cuda-9-0/?offset=6#5289184

same problem
RTX 2080Ti
NVIDIA Driver 430
CUDA 9.0
CUDNN 7.6.4

I installed CUDA 10.1 instead of 9.0, it works