There’s not enough information to debug, but you really should be using blocks with at least 64 threads. I highly suggest you take the CUDA C++ DLI Courses – NVIDIA
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| How to get the max value in a vector when using cublas | 2 | 5028 | April 28, 2017 | |
| Problem with cublasIsamax | 2 | 1614 | January 9, 2013 | |
| Error using cublasIsamax() | 2 | 1006 | May 30, 2011 | |
| Error while using cublasIcamax with device pointer to store the result | 0 | 698 | June 6, 2011 | |
| cublas source - where? i need to modify the IsaMax function | 0 | 2660 | June 28, 2010 | |
| CUBLAS and cublasIsamax() for complex data type find index of element with complex structure. | 2 | 3927 | September 3, 2010 | |
| Cublas not working for multi -gpu | 4 | 1355 | August 28, 2013 | |
| Max() function? | 1 | 3980 | April 29, 2009 | |
| CUBLAS_V2 - Keep results in GPU or return it to CPU? cublasIdamin function in CUBLAS_V2 | 2 | 3226 | January 31, 2012 | |
| Result of a CUBLAS function | 7 | 10858 | April 22, 2010 |