cuBLAS for lower-end GPUs

technik · May 20, 2016, 8:00am

I’m implementing some machine learning algorithms on CPU, and they are quite intensive on matrix computations. I intended to do an alternative implementation on GPU using cuBLAS and compare performance. However, the whole system must run on rather low-end platforms (cuda compute capabilities 2.1) and I have found that cuBLAS requires higher capabilities. If such is the case, I’d like to implement my own cuda kernels for the BLAS subset I use, and I intend to make it api-compatible with cuBLAS. Now the question is: I read in older topics in this forum, that the source for cuBLAS was available, but it seems to no longer be available. Can I find it somewhere to use it as a reference, or is there any other resource that can help me with implementation?

Thanks in advance.

Robert_Crovella · May 20, 2016, 1:04pm

CUBLAS can work on a cc 2.0 or higher GPU.

Topic		Replies	Views
cublas, gpu or cpu CUDA Programming and Performance	2	5266	December 15, 2008
Compile cublas library optimized to GPU architecture? CUDA Programming and Performance	1	3598	April 15, 2009
Kernel-level cuBLAS GPU-Accelerated Libraries cublas	3	584	October 12, 2021
Run Cuda App/Compiled Code On Other Machines with CUDA-Compatible GPUs GPU-Accelerated Libraries	0	428	April 13, 2019
cuBLAS source code? GPU-Accelerated Libraries	1	1309	November 3, 2017
cublas on kernel Jetson TK1	1	599	July 29, 2016
Multiple Cublas functions on single GPU CUDA Programming and Performance	5	1702	August 8, 2010
Optimizing Sequential cuBLAS Calls for Matrix Operations—Alternatives to Kernel Fusion? GPU-Accelerated Libraries cublas	3	388	April 29, 2024
Emulate another Compute Capability for debugging CUDA Developer Tools	0	339	March 29, 2021
Why CUBLAS performance is not good in kepler. GPU-Accelerated Libraries	2	950	April 11, 2015

cuBLAS for lower-end GPUs

Related topics