cuBlas VBO interoperability Multiplication on linear array

I can create a VBO as a linear float array consisting of x,y,z values. I wonder if it is possible to:

  1. Use the array pointer returned by cudaGLMapBufferObject as a parameter in CUBLAS function.
  2. If yes, with which function I multiply all x,y,z values with a predefined 4*4 matrix with CUBlas.
  3. If no, is it appropriate to write a kernel function to make the mentioned multiplication. What would be the optimal grid, block etc. size to execute the multiplication kernel on a linear array for matrix multiplication.


Yes I can mix CUBLAS and Kernel calls. I can mix cublasAlloc and cudaMalloc

I will try VBO …