Matrix requirements for cusparse*csrgemm2

I would like to perform a matrix multiplication like:

C = A*B*A'

To do this I split it into two matrix-matrix multiplications where all matrices are stored in CSR format with zero based index which is specified in the cusparse matrix description.
However the first matrix multiplication works fine but the second one failed when determining the number of non-zeros for the resulting matrix.
The CSR format of the matrix should be ok. Unfortunately the matrix is so big, that I cannot check the format by hand.
What requirements need the matrices fullfil when performing the cusparse*csrgemm2 function? Are there any helper functions where I can check the matrix? Maybe outside cuda?

Thanks in advance