Hi, I tried a simple case with both hybrid memory mode and hybrid execute mode enabled, but the calculation failed. Is this combination of modes allowed in cuDSS? Or did I miss something from your example, for instance a required synchronization? Below is how I enable both modes.
... ...
// Enable hybrid mode where factors are stored in host memory
// Note: It must be set before the first call to ANALYSIS step.
int hybrid_mode = 1;
cudssConfigSet(solverConfig_, CUDSS_CONFIG_HYBRID_MODE, &hybrid_mode, sizeof(hybrid_mode));
// Enable hybrid execute mode, allowing cuDSS to use both CPU and GPU for kernel execution.
int hybrid_execute_mode = 1;
cudssConfigSet(solverConfig_, CUDSS_CONFIG_HYBRID_EXECUTE_MODE, &hybrid_execute_mode, sizeof(hybrid_execute_mode));
// Symbolic factorization
cudssExecute(handle_, CUDSS_PHASE_ANALYSIS, solverConfig_, solverData_, matrixA_, sol_, rhs_);
... ...
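To narrow down which call actually fails, I could check the status returned by each call instead of ignoring it. Below is a minimal sketch of that pattern. The `Status` enum and the `StatusLog` helper are hypothetical stand-ins so the snippet compiles without cuDSS; in the real code the value being checked would be the `cudssStatus_t` returned by `cudssConfigSet` and `cudssExecute`, compared against `CUDSS_STATUS_SUCCESS`.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for a library status type. In real code this would be
// the library's own status enum with its own success value.
enum class Status { Success = 0, Failure = 1 };

// Collects the labels of all calls that did not return Success, so the first
// failing step (config set, analysis, factorization, solve) can be identified.
struct StatusLog {
    std::vector<std::string> failures;

    // Returns true when the call succeeded; otherwise records which call failed.
    bool check(Status s, const std::string& label) {
        if (s == Status::Success) return true;
        failures.push_back(label);
        return false;
    }
};
```

With this, each call in the snippet above would be wrapped with its own label (for example "set hybrid mode", "set hybrid execute mode", "analysis"), and the first entry in `failures` would show where things go wrong.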