CUDA error: operation not permitted when stream is capturing

shubhamsp2195 · February 10, 2023, 6:41am

Execution of my modulus code is resulting in the following error.

[code]
[06:15:56] - attempting to restore from: outputs/Battery
[06:15:56] - Success loading optimizer: outputs/Battery/optim_checkpoint.0.pth
[06:15:56] - Success loading model: outputs/Battery/battery_network.0.pth
[06:15:57] - [step:          0] record constraint batch time:  4.146e-01s
[06:15:57] - [step:          0] saved checkpoint to outputs/Battery
[06:15:57] - [step:          0] loss:  2.148e+01
[06:16:09] - Attempting cuda graph building, this may take a bit...
Error executing job with overrides: []
Traceback (most recent call last):
  File "/modulus/modulus/trainer.py", line 728, in _cuda_graph_training_step
    self.loss_static, self.losses_static = self.compute_gradients(
  File "/modulus/modulus/trainer.py", line 54, in adam_compute_gradients
    losses_minibatch = self.compute_losses(step)
  File "/modulus/modulus/solver/solver.py", line 52, in compute_losses
    return self.domain.compute_losses(step)
  File "/modulus/modulus/domain/domain.py", line 133, in compute_losses
    constraint.forward()
  File "/modulus/modulus/domain/constraint/continuous.py", line 116, in forward
    self._output_vars = self.model(self._input_vars)
  File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1186, in _call_impl
    return forward_call(*input, **kwargs)
  File "/modulus/modulus/graph.py", line 220, in forward
    outvar.update(e(outvar))
  File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1186, in _call_impl
    return forward_call(*input, **kwargs)
  File "/modulus/modulus/utils/sympy/torch_printer.py", line 274, in forward
    output = self.torch_expr(args)
  File "<lambdifygenerated-7>", line 3, in _lambdifygenerated
    return (-3.85e-11*sqrt(c)*sqrt(c_s)*sqrt(28606 - c_s)*(-2.71828**(-Phi_1 + Phi_2) + 2.71828**(Phi_1 - Phi_2)) + j_n)
  File "/opt/conda/lib/python3.8/site-packages/torch/_tensor.py", line 32, in wrapped
    return f(*args, **kwargs)
  File "/opt/conda/lib/python3.8/site-packages/torch/_tensor.py", line 671, in __rpow__
    return torch.tensor(other, dtype=dtype, device=self.device) ** self
RuntimeError: CUDA error: operation not permitted when stream is capturing

[/code]

The culprit seems to be the constraint corresponding to the equation

(-3.85e-11*sqrt(c)*sqrt(c_s)*sqrt(28606 - c_s)*(-2.71828**(-Phi_1 + Phi_2) + 2.71828**(Phi_1 - Phi_2)) + j_n)

as can be seen from the error. What can be the potential causes for this issue? Is it possible that exponential terms are too large for the gradients to be computed?

ngeneva · February 11, 2023, 12:30am

Hi @shubhamsp2195

This error occurs when there’s a tensor / cuda object getting created or transferred inside a recorded graph. All CUDA objects need to be initialized and on the GPU prior to recording a graph. I’m not sure why exactly this is occurring for you, but you can shut off Cuda graphs in your config.yaml using cuda_graphs = False.

zejun.chen · October 29, 2025, 3:41pm

Hi, @ngeneva

How are you? Nice to meet you! May I know if there is any method to debug the cuda graph error, such like ‘operation not permitted when stream is capturing‘ ? Something like env flags can tell the frontend users which ops are not permitted during capturing

Thank you

Topic		Replies	Views
CUDA Graph capture - work on separated streams invalidates graph capture CUDA Programming and Performance	5	713	May 1, 2025
Multistream in cudagraph capturing CUDA Programming and Performance	1	480	February 6, 2025
Capturing a graph launch CUDA Programming and Performance	0	393	July 11, 2023
[CUDA Graph] cuBLAS routine produces incorrect result after calling cudaStreamBeginCapture CUDA Programming and Performance	2	765	March 2, 2022
Issues Running TensorRT Inference on Jetson Orin: CUDA Stream Capture Errors General tensorrt , cuda	1	354	July 7, 2025
CUDA Graph and TensorRT batch inference TensorRT tensorrt , cuda , kernel	2	3948	October 12, 2021
Prohibited and Unhandled Operations in CUDA graphs CUDA Programming and Performance	2	593	March 7, 2023
Why cudaGraphLaunch(graph_exec_, stream1) dont run the graph at stream1 CUDA Programming and Performance cuda , graphics	1	95	June 6, 2025
Access Violation while launching Graph CUDA Programming and Performance	2	614	July 29, 2022
FFT Execution Error: Cufft Callbacks During Graph Capture not permitted GPU-Accelerated Libraries cufft	2	709	February 8, 2023

CUDA error: operation not permitted when stream is capturing

Related topics