ERROR: engine.cpp (370) - Cuda Error in ~ExecutionContext: 77

edit_or · June 19, 2019, 4:55am

I do Int8 calibration using TensorRT.

Once calibration is completed and test the inference. I have error at

stream.synchronize()

in the following function.

No issue running on FP32 and FP16 engines. Only have error running at Int8 engine. What could be wrong?

def infer(engine, x, batch_size, context):
  inputs = []
  outputs = []
  bindings = []
  stream = cuda.Stream()
  for binding in engine:
    size = trt.volume(engine.get_binding_shape(binding)) * batch_size
    dtype = trt.nptype(engine.get_binding_dtype(binding))
    # Allocate host and device buffers
    host_mem = cuda.pagelocked_empty(size, dtype)
    device_mem = cuda.mem_alloc(host_mem.nbytes)
    # Append the device buffer to device bindings.
    bindings.append(int(device_mem))
    # Append to the appropriate list.
    if engine.binding_is_input(binding):
      inputs.append(HostDeviceMem(host_mem, device_mem))
    else:
      outputs.append(HostDeviceMem(host_mem, device_mem))
    #img = np.array(x).ravel()
    im = np.array(x, dtype=np.float32, order='C')
    im = im[:,:,::-1]
    #im = im.transpose((2,0,1))
    #np.copyto(inputs[0].host, x.flatten())  #1.0 - img / 255.0
    np.copyto(inputs[0].host, im.flatten())
    [cuda.memcpy_htod_async(inp.device, inp.host, stream) for inp in inputs]
    context.execute_async(batch_size=batch_size, bindings=bindings, stream_handle=stream.handle)
    # Transfer predictions back from the GPU.
    [cuda.memcpy_dtoh_async(out.host, out.device, stream) for out in outputs]
    # Synchronize the stream
    stream.synchronize()
    # Return only the host outputs.

Topic		Replies	Views
An illegal memory access was encountered using PyCUDA and TensorRT TensorRT	0	1078	June 6, 2019
[TensorRT] ERROR: 1: [resize.cu::performLinearKernelLaunch::457] Error Code 1: Cuda Runtime (invalid argument) TensorRT tensorrt , cupy	4	5210	June 14, 2022
I have an internal error with the engine.cpp as you can see below TensorRT tensorrt , cuda , ubuntu	2	331	March 14, 2024
Cuda Error in executeMemcpy: 1 (invalid argument) TensorRT	5	2252	October 13, 2021
Internal Error (Parameter check failed at: runtime/api/executionContext.cpp::resolveSlots::1495, condition: allInputDimensionsSpecified(routine) TensorRT	2	1245	June 28, 2022
[genericReformat.cuh::copyPackedRunKernel::1487] Error Code 1: Cuda Runtime (invalid resource handle) TensorRT	0	82	August 19, 2024
../rtSafe/cuda/cudaConvolutionRunner.cpp (483) - Cudnn Error in executeConv: 3 (CUDNN_STATUS_BAD_PARAM) TensorRT	3	708	November 2, 2022
TensorRT10.3 Cuda Runtime Error When Directly Using cuda device inputs for function execute_async_v3 TensorRT cuda	1	43	April 22, 2025
Confusion about TensorRT stream.synchronize() in GPU-only inference TensorRT tensorrt	1	572	November 2, 2022
Running 2 models on the same GPU with TensorRT TensorRT	7	1248	January 15, 2021

ERROR: engine.cpp (370) - Cuda Error in ~ExecutionContext: 77

Related topics