Error while running resnet10.caffemodel_b1_int8.engine file with TensorRT

Pritam · April 3, 2020, 10:44am

Hello,
I am trying to run object detection with TensorRT using the resnet10.caffemodel_b1_int8.engine file, but I always get the same value of h_output[0], no matter which image I use.
I am attaching the code below.

import tensorrt as trt
import os
import sys
import cv2
import time
import ctypes
import numpy as np
import pycuda.autoinit
import pycuda.driver as cuda

labels = ['Car','Person','Bicycle','Roadsign']

# initialize
TRT_LOGGER = trt.Logger(trt.Logger.INFO)
trt.init_libnvinfer_plugins(TRT_LOGGER, '')
runtime = trt.Runtime(TRT_LOGGER)

with open("resnet10.caffemodel_b1_int8.engine", 'rb') as f:
    buf = f.read()
    engine = runtime.deserialize_cuda_engine(buf)

# create buffer
host_inputs = []
cuda_inputs = []
host_outputs = []
cuda_outputs = []
bindings = []
stream = cuda.Stream()

for binding in engine:
    size = trt.volume(engine.get_binding_shape(binding)) * engine.max_batch_size
    print("Binding shape is : ",engine.get_binding_shape(binding))
    # host_mem = cuda.pagelocked_empty(size, np.int8)
    host_mem = cuda.pagelocked_empty(size, np.float32)
    cuda_mem = cuda.mem_alloc(host_mem.nbytes)

    bindings.append(int(cuda_mem))
    if engine.binding_is_input(binding):
        host_inputs.append(host_mem)
        cuda_inputs.append(cuda_mem)
    else:
        host_outputs.append(host_mem)
        cuda_outputs.append(cuda_mem)

context = engine.create_execution_context()

#TODO enable video pipeline
#TODO using pyCUDA for preprocess
ori = cv2.imread("sample_720p.jpg")
image = cv2.cvtColor(ori, cv2.COLOR_BGR2RGB)
# image = cv2.resize(image, (model.dims[2],model.dims[1]))
image = cv2.resize(image, (640,368))
image = image.astype(np.int8)
# print("Output shape of the model is : ",engine.output_shape)
# image = cv2.resize(image, (1280,720))
image = (2.0/255.0) * image - 1.0
image = image.transpose((2, 0, 1))

print("image_ravel_valus is : ",image.ravel())
print("value of host_inputs is : ",host_inputs[0])
# image = np.uint8(image)
np.copyto(host_inputs[0], image.ravel())

start_time = time.time()
cuda.memcpy_htod_async(cuda_inputs[0], host_inputs[0], stream)
context.execute_async(bindings=bindings, stream_handle=stream.handle)
cuda.memcpy_dtoh_async(host_outputs[1], cuda_outputs[1], stream)
cuda.memcpy_dtoh_async(host_outputs[0], cuda_outputs[0], stream)
stream.synchronize()
print("execute times "+str(time.time()-start_time))

output = host_outputs[0]
height, width, channels = ori.shape
print("output is : ",len(output))
model_layout = 7
for i in range(int(len(output)/model_layout)):
    prefix = i*model_layout
    index = int(output[prefix+0])
    label = int(output[prefix+1])
    conf = output[prefix+2]
    xmin = int(output[prefix+3]*width)
    ymin = int(output[prefix+4]*height)
    xmax = int(output[prefix+5]*width)
    ymax = int(output[prefix+6]*height)
    print("index : ",index)
    print("label : ",label)
    print("conf : ",conf)
    print("xmin , ymin, xmax, ymax", xmin,ymin,xmax,ymax)

    if conf > 0.7:
        print("Detected {} with confidence {}".format(labels[label], "{0:.0%}".format(conf)))
        cv2.rectangle(ori, (xmin,ymin), (xmax, ymax), (0,0,255),3)
        cv2.putText(ori, labels[label],(xmin+10,ymin+10), cv2.FONT_HERSHEY_SIMPLEX, 1, (255,255,255), 2, cv2.LINE_AA)

cv2.imwrite("result.jpg", ori)
cv2.imshow("result", ori)
cv2.waitKey(0)

Please help me find where I am going wrong.
Also, if there is another TensorRT sample with which I can run object detection using the .engine file used in DS and the .trt file generated by the Transfer Learning Toolkit, please let me know.
Thanks.
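
One detail in the script above that may be worth checking: cv2.imread returns uint8 pixels in the range 0..255, and image.astype(np.int8) wraps every value above 127 to a negative number before the scaling is applied. The sketch below only illustrates that wraparound and a float32 alternative; the helper name preprocess is hypothetical, the (2/255) * x - 1 scaling is copied from the post, and whether either choice matches what the engine expects is an assumption, so this is not a confirmed fix.

```python
import numpy as np

def preprocess(image_hwc_uint8):
    """Convert an HWC uint8 image to a flat CHW float32 array in [-1, 1].

    The (2/255) * x - 1 scaling is taken from the post above; it is an
    assumption that the engine was built for this normalisation.
    """
    image = image_hwc_uint8.astype(np.float32)  # keeps 0..255, no wraparound
    image = (2.0 / 255.0) * image - 1.0
    image = image.transpose((2, 0, 1))          # HWC -> CHW
    return np.ascontiguousarray(image).ravel()

# A tiny 1x2 "image" stands in for the resized frame.
pixels = np.array([[[0, 128, 255], [200, 100, 50]]], dtype=np.uint8)

wrapped = pixels.astype(np.int8)  # what the cast in the post does
print(wrapped[0, 1, 0])           # 200 is stored as -56

flat = preprocess(pixels)
print(flat.dtype, flat.min(), flat.max())
```

Since the host buffers in the post are allocated as np.float32, the flat float32 array can be copied into host_inputs[0] with np.copyto without any int8 cast.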

SunilJB · April 6, 2020, 6:16am

Hi,

Please refer to the sample below:
https://github.com/NVIDIA-AI-IOT/tlt-iva-examples

Thanks

Pritam · April 6, 2020, 7:41am

Thanks, SunilJB, for your response.
I used TLT for training and tested those weights on DS-4.0.1, where they work fine. However, I want to run the weights without the gst pipeline, so I have to customise my code. When I tried customising the DeepStream code base I ran into many issues that took too much time to resolve, which is why I want to run plain inference by importing tensorrt and its libraries. I also do not want to use the tlt-infer tool, because that would require the TLT container.
So please let me know how I should proceed.

import numpy as np
import imageio
import tensorrt.legacy as trt
import matplotlib.pyplot as plt
import utils
import cv2

ENGINE_PATH = '…/Secondary_VehicleTypes/resnet18.caffemodel_b16_int8.engine'
CLASSES = ['coupe','largevehicle','sedan','suv','truck','van']

CROP_SIZE = (224,224)
DATA_TYPE = trt.infer.DataType.INT8

engine = trt.lite.Engine(PLAN = ENGINE_PATH, data_type = DATA_TYPE)

INPUT_IMAGE_PATH = 'car3.jpg'

def prepare_image(image_in,crop_size,data_type):
    img = cv2.resize(image_in, (224,224))
    img = img.astype(np.int8)
    img = img.transpose(2,0,1) #to CHW
    return img

img = imageio.imread(INPUT_IMAGE_PATH, pilmode='RGB')
img = prepare_image(img,CROP_SIZE,DATA_TYPE)
out = engine.infer(img)
print('Prediction : {}'.format(CLASSES[np.argmax(out[0])]))

I used this code for classification, but the results were not that good.
I want similar code with which I can run a detection model (primary detector weights) and get the bounding boxes, precision, and detected-person results.

Thanks.
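
For reference, the post-processing loop from the first post can be pulled out into a small function that returns boxes, confidences, and labels, and can then be filtered for persons. This is only a sketch: the function name decode_detections and the sample values are hypothetical, and it assumes the output really is a flat list of 7-value records (image id, label, confidence, xmin, ymin, xmax, ymax), as the first script does. The thread does not confirm that the resnet10 engine emits this layout; if the printed binding shapes do not fit it, this decoder does not apply.

```python
LABELS = ['Car', 'Person', 'Bicycle', 'Roadsign']  # from the first post
VALUES_PER_DETECTION = 7  # assumed: image id, label, conf, xmin, ymin, xmax, ymax

def decode_detections(output, width, height, min_conf=0.7):
    """Turn a flat sequence of 7-value records into pixel-space detections."""
    detections = []
    for i in range(len(output) // VALUES_PER_DETECTION):
        rec = output[i * VALUES_PER_DETECTION:(i + 1) * VALUES_PER_DETECTION]
        label, conf = int(rec[1]), float(rec[2])
        # Skip weak detections and label indices outside the known classes.
        if conf <= min_conf or not 0 <= label < len(LABELS):
            continue
        # Coordinates are assumed to be normalised to 0..1.
        box = (int(rec[3] * width), int(rec[4] * height),
               int(rec[5] * width), int(rec[6] * height))
        detections.append({'label': LABELS[label], 'conf': conf, 'box': box})
    return detections

# Hypothetical output values, made up purely for illustration.
fake_output = [0, 1, 0.9, 0.25, 0.5, 0.5, 0.75,
               0, 0, 0.3, 0.1, 0.1, 0.2, 0.2]

people = [d for d in decode_detections(fake_output, 1280, 720)
          if d['label'] == 'Person']
print(people)
```

Keeping the decoding separate from the drawing code makes it easier to check the raw numbers before anything is rendered with cv2.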

SunilJB · April 13, 2020, 7:57am

Hi,

For a plain inference sample using TRT, you can refer to the link below:
https://github.com/NVIDIA/TensorRT/tree/master/samples/opensource

Thanks
