How to get cudastream in customparser code

sharma.rahul98912 · October 6, 2021, 3:44pm

• Hardware Platform: NVIDIA Tesla T4
• DeepStream Version: 5.1
• TensorRT Version: 7.2.2.3
• NVIDIA GPU Driver Version (valid for GPU only): 460.32…03
• Issue Type (questions, new requirements, bugs): question

Hi, I am trying to port SCRFD face detector code to DeepStream.
However, the custom parser plugin code needs a cuda_stream.

Code snippet:
const int MAX_IMAGE_BBOX = 1024;
const int NUM_BOX_ELEMENT = 16; // left, top, right, bottom, confidence, keepflag(1keep,0ignore), landmark(x, y) * 5
TRT::Tensor affin_matrix_device(TRT::DataType::Float);
TRT::Tensor output_array_device(TRT::DataType::Float);
TRT::Tensor prior(TRT::DataType::Float);
float confidence_threshold_=0.5;
float nms_threshold_=0.5;
int max_batch_size = 3;
output_array_device.to_gpu(false);
affin_matrix_device.resize(max_batch_size, 8).to_gpu();
output_array_device.resize(max_batch_size, 1 + MAX_IMAGE_BBOX * NUM_BOX_ELEMENT).to_gpu();
NvDsInferLayerInfo* layer_out = layerFinder("465");
for(int ibatch = 0; ibatch < 1; ++ibatch){
    //auto& job = fetch_jobs[ibatch];
    float* image_based_output = (float*)layer_out->data;
    float* output_array_ptr = output_array_device.gpu<float>(ibatch);
    std::cout<<"output_array_ptr "<<output_array_ptr<<std::endl;
    auto affine_matrix = affin_matrix_device.gpu<float>(ibatch);
    checkCudaRuntime(cudaMemsetAsync(output_array_ptr, 0, sizeof(int), cuda_stream));
    Scrfd::decode_kernel_invoker(
        image_based_output,
        16800, confidence_threshold_, nms_threshold_, affine_matrix,
        output_array_ptr, MAX_IMAGE_BBOX, prior.gpu<float>(),
        cuda_stream);
}

So how can I get the cuda_stream here from upstream?

The reference code I am following:

github.com/shouxieai/tensorRT_Pro/blob/main/src/application/app_scrfd/scrfd.cpp#L177


      
          for(int ibatch = 0; ibatch < infer_batch_size; ++ibatch){
              auto& job  = fetch_jobs[ibatch];
              auto& mono = job.mono_tensor->data();
              affin_matrix_device.copy_from_gpu(affin_matrix_device.offset(ibatch), mono->get_workspace()->gpu(), 6);
              input->copy_from_gpu(input->offset(ibatch), mono->gpu(), mono->count());
              job.mono_tensor->release();
          }
          
          
          engine->forward(false);

          output_array_device.to_gpu(false);
          for(int ibatch = 0; ibatch < infer_batch_size; ++ibatch){
              auto& job                 = fetch_jobs[ibatch];
              float* image_based_output = output->gpu<float>(ibatch);
              float* output_array_ptr   = output_array_device.gpu<float>(ibatch);
              auto affine_matrix        = affin_matrix_device.gpu<float>(ibatch);
              checkCudaRuntime(cudaMemsetAsync(output_array_ptr, 0, sizeof(int), stream_));
              decode_kernel_invoker(
                  image_based_output, 
                  output->size(1), confidence_threshold_, nms_threshold_, affine_matrix, 
                  output_array_ptr, MAX_IMAGE_BBOX, prior.gpu<float>(),

Thanks.

mchi · October 9, 2021, 7:26am

Hi @sharma.rahul98912
You need to create and manage the CUDA stream yourself.
DeepStream does not expose its internal CUDA stream externally.

Thanks!

