I experimented with speedup by increasing the number of streams or batch size.
I expected enough speedup in both cases.
But, there is no significant speedup by increasing the number of streams.
Multi-stream is faster than sequential processing.
Even multi-stream reduces the memory transfer time of input image by pipelining.
Do you think this result is normal??
I try it again with seperate context (nvinfer1::IExecutionContext) for each stream.
But the execution time shows similar pattern. (no significant speedup by increasing the number of streams)
Do I need to create anything more seperately? @NVES