Model inference speed suddenly slows down

Hi,

I am performing semantic segmentation on a Jetson Nano board (L4T 32.2.3), with a small U-Net model implemented in PyTorch 1.3.0.

As images are fetched from the SD card and forwarded through the model, I find that the inference speed suddenly slows down. That is, for the first 7~8 images, inference takes around 25 ms each. Then, the time shoots up to around 150 ms on the next image, and stays at around 280 ms per image from then on.

Inference time is measured by wrapping the model as follows:

...
def inference(self, x):
    tic = time.perf_counter()
    with torch.no_grad():  # disable autograd bookkeeping for inference
        pred = self.model(x)
    toc = time.perf_counter()
    return pred, toc - tic  # prediction and elapsed time in seconds
...

Hence I am pretty sure that the variation in image fetch time is not the problem.
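(For reference: PyTorch launches CUDA kernels asynchronously, so a timer like the one above can return before the GPU has actually finished. A variant that synchronizes before stopping the clock, as a sketch assuming the model runs on the GPU, would be:

def inference(self, x):
    tic = time.perf_counter()
    with torch.no_grad():
        pred = self.model(x)
    torch.cuda.synchronize()  # wait for queued CUDA work before stopping the timer
    toc = time.perf_counter()
    return pred, toc - tic

The numbers above were measured with the original, unsynchronized version.)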

Could you provide suggestions or guesses about the cause?

Thanks,
Jaewon

Hi,

Sorry for the late update.

Another possible cause is memory allocation.
If the buffer is not reused, memory has to be released and reallocated for each frame.
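For example, allocating the input buffer once and copying each frame into it avoids a fresh device allocation per frame. A minimal sketch (the shape, dtype, and the frames/model names are placeholders):

import torch

# allocate the device-side input buffer once, outside the loop
input_buf = torch.empty(1, 3, 256, 256, device='cuda')

for frame in frames:          # frames: host-side tensors, one per image
    input_buf.copy_(frame)    # reuse the same GPU buffer every iteration
    with torch.no_grad():
        pred = model(input_buf)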

Would you mind helping us check this first?
Please add a short sleep after each inference and monitor the system status with tegrastats:

sudo tegrastats
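For example (a sketch; the images list and the inference wrapper name are placeholders), with tegrastats running in a second terminal:

import time

for image in images:
    pred, t = wrapper.inference(image)         # your timed inference call
    print('inference: %.1f ms' % (t * 1000))
    time.sleep(1)                              # idle so tegrastats samples are easy to read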

Thanks.