GPU Inference Monitoring in Browser

drcnyc · August 11, 2020, 7:50pm

We are developing an app which is HTML based, and where we want to be able to “monitor” the status of inference apps running in the background (the inference apps are being used as sensors to count widgets of a certain type). The app runs locally on the nano - both web server and client (chromium browser in kiosk mode). We are determining the best way to integrate the inference display output with the web app, and I’m looking for any guidance from this forum given the level of experience and knowledge here is very high.

Our inference apps are developed based on the jetson-inference / hello AI world demos, and we have no problem getting them to run as a standalone app on the display. However, to integrate them into a web interface, we have come up with a few different approaches:

Use XPRA / VirtualGL to set up a second virtual display, and display the html content in the browser; this appears to be a very heavyweight solution as it requires a second X server to be running, OpenGL emulation and a whole host of supporting apps and modules. Also it appears to lack support for EGL.
Develop a custom interface using electron (or similar) with web rendering capability and integrate the inference apps into this application. Potential challenges here are that we want inference apps running constantly (regardless of monitoring status) and thus need the ability to simply turn on or off the display output.
Use an HDMI dummy plug to run a second display that allows for GPU rendering, and use a VNC-to-html server such as noVNC to render the output of the second dummy display in the browser.

Our initial assessment of these options is that the first doesn’t work, the second requires a significant custom coding effort and the third would be simplest but somewhat of a hack. Am curious if there are any other thoughts or perspectives, or if anyone has done something similar to this in the past.

DaneLLL · August 20, 2020, 7:50am

Hi,
For running deep learning inference, we have DeepStream SDK. Please check

It is gstreamer-based frameworks. for your usecase, it may work if you integrate it with gstreamer WebRTC.

We don’t have full implementation. Just an idea for reference.

Topic		Replies	Views
Streaming data from Jetson Nano to a local computer Jetson Nano rtsp	6	4715	October 11, 2021
Jetson nano inference and object tracking Jetson Nano gstreamer	4	717	October 15, 2021
Jetson Nano running DeepStream using the Kinect V2 Live, Infrared and Depth Streams DeepStream SDK	5	827	October 12, 2021
Jetson-inference rendering Jetson Nano jetson-inference	4	1459	October 18, 2021
Examples for Deployment of and Inference with Pretrained Custom PyTorch-Based Models on Jetson Orin Nano Jetson Orin NX pytorch	13	105	May 25, 2025
How to send inference result as RTP streams to another Jetson for on screen display & filesink? DeepStream SDK camera , gstreamer	7	872	October 12, 2021
Run Gaze Estimation model on Nvidia Jetson Nano on own data TAO Toolkit tensorrt	16	1399	March 12, 2022
Inferring on a folder of images using deepstream in python DeepStream SDK tensorrt , jetson-inference , gstreamer	2	765	October 12, 2021
Jetson Nano Inferencing to Mobile App Jetson Nano jetson-inference	3	1367	June 25, 2021
How save inference output and display inference on another local network PC in deepstream-reference-app DeepStream SDK	12	540	February 16, 2023

GPU Inference Monitoring in Browser

Related topics