My team and I are working on a computer vision project, and our solution uses NVIDIA tools, specifically the FacialLandmarks and Gaze models for this part. We are running into a problem when executing the app: it works well when only one person is on camera, but when there are several people, the landmarks and gaze are detected correctly only for the first person seen in the video, while they appear completely random for everyone else.
Face detection works on our side (we do get more than one red box), but the landmarks inside the additional boxes seem to be random. Are you able to get correct landmarks on multiple faces?