This depends on the training dataset: the model needs more images that contain a similar scenario, i.e., two people standing next to each other.
Add such images with their labels to the dataset, then trigger training again.
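Before retraining, it can help to check how many labeled images already cover this scenario. The following is only a minimal sketch: the annotation layout (a dict mapping an image name to a list of `(label, xmin, ymin, xmax, ymax)` tuples), the function names, and the `max_gap_ratio` threshold are all assumptions made for illustration, not part of any particular tool's export format.

```python
from typing import Dict, List, Tuple

# Hypothetical annotation layout: (label, xmin, ymin, xmax, ymax) in pixels.
Box = Tuple[str, float, float, float, float]


def are_adjacent(a: Box, b: Box, max_gap_ratio: float = 0.5) -> bool:
    """Return True if two boxes overlap vertically and sit side by side.

    The horizontal gap must be at most max_gap_ratio times the width of
    the narrower box (an arbitrary threshold chosen for this sketch).
    """
    _, ax1, ay1, ax2, ay2 = a
    _, bx1, by1, bx2, by2 = b
    vertical_overlap = min(ay2, by2) - max(ay1, by1)
    if vertical_overlap <= 0:
        return False
    # Negative gap means the boxes already overlap horizontally.
    gap = max(ax1, bx1) - min(ax2, bx2)
    narrower = min(ax2 - ax1, bx2 - bx1)
    return gap <= max_gap_ratio * narrower


def images_with_adjacent(
    annotations: Dict[str, List[Box]], label: str = "person"
) -> List[str]:
    """List the images that contain at least one adjacent pair of `label` boxes."""
    hits = []
    for name, boxes in annotations.items():
        people = [b for b in boxes if b[0] == label]
        found = any(
            are_adjacent(people[i], people[j])
            for i in range(len(people))
            for j in range(i + 1, len(people))
        )
        if found:
            hits.append(name)
    return hits
```

If `images_with_adjacent` returns only a handful of images, that suggests the dataset underrepresents the side-by-side case and more such images should be labeled before training is triggered.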
For further reference, see: Detecting stacking of products