People Net -

ishan · May 4, 2020, 6:12pm

I am looking at the recently uploaded pretrained PeopleNet model.

Q) Would it be possible to add another class to detect, such as helmets? When retraining, I want to keep the original people-detection class but add another class such as helmets.

Q) One of the use cases mentions social distancing; are there any pointers on how that can be done with this model?

Morganh · May 5, 2020, 7:38am

It is possible to train on your own data using the unpruned peoplenet model as the pretrained weights. You need to prepare the helmet images/labels, resize them to 960x544, and set the training spec accordingly. That covers one class. If you want to train a 4-class detector, you also need to add some images/labels for person/bags/faces.
PeopleNet can be used to accurately count people in a crowded environment for security
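
A minimal sketch of the label side of the 960x544 resize mentioned above, assuming the labels are KITTI-style text lines (class name first, bounding box in fields 5 to 8 as xmin ymin xmax ymax). The helper name `scale_kitti_line` and the sample values are hypothetical and not from the thread; resizing the image pixels themselves is left to whichever image tool you use.

```python
# Hypothetical helper (not part of TLT): rescale the bounding box of one
# KITTI-style label line so it matches an image resized to 960x544.
TARGET_W, TARGET_H = 960, 544  # resolution named in the thread


def scale_kitti_line(line, src_w, src_h, dst_w=TARGET_W, dst_h=TARGET_H):
    """Return the label line with xmin, ymin, xmax, ymax scaled to the new size."""
    fields = line.split()
    if len(fields) < 8:
        raise ValueError("expected at least 8 fields in a KITTI-style label line")
    sx = dst_w / src_w  # horizontal scale factor
    sy = dst_h / src_h  # vertical scale factor
    xmin, ymin, xmax, ymax = (float(v) for v in fields[4:8])
    fields[4:8] = [
        f"{xmin * sx:.2f}",
        f"{ymin * sy:.2f}",
        f"{xmax * sx:.2f}",
        f"{ymax * sy:.2f}",
    ]
    return " ".join(fields)


# Example with an assumed 1920x1088 source image (scale factor 0.5 on both axes).
sample = "helmet 0.00 0 0.00 192.00 108.00 384.00 216.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00"
print(scale_kitti_line(sample, 1920, 1088))
```

Note that if the source images have a different aspect ratio than 960x544, a plain resize stretches the boxes slightly; the sketch scales x and y independently so the labels stay aligned with the resized pixels.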

ishan · May 5, 2020, 9:03am

So, is my understanding correct that to train a 4-class detector I could, hypothetically, add 5 training images for each of those classes, and the accuracy would still be high because of the original weights?

My end goal is to add a class to peoplenet and retrain with the 4 classes, but I won’t have all the data you used when training the original peoplenet; I will have a much smaller dataset. Will the accuracy of the retrained model on the person/bag/face classes be essentially the same as that of the original model?

Is my understanding correct? I am assuming that I have to explicitly mention the classes the exact same as “person” “bags” “faces” so the model knows?

Morganh · May 5, 2020, 4:38pm

Hi ishan,
First, please prepare your own dataset for 3 classes: Person, Bag, Face. The quantity of data is up to you. A smaller dataset is OK.
And set the correct class name in the training spec accordingly.
The class name is as below.

nvidia@nvidia:/opt/nvidia/deepstream/deepstream-5.0/samples/configs/tlt_pretrained_models$ cat labels_peoplenet.txt
Person
Bag
Face

Then you can use “tlt-evaluate” to check whether the peoplenet pretrained model “resnet34_peoplenet.tlt” has a good mAP.

$ tlt-evaluate detectnet_v2 -e spec_3class.txt -m resnet34_peoplenet.tlt -k tlt_encode

Normally, the mAP will be high. That means the peoplenet weights carry over to your own 3-class data.

Then, you can prepare the data of the 4th class, and set the training spec accordingly. Trigger the training.
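
Before evaluating or training, it can help to confirm that the class names in your label files match the names you put in the spec. The following is a rough sketch, assuming KITTI-style label lines where the first field is the class name. The function names are hypothetical, and whether the toolkit itself treats names case-sensitively is not stated in this thread, so the sketch simply compares exact strings.

```python
from collections import Counter

# Class names as listed in labels_peoplenet.txt above.
EXPECTED_CLASSES = {"Person", "Bag", "Face"}


def count_classes(label_lines):
    """Count how often each class name (first field) appears in the label lines."""
    counts = Counter()
    for line in label_lines:
        fields = line.split()
        if fields:  # skip blank lines
            counts[fields[0]] += 1
    return counts


def unexpected_classes(counts, expected=EXPECTED_CLASSES):
    """Return the class names found in the labels that are not in the expected set."""
    return sorted(name for name in counts if name not in expected)


# Made-up label lines for illustration only.
lines = [
    "Person 0.00 0 0.00 10.00 20.00 50.00 120.00 0 0 0 0 0 0 0",
    "Bag 0.00 0 0.00 60.00 80.00 90.00 110.00 0 0 0 0 0 0 0",
    "person 0.00 0 0.00 15.00 25.00 55.00 125.00 0 0 0 0 0 0 0",
]
counts = count_classes(lines)
print(dict(counts))
print(unexpected_classes(counts))
```

When the 4th class is added later, extending `EXPECTED_CLASSES` with its name keeps the same check usable.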

ishan · May 28, 2020, 3:13am

Can I retrain peoplenet on my 4th class without adding new data for the existing bag and face classes?

Morganh · May 28, 2020, 3:28am

So, you want to train only one class? You can try, but I’m afraid it will not work, because after training your tlt model will be a detection model that detects only one class.

ishan · May 28, 2020, 5:00am

I want to keep the ability to track people from peoplenet, but I want to add another class. I am not interested in bags or faces, but if it means sacrificing the people class I am OK with training on 4 classes (3 original classes + 1 new class of my own).

If I want to add a 4th class to peoplenet, is that possible?

1) Person
2) Bag
3) Face
4) Cardboard Boxes (my class)

After retraining, will my new model have the same accuracy on people as the original peoplenet?

Morganh · May 28, 2020, 7:45am

For your case, you can train with only two classes: Person and your new class.
You need to prepare the data for both classes.

ishan · May 28, 2020, 6:02pm

I will do that. Just to confirm: by using 2 classes (person and cardboard boxes), will this newly trained model have the same performance for the person class as the original peoplenet model?

Morganh · May 29, 2020, 7:47am

For your case, it is actually a new training. The peoplenet model contains pretrained weights that may serve as a better starting point for the people class.
I also ran a similar experiment on my side. I trained the “People” class and a new class, “cart”.
I prepared some data for both classes. All the data are 960x544.
Then I set the training spec and also tuned the class_weight.
The unpruned peoplenet pretrained model works well as the pretrained weights.

ishan · May 29, 2020, 7:49am

Is your new model with people and carts as good as the original peoplenet model when it comes to detecting people?

Also, thank you for doing this experiment.

Morganh · May 29, 2020, 7:55am

Actually, it is a new training because a new class is added.
I prepared 14k person data and 3.7k cart data and ran a total of 10 epochs, which took 40 minutes. The AP for Person is about 60%.
I did not tune the hyper-parameters much.
So, after fine-tuning or training longer, I believe the mAP can still improve further.
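
The run above has roughly four times as much person data as cart data, which is one reason class_weight may need tuning. As a hedged starting point only, the sketch below computes inverse-frequency weights normalized so that the largest class gets 1.0. This is a common heuristic, not the method used in the experiment above, and the function name is hypothetical; the resulting values would still need to be tuned by hand in the training spec.

```python
def inverse_frequency_weights(counts):
    """Weight each class by (largest class count / its own count).

    The most frequent class gets 1.0 and rarer classes get larger weights.
    """
    if not counts or any(n <= 0 for n in counts.values()):
        raise ValueError("every class needs a positive sample count")
    largest = max(counts.values())
    return {name: largest / n for name, n in counts.items()}


# Data sizes quoted in the post above (14k person, 3.7k cart).
weights = inverse_frequency_weights({"person": 14000, "cart": 3700})
for name, weight in weights.items():
    print(f"{name}: {weight:.2f}")
```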

ishan · May 29, 2020, 4:36pm

Would it be possible to share your person dataset?

Morganh · May 30, 2020, 6:14am

Sorry, this data is internal to Nvidia only.

ishan · May 31, 2020, 11:55pm

I understand, thanks.

Andrew_Smith · June 4, 2020, 8:57am

Good Morning,

In relation to transfer learning, would it be possible to leverage the already good PeopleNet to detect people in gray images (infrared cameras)?

Retraining completely is not possible, as "PeopleNet v1.0 model was trained on a proprietary dataset with more than 5 million objects for person class."

Can we use transfer learning so that the network detects people in colour images and grayscale images with the same level of accuracy?

If so, how many new “gray” images would we need to use? How high is the risk of overtraining the network with the new gray images?

Morganh · June 5, 2020, 3:54am

See https://ngc.nvidia.com/catalog/models/nvidia:tlt_peoplenet

Dark-lighting, Monochrome or Infrared Camera Images

The PeopleNet model was trained on RGB images in good lighting conditions. Therefore, images captured in dark lighting conditions or a monochrome image or IR camera image may not provide good detection results.

More reference:

For training on grayscale images only, please consider setting

output_image_channel: 1

For how many images are needed, refer to Dataset Practices - #3 by Morganh

Andrew_Smith · June 6, 2020, 1:24pm

Thanks

I was aware PeopleNet was not trained on grayscale images, but I want to be able to detect people in the daytime and at night.

As for retraining I thought the purpose of transfer learning was to reduce the need for huge amounts of new data ?

The link you gave just gives the amount of data used to train the network on RGB images at different distances, half indoors and half outdoors?

Does this mean we would need a similar number of images from IR cameras, and will this not reduce detections on colour images, since we can’t add Nvidia’s training images to the dataset?

Morganh · June 8, 2020, 8:18am

@Andrew_Smith
No, you do not need a similar number of images. That is why the unpruned peoplenet model is provided on NGC. Users can set it as the pretrained model and train on their own data. If your data are colour images, the transfer learning should run smoothly. But as the link said, “monochrome image or IR camera image may not provide good detection results”; that is the known limitation.

Andrew_Smith · June 8, 2020, 11:24am

Thank you for the response.

I need to detect both daytime and night time camera images.

So should I disregard PeopleNet?

Does this mean the unpruned PeopleNet cannot be trained to recognize IR camera images?

If I acquire a large number of IR camera images, will training on the unpruned model completely ruin the colour image detection afterwards?
