Deep Learning for Object Detection with DIGITS

jwitsoe · August 11, 2016, 12:59am

Originally published at: https://developer.nvidia.com/blog/deep-learning-object-detection-digits/

Figure 1: A screenshot of DIGITS 4 showing the input image (top) and the final result with bounding boxes around detected vehicles (bottom).

Today we’re excited to announce the availability of NVIDIA DIGITS 4. DIGITS 4 introduces a new object detection workflow and DetectNet, a new deep neural network for object detection that enables data scientists…

anon4776004 · August 16, 2016, 1:21am

How can we create label descriptions for object detection?

anon16338414 · August 16, 2016, 12:57pm

Is it possible to train Faster R-CNN or single-shot detector models now, or will it be in the near future?

anon66576231 · August 24, 2016, 9:22am

Good post!! The goal of an object detection system is to detect all instances of objects of a known category in an image. Figure 1 shows the final results of an object detection system trained with DIGITS which can detect vehicles on a construction site. Starting with a successful vehicle detection system like this, you can solve a number of other problems such as recognizing the makes and models of the vehicles, counting and tracking vehicle locations over time, generating natural language descriptions of the images and so on.

Jafar

anon92853494 · August 30, 2016, 11:14am

Indeed an interesting tool with a simple interface. I wonder how easy it is to change the network architecture?

anon23303201 · September 30, 2016, 9:17pm

Changing the GoogLeNet FCN portion of DetectNet is relatively straightforward. You just have to make sure that the input and output data shapes of whatever FCN you substitute match what the rest of DetectNet expects.

In addition to object detection networks, DIGITS 4 also supports image segmentation networks and image classification networks. Network architectures can be modified using the Caffe prototxt format.

anon23303201 · September 30, 2016, 9:18pm

It is not currently possible to train Faster R-CNN through DIGITS. DetectNet is a single-shot detector. It should also be possible to implement other single-shot detectors such as YOLO, but there is currently no example of this.

anon23303201 · September 30, 2016, 9:20pm

There are a variety of tools available for creating bounding box annotations on images that can be converted into the KITTI format that DetectNet requires. One example is the open-source tool Sloth: http://sloth.readthedocs.io...
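As a rough illustration of what such a conversion produces, here is a minimal sketch that formats one 2D bounding box as a 15-field KITTI label line. The function name `kitti_label_line` is hypothetical, and the sketch assumes that only the class name and the 2D box matter for 2D detection, so every other field is written as zero. Check the DIGITS object detection documentation before relying on that assumption.

```python
def kitti_label_line(label, left, top, right, bottom):
    """Format one 2D box as a 15-field KITTI label line.

    Hypothetical helper: only the class name and the 2D box are filled in.
    All other KITTI fields are assumed unused and written as zero.
    """
    if right <= left or bottom <= top:
        raise ValueError("box must have positive width and height")
    fields = [
        label,                  # object type, e.g. "Car"
        "0.0",                  # truncated (float)
        "0",                    # occluded (must be an integer)
        "0.0",                  # alpha (observation angle)
        f"{left:.2f}", f"{top:.2f}", f"{right:.2f}", f"{bottom:.2f}",  # 2D bbox
        "0.0", "0.0", "0.0",    # 3D dimensions (height, width, length)
        "0.0", "0.0", "0.0",    # 3D location (x, y, z)
        "0.0",                  # rotation_y
    ]
    return " ".join(fields)


print(kitti_label_line("Car", 10, 20, 120, 130))
```

Each image gets one text file with one such line per annotated object.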

anon54543392 · November 6, 2016, 5:39am

Hi,
Does DIGITS 4 support detecting multiple object classes (e.g. pedestrians and cars in the same image)?
If not, is it planned for the future?

anon59830129 · November 7, 2016, 10:03pm

Yes, you can. Here is a PR with some information about it:
https://github.com/NVIDIA/c...

anon82536188 · November 13, 2016, 8:42pm

Hi!
"Unfortunately, this dataset is not shareable" - maybe the net weights are shareable? The data look the same as in my problem (cars viewed from above), and I would love to try it out on my dataset :)

anon75353715 · March 21, 2017, 6:59pm

Try "ALT" from alpslabel.wordpress.com. There is no guarantee that it is the easiest way; better to think of it as "easier".

anon38692942 · March 29, 2017, 11:01pm

I'm pretty sure that recall is TP / (TP + FN), not TP / (TP + TN).
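For reference, a minimal sketch of the corrected formula in Python. The function names and the example counts are illustrative only; precision is included purely for contrast and is not part of the correction above.

```python
def recall(tp, fn):
    """Recall = TP / (TP + FN): the share of real objects that were detected."""
    total = tp + fn
    return tp / total if total else 0.0


def precision(tp, fp):
    """Precision = TP / (TP + FP): the share of detections that were correct."""
    total = tp + fp
    return tp / total if total else 0.0


# Illustrative counts: 8 objects found, 2 missed, 4 spurious detections.
print(recall(tp=8, fn=2))
print(precision(tp=8, fp=4))
```

True negatives do not appear in either formula.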

anon95180265 · March 30, 2017, 2:39am

Good catch. Fixed it, thanks.

anon32998975 · April 1, 2017, 1:54am

Can we use a Torch model instead of Caffe in the custom network option? Can we convert a Caffe prototxt to Torch Lua?

anon2494271 · April 10, 2017, 6:43pm

I have very large images, about 5000x4000, with training bounding boxes that are mostly about 110x110. There are more than 500 of these images. Is there any advice for dealing with this much data, or an estimate of how long training could take? I am using a Tesla K40. Any idea what batch size I will likely have to use, or a reference on how to determine the batch size?

anon75353715 · May 8, 2017, 3:24pm

Hi Leonard, the input image size is limited by GPU memory. I am also training on large images, and for DetectNet with 12 GB of GPU RAM, RGB images of 4-4.5 megapixels are the maximum. That means something like 2000x2000 or 4000x1000.
Have you checked the tools at alpslabel.wordpress.com? You may find some useful stuff there.
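To put numbers on that limit, here is a small sketch that computes megapixels and, as one possible workaround, how many overlapping tiles would be needed to cover a large image. Tiling is an assumption on my part and not something DIGITS does for you. The 4.5 MP budget is the figure quoted above for a 12 GB GPU, and the helper names are hypothetical.

```python
import math


def megapixels(width, height):
    """Image size in megapixels."""
    return width * height / 1e6


def tile_grid(width, height, tile_w, tile_h, overlap):
    """Return (columns, rows) of overlapping tiles needed to cover the image.

    Adjacent tiles share `overlap` pixels so that an object cut by one tile
    border can still appear whole in the neighbouring tile.
    """
    if overlap >= tile_w or overlap >= tile_h:
        raise ValueError("overlap must be smaller than the tile size")
    cols = max(1, math.ceil((width - overlap) / (tile_w - overlap)))
    rows = max(1, math.ceil((height - overlap) / (tile_h - overlap)))
    return cols, rows


BUDGET_MP = 4.5  # rough limit quoted above for 12 GB of GPU RAM

print(megapixels(5000, 4000))                   # 20.0, far over the budget
print(megapixels(2000, 2000))                   # 4.0, within the budget
print(tile_grid(5000, 4000, 2000, 2000, 110))   # (3, 3)
```

With an overlap equal to the typical 110-pixel object size, a 5000x4000 image needs a 3x3 grid of 2000x2000 tiles. The bounding box labels would have to be shifted and clipped to match each tile.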

anon40269687 · May 11, 2017, 8:06am

Thank you Baker and Prasanna for the step-by-step guidance.
I am a newbie here, and I followed all your steps to create a DB using KITTI vision with 56 sets for training and 13 for validation.

I got an error when creating the DB, and I really don't know what the error message means:

2017-05-11 09:39:43 [ERROR] ValueError: invalid literal for int() with base 10: '116.41870117188'
Traceback (most recent call last):
  File "/home/yasirac/digits/digits/tools/create_generic_db.py", line 478, in <module>
    args['stage']
  File "/home/yasirac/digits/digits/tools/create_generic_db.py", line 443, in create_generic_db
    force_same_shape)
  File "/home/yasirac/digits/digits/tools/create_generic_db.py", line 296, in create_db
    entry_ids = extension.itemize_entries(stage)
  File "/home/yasirac/digits/digits/extensions/data/objectDetection/data.py", line 183, in itemize_entries
    self.load_ground_truth(self.train_label_folder)
  File "/home/yasirac/digits/digits/extensions/data/objectDetection/data.py", line 208, in load_ground_truth
    datasrc.load_gt_obj()
  File "/home/yasirac/digits/digits/extensions/data/objectDetection/utils.py", line 193, in load_gt_obj
    gt.occlusion = int(row[2])
ValueError: invalid literal for int() with base 10: '116.41870117188'
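
Reading the traceback: the loader calls `int(row[2])` on the third field of a label line, which in the KITTI format is the integer occlusion flag, and it found `'116.41870117188'` there instead. That suggests the columns in at least one label file are shifted or incomplete. Below is a minimal sketch of a checker that could be run over label files before creating the DB. The function name `check_kitti_line` and the sample lines are hypothetical, and the sketch assumes whitespace-separated fields.

```python
def check_kitti_line(line):
    """Return a list of problems found in one KITTI label line.

    Hypothetical helper: it only checks the field count and that the third
    field (occluded) parses as an integer, which is what the traceback
    above failed on.
    """
    fields = line.split()
    problems = []
    if len(fields) != 15:
        problems.append(f"expected 15 fields, found {len(fields)}")
    if len(fields) > 2:
        try:
            int(fields[2])
        except ValueError:
            problems.append(f"field 3 (occluded) is not an integer: {fields[2]!r}")
    return problems


# Made-up sample lines, not taken from the poster's dataset.
good = "Car 0.0 0 0.0 10.00 20.00 120.00 130.00 0.0 0.0 0.0 0.0 0.0 0.0 0.0"
bad = "Car 0.0 116.41870117188 50.0 200.0 120.0"

print(check_kitti_line(good))
print(check_kitti_line(bad))
```

Running this over every line of every label file should point to the files whose columns need fixing.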

anon19482705 · May 25, 2017, 10:21am

I want to know how high image resolution affects object detection performance.

anon67906094 · December 30, 2017, 10:40pm

Classification + localization = object detection. Am I right?
