Deep Learning for Computer Vision with MATLAB and cuDNN

jwitsoe · October 27, 2015, 5:21am

Originally published at: https://developer.nvidia.com/blog/deep-learning-for-computer-vision-with-matlab-and-cudnn/

Deep learning is becoming ubiquitous. With recent advancements in deep learning algorithms and GPU technology, we are able to solve problems once considered impossible in fields such as computer vision, natural language processing, and robotics. Figure 1: Pet detection and recognition system. Deep learning uses deep neural networks which have been around for a few…

anon28337922 · October 28, 2015, 8:59pm

Nice writeup. For those interested in GPU systems with cuDNN preinstalled I found the following which readers here might be interested in: http://exxactcorp.com/index...

anon28593763 · November 3, 2015, 9:05pm

Hi. Nice work. Could you share your images and videos for this benchmark. Thus, I'll be able to compare it on my hw? Tnx in advance.

anon48679366 · November 4, 2015, 2:53pm

Hi Wendell, glad you liked the post. The images and videos belong to my my colleagues and unfortunately I don’t have permissions to share all of them. You can find several dog and cat image datasets and videos on the internet that be readily used for this task. Please note that your mileage may vary since the solution is sensitivity to the training images. For example, if your training data is small and only includes certain pet poses, your model may not be robust to all poses in the video. You may then need to gather more images to introduce pose invariance.

anon90918232 · November 8, 2015, 2:50pm

>> opticFlow = opticalFlowFarneback;

Undefined function or variable 'opticalFlowFarneback'.

where can i find opticalFlowFarneback library ?

anon48679366 · November 9, 2015, 2:58pm

Hi Kadir, opticalFlowFarneback is part of the Computer Vision System Toolbox and was introduced in R2015b (current release or MATLAB). Make sure you upgrade to this release and you should be able to run the example. Feel free to contact us if you have further questions or need help with upgrade or usage: http://www.mathworks.com/su...

anon38958062 · November 12, 2015, 5:32pm

I setup Computer Vision System Toolbox, but in examples, with
mexOpencv example.cpp command, creates the example.mex and example.m files and then build example.m using this mex file. But I have now only script file so it returns Undefined function or variable 'opticalFlowFarneback'. error. So, is there any other solution or path-lib settings I have forgotten?

anon48679366 · November 16, 2015, 4:01am

Hi Elif, see my response to Kadir below. If you have R2015b version of MATLAB and Computer Vision System Toolbox, you should be able to just run 'opticalFlowFarneback' without the need to install anything from external packages.
Here's the link to the function's documentation page:
http://www.mathworks.com/he...

anon45410826 · December 10, 2015, 1:53pm

i have problems and i am using 2015a. i am using rgb image. Should i have to use gray scale image?

Reference to non-existent field 'normalization'.

Error in cnnPredict/cnnPreprocess (line 84)

im = imresize(im, cnnModel.net.normalization.imageSize(1:2));

Error in cnnPredict (line 26)

resTemp = vl_simplenn(cnnModel.net, cnnPreprocess(predImage(:,:,:,1)), [], []);

Error in PetDetectionRecognitionScript (line 18)

label = cnnPredict(cnnModel,img);

anon45410826 · December 14, 2015, 3:23pm

I have a problem.Under MatConvNet, the function vl_nnconv.m has nothing within it. If you have it, then kindly send the zip folder
''matconvnet-1.0-beta15'' to me. It will be very helpful for me.

email: rahman3.1416@yahoo.com

anon45410826 · December 14, 2015, 3:27pm

Under MatConvNet, the function vl_nnconv.m has nothing within it. If you have it, then kindly send the zip folder
''matconvnet-1.0-beta15'' to me. It will be very helpful for me.

email: rahman3.1416@yahoo.com

anon48679366 · December 16, 2015, 7:53pm

Hi sadman,

"Reference to non-existent field 'normalization'" means that the cnn model you provided to cnnPredict function doesn't have a field called 'normalization' cnnPredict function needs field to do two things: (1) To resize your input image such that it is compatible with the imagenet network (2) subtract the imagenet average image.
If you downloaded a pretrained imagenet model from vlfeat webpage as suggested in the code files, the model must already have a 'normalization' field that cnnPredict expects, in order to make a prediction.

anon45410826 · December 25, 2015, 8:45pm

hi, i have a problem. I ran the code successfully but i didn't get the desired ouput. There was no bounding box around dog or cat in the constructed video test.avi. What's wrong with it please explain someone.

anon79132173 · January 12, 2016, 7:44pm

Great, but how i can try a net with my images for recognize pictures?
I dont want download pretrained mat file.
Thank you in advance

anon62122626 · January 19, 2016, 2:46pm

Hi Shashank Prasanna,

I have exactly the same problem :
>> imageSize = cnnModel.net.normalization.imageSize;
Reference to non-existent field 'normalization'.

The cnnModel.net is properly downloaded from Vlfeat. I tried "imagenet-vgg-f.mat" and 'imagenet-matconvnet-vgg-f.mat".

Where I got it wrong ? Thank you.

anon48679366 · January 20, 2016, 1:06am

Hi Nico, The version we used for this post is: matconvnet-1.0-beta15
It's possible that later releases store normalization differently.
See list of changes here:
http://www.vlfeat.org/matco...

anon48679366 · January 20, 2016, 1:08am

The blog post outlines the steps you would take to use a pretrained CNN as a feature extraction technique. Alternatively you could train a network from scratch, you should find code examples to do in the MatConvNet examples folder.

https://github.com/vlfeat/m...

anon17497068 · January 24, 2016, 12:08pm

Hi Nico,

I had the same problem with matconvnet-1.0-beta 18, but there are only a few lines to fix in the code to get tit working.
You simply need to update the NN in order to make it compatible by:

net = load('imagenet-vgg-f.mat');

cnnModel.net = vl_simplenn_tidy(net)

Those networks apparently have a slightly different structure, than in earlier versions.

In Shashank Prasanna's function cnnPredict.m simply add the "meta" struct field (e.g. cnnModel.net.normalization --> cnnModel.net.meta.normalization ) in lines 78, 84 and 85:
78: classLabel = cnnModel.net.meta.classes.description(labelId)';
84: im = imresize(im, cnnModel.net.meta.normalization.imageSize(1:2));
85:im = bsxfun(@minus,im,cnnModel.net.meta.normalization.averageImage);

Hope that helps.
And thanks to Shashank Prasanna for the great blog post!

anon62122626 · January 27, 2016, 1:55pm

Hi Daniel and Shashank,

It works thank you !
I thought about the vl_simplenn_tidy conversion but it was not enough.

Now I have a curious problem when I test cnnPredict:
Number of images: 1
Number of batches: 1
Whereas the "summary(trainingLabels)" indicates the right number of images (more than 50 images in cat and dog folders.
Any idea please ?
Thank you again !

anon65339821 · January 27, 2016, 2:10pm

Hi Shashank Prasanna,
first of all thanks for your great blog post, really interesting and useful.
I have tried it and I've had a problem in this part

for ii = 1:numel(imset)
for jj = 1:imset(ii).Count
trainingImages(:,:,:,jj) = imresize(single(read(imset(ii),jj)),imageSize(1:2));
end
end

PROBLEM --> Assignment has fewer non-singleton rhs dimensions than non-singleton subscripts

If I continue doing the following steps, in this one I get another problem:

svmmdl = fitcsvm(cnnFeatures,trainingLabels);

PROBLEM --> Error using classreg.learning.FullClassificationRegressionModel.prepareDataCR (line 138)
X and Y do not have the same number of observations.

Error in ClassificationSVM.prepareData (line 607)
[X,Y,W,dataSummary] = ...

Error in classreg.learning.FitTemplate/fit (line 205)
[X,Y,dataPrepOut{1:this.NDataPrepOut}] = ...

Error in ClassificationSVM.fit (line 237)
this = fit(temp,X,Y);

Error in fitcsvm (line 279)
obj = ClassificationSVM.fit(X,Y,varargin{:});

What could I do to solve it?
Thank you very much.

Topic		Replies	Views
Using MATLAB and TensorRT on NVIDIA GPUs Technical Blog	1	516	February 9, 2021
Accelerate Machine Learning with the cuDNN Deep Neural Network Library Technical Blog	32	892	February 26, 2018
DIGITS: Deep Learning GPU Training System Technical Blog	54	1197	January 7, 2025
DetectNet: Deep Neural Network for Object Detection in DIGITS Technical Blog	23	1629	July 7, 2019
VisionWorks + Inference Jetson TX2	14	3777	October 18, 2021
Deep Learning for Automated Driving with MATLAB Technical Blog	4	535	December 10, 2019
Questions about Face-Recongnition Jetson TX2	46	8465	October 18, 2021
DetectNet Tutorial Problem - OpenCV 3? Jetson TX2	16	1685	October 18, 2021
Train custom object detectio model Jetson Nano ai-training	12	3191	October 18, 2021
How to build the objection detection framework SSD with tensorRT on tx2? Jetson TX2	96	22569	February 21, 2018

Deep Learning for Computer Vision with MATLAB and cuDNN

Related topics