This caffe model:
https://github.com/spmallick/learnopencv/blob/master/FaceDetectionComparison/models/res10_300x300_ssd_iter_140000_fp16.caffemodel
runs slower on nvcaffe GPU than on OpenCV CPU
prototxt:
https://github.com/spmallick/learnopencv/blob/master/FaceDetectionComparison/models/deploy.prototxt
OpenCV code here:
https://github.com/spmallick/learnopencv/blob/master/FaceDetectionComparison/face_detection_opencv_dnn.cpp