ANNOUNCING NVIDIA® cuDNN EMBEDDED – GPU Accelerated Machine Learning for Jetson TK1

sjones · November 14, 2014, 10:14pm

NVIDIA cuDNN is a GPU-accelerated library of primitives for deep neural networks. It emphasizes performance, ease of use, and low memory overhead. Already integrated into higher-level machine learning frameworks such as UC Berkeley’s popular Caffe software, cuDNN is now available for the wildly popular Jetson TK1. The platform is well suited to low-power computer vision and machine learning applications such as robotics and autonomous vehicles, packing a high level of performance into a low-power board.

Key Features

Forward and backward convolution routines, tuned for NVIDIA GPUs
Always optimized for latest NVIDIA GPU architectures
Arbitrary dimension ordering, striding, and subregions for 4d tensors
Forward and backward paths for common layer types (ReLU, Sigmoid, Tanh, pooling, softmax)
Context-based API allows for easy multithreading
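
To make the "arbitrary dimension ordering, striding, and subregions" item concrete, here is a minimal sketch of how per-dimension strides address a 4d tensor. It is plain Python for illustration only and does not call cuDNN; the helper names (packed_strides, offset) and the example sizes are assumptions, not part of the library.

```python
def packed_strides(dims, order):
    """Strides (in elements) for a fully packed 4d layout.

    dims  -- sizes keyed by 'n', 'c', 'h', 'w'
    order -- axis order from outermost to innermost, e.g. 'nchw' or 'nhwc'
    Returns strides in the fixed (n, c, h, w) order.
    """
    strides = {}
    step = 1
    for axis in reversed(order):  # innermost axis gets stride 1
        strides[axis] = step
        step *= dims[axis]
    return tuple(strides[a] for a in "nchw")


def offset(index, strides):
    """Linear element offset of an (n, c, h, w) index for the given strides."""
    return sum(i * s for i, s in zip(index, strides))


# Hypothetical tensor: 2 images, 3 channels, 4 rows, 5 columns.
dims = {"n": 2, "c": 3, "h": 4, "w": 5}
nchw = packed_strides(dims, "nchw")
nhwc = packed_strides(dims, "nhwc")

# A subregion is the same buffer with a shifted base offset and the
# parent's strides; here a window starting at row 1, column 2.
window_base = offset((0, 0, 1, 2), nchw)
inside_window = window_base + offset((0, 0, 1, 1), nchw)
```

The same logical index maps to different memory offsets depending only on the strides, so one routine can serve several layouts, and a subregion needs no copy because it reuses the parent buffer's strides.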

Visit the Parallel ForAll Blog for an overview of cuDNN for embedded, or visit here to download the library.

Stephen Jones
Product Manager – Strategic Alliances

ShervinE · December 17, 2014, 5:04am

I added this to the Wiki at http://elinux.org/Jetson/cuDNN and have made this post non-sticky.
