The Power of C++11 in CUDA 7

jwitsoe · March 18, 2015, 8:50am

Originally published at: https://developer.nvidia.com/blog/power-cpp11-cuda-7/

Today I’m excited to announce the official release of CUDA 7, the latest release of the popular CUDA Toolkit. Download the CUDA Toolkit version 7 now from CUDA Zone! CUDA 7 has a huge number of improvements and new features, including C++11 support, the new cuSOLVER library, and support for Runtime Compilation. In a previous…

anon10397723 · March 18, 2015, 8:51pm

This looks really great! However, I am using fedora so I cannot try it out myself yet :-( Do you know when the fedora edition will be available?

anon95180265 · March 19, 2015, 6:22am

Hi Kenneth. The .run file installer *might* work with Fedora 20, though we haven't tested it. It's worth a try. We're working on getting the Fedora 21 installer available soon (only 21 will be *officially* supported with CUDA 7 -- please see the release notes).

anon15748217 · March 24, 2015, 9:14am

Hi Mark, thanks, nice and clear explanations. I have found a typo in the first code snippet, when calling count_if, there should be text instead of data.

anon95180265 · March 24, 2015, 1:01pm

Good catch; I fixed it. Thanks!

anon42620448 · April 26, 2015, 12:28pm

Hi Mark, which parts of the STL of C++11 can be used on the device with cuda 7? NB: I was thinking in defining classes that make use of std::vector, etc ... that would incorporate unified memory managed class, as in a previous posting of you, and then having __device__ __host__ functions using those classes. I have tried but it seems not to work. Does one need to use something like Thrust?, or am I mistaken? Thank you.

anon95180265 · April 27, 2015, 12:26am

CUDA 7 adds support for C++11 language features in device code, but not the standard template library, I'm afraid. You can use a thrust::device_vector. You could indeed write your own vector class that uses managed memory, but existing STL headers won't "just work" because of the need to annotate all functions called on the device with "__host__ __device__"

anon42620448 · April 27, 2015, 1:32am

That makes sense. Thanks for the clarification. I had started to do so, about the '__host__ __device__' with a ifdef/ifndef, but, clearly, I encountered problems using the STL methods. In order to use the implicit methods of STL, the closest and best thing to do seems to use Thrust. I had not looked at it before and it is clearly very good, as well, and probably close to what an adapted C++ for the device memory would be.

Topic		Replies	Views
CUDA 7 Release Candidate Feature Overview: C++11, New Libraries, and More Technical Blog	43	1482	August 8, 2016
Thrust v1.1 release A high-level C++ template library for CUDA CUDA Programming and Performance	6	13825	September 18, 2009
C++ support for STL containers in device code and memory CUDA Programming and Performance	11	14222	December 11, 2010
Anyone working on STL for CUDA? CUDA Programming and Performance	7	10433	December 11, 2010
Question about Thrust Library with Kernel CUDA Programming and Performance	2	1017	March 19, 2019
10 Ways CUDA 6.5 Improves Performance and Productivity Technical Blog	21	477	January 21, 2015
CUDA Toolkit 7 For Fedora 21 now available for download Announcements	3	2230	July 16, 2015
smart pointers and stl implementations in cuda CUDA Programming and Performance	0	2253	November 11, 2017
CUDA with C++ CUDA Programming and Performance	5	3612	May 28, 2009
Conversion from cpp functors to __device__ functors CUDA Programming and Performance	2	4137	February 22, 2011

The Power of C++11 in CUDA 7

Related topics