Long shot... access mesh data in different program, but already loaded in the GPU

tjaenichen · October 15, 2020, 7:39am

I am quite sure the answer is “No way”, but I am still wondering if I could somehow accomplish this.

We run a Unity application that loads a bunch of meshes. On these meshes we run calculations with Optix.

Right now this is done by walking through the Unity scene and grabbing each Mesh, converting the mesh data to a format that works for Optix and then send it there.

So at this point the GPU has loaded the same meshes twice, once for display in our app and once for Optix.

Is there a way to somehow reuse the meshes we already loaded? Even if Optix couldn’t directly access that it may be faster to download them from the GPU to re use instead of getting them from Unity.

droettger · October 15, 2020, 8:54am

Define “in a different program”. You mean inside a different process?

Accessing other graphics API’s (OpenGL, Vulkan, DX) resources on the device in OptiX directly will always require CUDA interoperability.
This is possible inside the same process for a subset of types of resources and data types supported by CUDA interop. (E.g. images and textures need to be 1-, 2- or 4-component formats, but not compressed or exotic bit layouts.)

Always keep in mind that CUDA vector types have a specific alignment requirement!
For example if you have tightly interleaved vertex data like struct { float3 v; float2 t; }; in your graphics API, you cannot simply map that to CUDA because the float2 requires an 8-byte alignment but is at offset 12 => crash with misaligned access error.
(You could reinterpret that as individual floats to get 4-byte alignment, which is then is slower to load then aligned float2).

Doing CUDA data access across process boundaries requires Inter Process Communication (IPC).
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#interprocess-communication

According to the memory management chapter “IPC functionality is restricted to devices with support for unified addressing on Linux and Windows operating systems. IPC functionality on Windows is restricted to GPUs in TCC mode”.
https://docs.nvidia.com/cuda/cuda-driver-api/group__CUDA__MEM.html#group__CUDA__MEM

I would also assume the IPC memory handles require actual CUDA memory allocations and not just temporarily mapped virtual pointers from CUDA-graphics interop resources.
(I have no experience with CUDA IPC. I’m a Windows graphics guy.)

In summary, CUDA interop with graphics APIs in the same process shouldn’t be a problem, but I don’t expect that to be working across processes using IPC easily, and not at all under Windows devices running graphics.

tjaenichen · October 15, 2020, 9:47am

Thanks!

I was thinking that maybe from Unity I could a handle on things via CUDA, or at least the in DLL (which uses Optix) I am importing, which should run inside the same process. Unity is using DirectX 11, if that makes a difference.

droettger · October 15, 2020, 11:16am

If you’re loading a DLL into the same process, then you should be able to use CUDA interop on buffer and image resources, but as said, be very careful with the CUDA alignment restrictions.

Once you have a CUDA resource handle, you can access the underlying device data.
I’m showing that for OpenGL in my OptiX 7 examples. Search for m_interop in this file for example:
https://github.com/NVIDIA/OptiX_Apps/blob/master/apps/rtigo3/src/DeviceSingleGPU.cpp

There was very similar discussion recently which effectively does the same with D3D11 and boils down to another CUDA header and the resp. D3D11 variants of the resource registering calls, the rest behaves identical:
https://forums.developer.nvidia.com/t/unity3d-rendertexture-texture2d-to-optiximage2d/156408

tjaenichen · October 15, 2020, 11:30am

Awesome, thanks a lot! I’ll have a look

Topic		Replies	Views
Mixing shaders and CUDA CUDA Programming and Performance	3	3411	January 11, 2009
How to use opengl3.3 with CUDA using OpenGL Interoperability OpenGL	0	650	April 23, 2019
CUDA GL interop - Reading mapped buffer texture from another process CUDA Programming and Performance	0	589	May 5, 2014
OpenGL in OptiX 7 OptiX	3	1155	June 14, 2022
Optix7.0 opengl interoperation OptiX opengl	5	1961	June 15, 2022
Interop with Unity/D3D OptiX	8	1315	June 27, 2022
D3Dinterop CUDA Programming and Performance	0	3129	April 6, 2010
interop with opengl is quite slow OptiX	4	1102	June 14, 2022
CUDA / OpenGL / GLSL Pixel Shader running within the same application. CUDA Programming and Performance	3	10115	July 26, 2010
OpenGL & GPU Interaction CUDA Programming and Performance	5	6095	April 16, 2008

Long shot... access mesh data in different program, but already loaded in the GPU

Related topics