I’ve migrated one of my workstations to the VFX Reference Standard, which runs on Rocky 9 (RHEL-ish, for future searchers, which I always thought was a better name :)
This workstation has two RTX A5000 GPUs:
Group 0:
    Properties:
        physicalDevices: count = 2
            NVIDIA RTX A5000 (ID: 0)
            NVIDIA RTX A5000 (ID: 1)
        subsetAllocation = 0
and is running
| NVIDIA-SMI 545.23.08 Driver Version: 545.23.08 CUDA Version: 12.3 |
Some of my code seems to have a lot of trouble with Vulkan memory allocation.
For example, Unreal Engine (UE) logs many warnings like this at startup:
[2024.02.08-13.52.25:141][ 0]LogVulkanRHI: Warning: Failed to allocate Device Memory, Requested=131072.00Kb MemTypeIndex=1
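For scale, that request works out to 128 MiB, which is small next to what these cards carry. Here is a minimal sketch for pulling the size and memory type index out of such warnings, assuming they all follow the format of the line above. The function name and regex are my own, not anything from UE.

```python
import re

# Pattern assumes the warning format shown in the log line above.
PATTERN = re.compile(r"Requested=(?P<kb>[\d.]+)Kb\s+MemTypeIndex=(?P<idx>\d+)")

def parse_alloc_failure(line):
    """Return (requested_mib, mem_type_index), or None if the line does not match."""
    m = PATTERN.search(line)
    if m is None:
        return None
    # The log reports the size in KB; divide by 1024 to get MiB.
    return float(m.group("kb")) / 1024.0, int(m.group("idx"))

line = ("[2024.02.08-13.52.25:141][ 0]LogVulkanRHI: Warning: "
        "Failed to allocate Device Memory, Requested=131072.00Kb MemTypeIndex=1")
print(parse_alloc_failure(line))  # (128.0, 1)
```

Running this over the whole log would show whether every failure hits the same memory type index or whether the failures are spread across several.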
The vulkaninfo command core dumps part way through its report, right after the start of the Device Groups section.
I’ve tried various driver install strategies, without success. If vulkaninfo itself is crashing, it seems my troubles go beyond the user code, and I can’t find much out there about what next steps to take when the basic Vulkan tests don’t work.
Thoughts?