How do I effectively manage and troubleshoot the use of H100 GPUs in Confidential Computing (CC) mode, especially in multi-GPU systems with NVLink, and ensure seamless integration with both Intel TDX and AMD SEV-SNP environments? I am encountering issues such as SPDM session failures, GPU pass-through problems, and attestation errors, particularly when running non-CC workloads or mixing CC-enabled and CC-disabled GPUs. Additionally, I need clarification on the compatibility of H100 GPUs with different guest OS configurations and ensuring secure communication across multiple GPUs in CC mode. Could someone provide insights or resources on optimizing GPU setup, firmware updates, and handling DMA traffic in these complex scenarios?
sbellock
2
The questions asked are too broad to be answerable. You’ll have better luck asking for help with specific issues.
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| How to run H100 GPU without CC Mode? | 5 | 671 | February 28, 2024 | |
| Userguide to get started with H100 GPUs? | 6 | 1486 | January 23, 2024 | |
| Does H800 support HCC? | 5 | 350 | February 24, 2025 | |
| Pass-through cc-disabled H100 to a non-confidential VM | 0 | 467 | May 23, 2024 | |
| Use CC in multi-GPU system (with NvSwitch) | 6 | 1345 | October 28, 2023 | |
| Confidential Compute on GitHub? | 2 | 768 | July 26, 2023 | |
| Confidential Computing on NVIDIA H100 GPUs for Secure and Trustworthy AI | 1 | 814 | July 3, 2024 | |
| Can I test Confidential Computing with H100 and TDX on Different Machines | 2 | 338 | January 6, 2025 | |
| Announcing Confidential Computing General Access on NVIDIA H100 Tensor Core GPUs | 2 | 366 | May 14, 2025 | |
| Is it possible for an H100 without CC mode to run in CVM, like TDX? | 1 | 192 | August 12, 2024 |