Hi @scousins, you’re right, and I owe you a correction on my first bullet. There isn’t a newer TensorFlow container to point you at: 25.02 is the final one, and my “pull a newer tag” note was wrong for TensorFlow specifically. Apologies for sending you hunting for something that doesn’t exist.
Here’s what actually happened. NVIDIA stopped publishing the Optimized TensorFlow containers after 25.02. It’s in the 25.02 release notes:
After the 25.02 release, NVIDIA Optimized TensorFlow containers will no longer be released.
NVIDIA staff have said the same on the forums, on the “ptxas too old” report for this exact image: Compile error with ptxas too old on latest version(nvcr.io/nvidia/tensorflow:25.02-tf2-py3)
So 25.02 (which is CUDA 12.8, not 13) is the end of the line for TF. The rest of the NGC framework line did move to CUDA 13 and is on the 26.x tags now, which is where my mix-up came from. TensorFlow just didn’t make that jump.
The URLs you asked for:
Good news for your setup: H200 is Hopper (sm_90), which CUDA 12.8 supports fully, so 25.02 is the right and final container for your hardware. The hard “ptxas too old” failure in that thread is on DGX Spark (GB10 / sm_121), a newer architecture than your Hopper card, so it doesn’t apply to you. What you have is the milder case: the +ptx85 line is the container’s older toolchain ignoring a newer PTX feature flag, which on sm_90 isn’t a correctness problem by itself.
Realistic options:
1. Stay on 25.02 (lowest effort, and correct for H200)
It’s the final, validated TF image and it fully supports sm_90; your CUDA 13.2 driver runs it via forward compatibility. Before you treat the +ptx85 warning as a blocker, confirm it’s actually costing you step time. Some XLA JIT compilation at startup is normal on any container version, so the warning by itself may just be log noise.
2. Build your own CUDA 13 + TF image (only if you specifically need the CUDA 13 toolchain)
No NVIDIA-optimized path here, but you can layer the upstream tensorflow pip wheels (or build from source) on an nvcr.io/nvidia/cuda:13.x base. You give up the NGC tuning and own the XLA/cuDNN compatibility yourself.
3. Switch frameworks for an NVIDIA-optimized CUDA 13 image
Only if your workload isn’t TF-locked: the PyTorch NGC container, for example, is still on the monthly cadence and on CUDA 13 (the 26.x tags).
If you can share whether you’re seeing an actual step-time regression (vs just the warning in the log), I can help you decide whether option 1 is good enough or whether you actually need the rebuild in option 2.
Thanks,
Atharva