|
About the Container: HPC category
|
|
0
|
792
|
February 1, 2020
|
|
Containerized illegal memory access requires restart of entire node
|
|
0
|
50
|
October 10, 2025
|
|
Broken HPC-Benchmarks container (v25.02/v25.04)
|
|
0
|
93
|
September 2, 2025
|
|
NCCL randomly crashes on Leonardo
|
|
5
|
343
|
June 6, 2025
|
|
Loop tiling not giving expected results
|
|
0
|
43
|
April 25, 2025
|
|
Cudss cannot link in win10
|
|
0
|
68
|
February 19, 2025
|
|
Undefined reference to `cusparsedcsrgemm2_sethpm_'
|
|
0
|
40
|
February 8, 2025
|
|
CUDA_ERROR_ILLEGAL_ADDRESS in OpenACC Code with Pointer Alias – cuEventSynchronize Error
|
|
0
|
48
|
February 6, 2025
|
|
Sendrecv_perf nccl-tests - The process needs to be terminated manually - Volatile GPU-Util: 100%
|
|
0
|
168
|
February 6, 2025
|
|
Nvshmem error in docker HPL benchmark
|
|
1
|
374
|
December 25, 2024
|
|
Relion 5.0.0 container does not work with Blush, etc
|
|
2
|
183
|
December 19, 2024
|
|
OpenCL Executable will not run in Runtime Container
|
|
0
|
177
|
October 27, 2024
|
|
OpenACC + MPI resources needed
|
|
0
|
48
|
September 25, 2024
|
|
H100 HPL results
|
|
0
|
496
|
June 29, 2024
|
|
cudaHostRegister with cudaHostRegisterIoMemory flag returns cudaErrorOperatingSystem
|
|
0
|
267
|
May 20, 2024
|
|
Building HPL on Rocky 9.2
|
|
0
|
182
|
May 6, 2024
|
|
Nvidia docker nvcr.io/nvidia/hpc-benchmarks:23.10 HPL running error at HPC ARM Developer-kit
|
|
2
|
1504
|
February 22, 2024
|
|
HPL check displays nan in Container 23.10
|
|
0
|
435
|
January 18, 2024
|
|
Slurmstepd error URL 21.09-tf2-py3 returned error code: 401 Unauthorized
|
|
0
|
496
|
January 11, 2024
|
|
Unable to make nccl work
|
|
0
|
328
|
December 20, 2023
|
|
ERROR: CUDA driver version is insufficient for CUDA runtime version using triton 22.08 image in Apptainer container
|
|
2
|
702
|
November 30, 2023
|
|
Could the `ldmatrix` re-written in C++ code?
|
|
0
|
538
|
September 13, 2023
|
|
Error trying to build Triton Inference Server container using Singularity and Docker bootstrap setting
|
|
1
|
1077
|
August 28, 2023
|
|
Cusparse spmm of matB matC that shares device pointer
|
|
0
|
419
|
May 22, 2023
|
|
Floating point exception when running HPC-Benchmark:23.3
|
|
0
|
921
|
April 28, 2023
|
|
23.1 docker images : some tags are missing
|
|
0
|
530
|
February 3, 2023
|
|
Mlnx-ofed 5.4: ibv_create_qp cannot malloc memory more than 4026 clients on one sigle node
|
|
1
|
1344
|
December 2, 2022
|
|
Would like some help in running the xhpl 21.4 container on slurm
|
|
0
|
1214
|
November 4, 2022
|
|
Enroot with nvhpc
|
|
0
|
1241
|
October 27, 2022
|
|
TensorFlow on sbatch/srun is way slower than on only srun or sbatch
|
|
1
|
1103
|
October 18, 2022
|