| Topic | Replies | Views | Activity |
|---|---|---|---|
| How to benchmark on Thor to get the real FP4/FP8 performance TFLOPS | 9 | 202 | January 8, 2026 |
| How to run MLPerf v3.1 with Thor | 2 | 15 | January 8, 2026 |
| What prompt processing speed can one expect above 500k ctx? | 6 | 255 | January 3, 2026 |
| Benchmarking and Optimizing Averager algorithm on Jetson Nano | 1 | 44 | December 14, 2025 |
| Run hpc_benchmark23.10 HPL with v100GPU | 4 | 1755 | November 24, 2025 |
| Clarification on CUDA IPC: Does cudaMemcpyDeviceToDevice guarantee remote memory visibility? | 0 | 42 | November 13, 2025 |
| Thor torch.mm benchmark results (float32/float16/float8_e3m2fn) | 5 | 274 | September 15, 2025 |
| nVidia nVector - download and documentation | 3 | 3182 | August 5, 2025 |
| cuDNN vs cuBLAS performance on GEMMs | 0 | 101 | June 19, 2025 |
| Anyone has comparison of LLM engines(TRTLLM/VLLM/MLC)? | 3 | 504 | June 16, 2025 |
| Has Anyone Benchmarked (U-Net Segmentation) on Jetson Orin Series? | 2 | 204 | June 2, 2025 |
| Source Code of Cutlass GemmKernel from Basic Gemm | 1 | 95 | April 16, 2025 |
| Orin nano/nx ResNet-50 benchmark on R36.4.3(jetpack6.2) | 8 | 478 | March 24, 2025 |
| Orin nano benchmark on R36.4.3(jetpack6.2) | 14 | 542 | February 26, 2025 |
| Issue encountered while executing jetson_benchmarks from GitHub | 3 | 160 | December 3, 2024 |
| FPS calculation (estimate) for NVIDIA RTX 2000 Ada Generation Embedded GPU | 0 | 88 | November 3, 2024 |
| ONNX engine initialisation/build takes significantly longer in TensorRT 8.5 vs 8.0 | 10 | 1581 | August 20, 2024 |
| Fp32 precision support on Jetson AGX Orin | 2 | 571 | June 4, 2024 |
| Tx2 Benchmarks error | 3 | 307 | May 21, 2024 |
| Compare cpu vs gpu execution time with google benchmark | 0 | 593 | February 15, 2024 |
| Freeze when running benchmarks | 14 | 1111 | December 15, 2023 |
| Jetson Orin Developer Kit - unexpected drop in PCIe transfer speed | 4 | 871 | December 6, 2023 |
| Jetson_benchmark Minimum memory requirements | 19 | 1237 | November 14, 2023 |
| Jetson_benchmarks got Error opening engine file | 7 | 1053 | September 7, 2023 |
| Isaac Sim very slow compared to Mujoco or PyBullet (both physics and rendering) | 5 | 2878 | April 5, 2024 |
| L4 Quality vs throughput with FFMPEG | 0 | 687 | July 21, 2023 |
| Jetson Xavier NX slower than Jetson TX2 at pytorch inferences | 4 | 638 | June 29, 2023 |
| Floating point exception when running HPC-Benchmark:23.3 | 0 | 921 | April 28, 2023 |
| Questions about whether HPL uses Tensor Core in A100 | 3 | 983 | April 27, 2023 |
| L40 vs. RTX 6000 Ada FP16/FP8 throughput? | 7 | 15941 | April 4, 2023 |