|
NVIDIA A5000 - How to get full specs and how to compare cards?
|
|
3
|
1650
|
June 11, 2026
|
|
Toolery 0.1.0 - a deterministic tool-calling benchmark for local LLMs
|
|
7
|
513
|
June 2, 2026
|
|
Nemotron-3-Super-120B-A12B-NVFP4 on single DGX Spark: 23.45 tok/s (spark-arena.com/ benhmarks)
|
|
6
|
843
|
May 26, 2026
|
|
Collecting eval results for Spark-sized quants of models
|
|
50
|
1883
|
May 11, 2026
|
|
NIM vs Ollama on RTX 5090: 7.3x Faster Inference + NeMo Guardrails at 2.1% Overhead — 870 Data Points
|
|
0
|
343
|
March 31, 2026
|
|
Request for Approval to Publish DeepStream Benchmarking Results - NMS Layer Placement Analysis
|
|
1
|
66
|
March 16, 2026
|
|
How to benchmark on Thor to get the real FP4/FP8 performance TFOPS
|
|
10
|
496
|
March 16, 2026
|
|
Tools recommended by NVIDIA for measuring DeepStream pipeline performance metrics on Jetson
|
|
4
|
112
|
March 2, 2026
|
|
Benchmarking VLM on Orin
|
|
6
|
346
|
March 2, 2026
|
|
How can I assess general knowledge on a benchmaxxed model?
|
|
1
|
1286
|
February 13, 2026
|
|
Support for openai_gptoss reasoning parser in vLLM, and its impact on the effective inference performance on Spark
|
|
7
|
726
|
January 26, 2026
|
|
How to run MLPerf v3.1 with Thor
|
|
2
|
85
|
January 8, 2026
|
|
What prompt processing speed can one expect above 500k ctx?
|
|
6
|
659
|
January 3, 2026
|
|
Benchmarking and Optimizing Averager algorithm on Jetson Nano
|
|
1
|
97
|
December 14, 2025
|
|
Run hpc_benchmark23.10 HPL with v100GPU
|
|
4
|
1849
|
November 24, 2025
|
|
Clarification on CUDA IPC: Does cudaMemcpyDeviceToDevice guarantee remote memory visibility?
|
|
0
|
63
|
November 13, 2025
|
|
Thor torch.mm benchmark results (float32/float16/float8_e3m2fn)
|
|
4
|
430
|
September 15, 2025
|
|
nVidia nVector - download and documentation
|
|
3
|
3247
|
August 5, 2025
|
|
cuDNN vs cuBLAS performance on GEMMs
|
|
0
|
141
|
June 19, 2025
|
|
Anyone has comparison of LLM engines(TRTLLM/VLLM/MLC)?
|
|
2
|
613
|
June 16, 2025
|
|
Has Anyone Benchmarked (U-Net Segmentation) on Jetson Orin Series?
|
|
1
|
353
|
June 2, 2025
|
|
Source Code of Cutlass GemmKernel from Basic Gemm
|
|
1
|
127
|
April 16, 2025
|
|
Orin nano/nx ResNet-50 benchmark on R36.4.3(jetpack6.2)
|
|
7
|
637
|
March 10, 2025
|
|
Orin nano benchmark on R36.4.3(jetpack6.2)
|
|
13
|
681
|
February 12, 2025
|
|
Issue encountered while executing jetson_benchmarks from GitHub
|
|
2
|
194
|
November 14, 2024
|
|
FPS calculation (estimate) for NVIDIA RTX 2000 Ada Generation Embedded GPU
|
|
0
|
127
|
November 3, 2024
|
|
ONNX engine initialisation/build takes significantly longer in TensorRT 8.5 vs 8.0
|
|
10
|
1699
|
August 20, 2024
|
|
Fp32 precision support on Jetson AGX Orin
|
|
1
|
656
|
May 7, 2024
|
|
Tx2 Benchmarks error
|
|
2
|
341
|
April 29, 2024
|
|
Compare cpu vs gpu execution time with google benchmark
|
|
0
|
616
|
February 15, 2024
|