I am trying to install bitsandbytes from source on my NVIDIA Jetson AGX Orin (64 GB), which has compute capability 8.7 and CUDA 12.6, because the pip package didn't work in my case. I used these commands:
git clone https://github.com/TimDettmers/bitsandbytes.git
cd bitsandbytes
cmake ..
CUDA_VERSION=126 python setup.py install
To check whether the bitsandbytes module was installed and working, I ran this command:
python -m bitsandbytes
but I got the following errors:
bitsandbytes library load error: Configured CUDA binary not found at /mnt/storage/LogLLM/bitsandbytes/bitsandbytes/libbitsandbytes_cuda126.so
If you are using Intel CPU/XPU, please install intel_extension_for_pytorch to enable required ops
Traceback (most recent call last):
File "/mnt/storage/LogLLM/bitsandbytes/bitsandbytes/cextension.py", line 319, in <module>
lib = get_native_library()
File "/mnt/storage/LogLLM/bitsandbytes/bitsandbytes/cextension.py", line 282, in get_native_library
raise RuntimeError(f"Configured {BNB_BACKEND} binary not found at {cuda_binary_path}")
RuntimeError: Configured CUDA binary not found at /mnt/storage/LogLLM/bitsandbytes/bitsandbytes/libbitsandbytes_cuda126.so
================ bitsandbytes v0.47.0.dev0 =================
Platform: Linux-5.15.148-tegra-aarch64-with-glibc2.35
libc: glibc-2.35
Python: 3.10.18
PyTorch: 2.6.0+cu126
CUDA: 12.6
HIP: N/A
XPU: N/A
Related packages:
accelerate: 1.8.1
diffusers: 0.34.0
numpy: 2.2.6
pip: 25.1
peft: 0.15.2
safetensors: 0.5.3
transformers: 4.53.0
triton: not found
trl: not found
============================================================
PyTorch settings found: CUDA_VERSION=126, Highest Compute Capability: (8, 7).
Library not found: /mnt/storage/LogLLM/bitsandbytes/bitsandbytes/libbitsandbytes_cuda126.so. Maybe you need to compile it from source?
Checking that the library is importable and CUDA is callable...
Traceback (most recent call last):
File "/mnt/storage/miniconda3/envs/logllm/lib/python3.10/runpy.py", line 196, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/mnt/storage/miniconda3/envs/logllm/lib/python3.10/runpy.py", line 86, in _run_code
exec(code, run_globals)
File "/mnt/storage/LogLLM/bitsandbytes/bitsandbytes/__main__.py", line 4, in <module>
main()
File "/mnt/storage/LogLLM/bitsandbytes/bitsandbytes/diagnostics/main.py", line 107, in main
raise e
File "/mnt/storage/LogLLM/bitsandbytes/bitsandbytes/diagnostics/main.py", line 96, in main
sanity_check()
File "/mnt/storage/LogLLM/bitsandbytes/bitsandbytes/diagnostics/main.py", line 35, in sanity_check
p1 = p.data.sum().item()
RuntimeError: CUDA error: no kernel image is available for execution on the device
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.
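
In case it helps anyone reproduce this, here is a small diagnostic sketch for the two errors above. It lists which libbitsandbytes*.so files exist in the package directory, and it checks whether the GPU's compute capability appears in the architecture list the PyTorch build was compiled for. The helper names (`find_native_libs`, `arch_tag`, `arch_in_build`) are my own and not part of bitsandbytes or PyTorch. The file name pattern is assumed from the error message. The exact `sm_XY` match is a simplification, so a miss is only a hint that the PyTorch wheel may not ship kernels for this device.

```python
from pathlib import Path


def find_native_libs(package_dir):
    """Return the sorted names of libbitsandbytes*.so files in package_dir.

    The file name pattern is an assumption taken from the error message
    (libbitsandbytes_cuda126.so); an empty list means nothing was built there.
    """
    return sorted(p.name for p in Path(package_dir).glob("libbitsandbytes*.so"))


def arch_tag(capability):
    """Turn a (major, minor) compute capability into an 'sm_XY' tag."""
    major, minor = capability
    return f"sm_{major}{minor}"


def arch_in_build(capability, arch_list):
    """Check for an exact 'sm_XY' entry in the build's architecture list.

    Simplification: only an exact match is looked for.
    """
    return arch_tag(capability) in arch_list


if __name__ == "__main__":
    # The PyTorch part only runs when torch is installed and sees a GPU.
    try:
        import torch
    except ImportError:
        torch = None

    if torch is not None and torch.cuda.is_available():
        cap = torch.cuda.get_device_capability()
        archs = torch.cuda.get_arch_list()
        print("device:", arch_tag(cap))
        print("build archs:", archs)
        print("exact match in build:", arch_in_build(cap, archs))
```

To check the first error, `find_native_libs` can be called with the bitsandbytes/bitsandbytes directory shown in the traceback. If the result does not contain libbitsandbytes_cuda126.so, the CUDA binary was never produced by the build.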