I am working on the Jetson Thor platform and trying to set up flash_attn (Flash Attention) for my projects.
The Issue:
When I attempt to install flash_attn by compiling it from source (e.g., via pip install flash-attn --no-binary flash-attn), the system becomes unstable during the build process and ultimately restarts without completing the installation.
I suspect this is due to the high memory pressure or thermal constraints during the heavy compilation workload on the device.
Request:
- Does anyone have a pre-compiled wheel (
.whl) forflash_attnspecifically built for the Jetson Thor?
Any guidance or shared binaries would be greatly appreciated. I am happy to test any community builds.