Nvidia-cutlass-dsl sbsa wheels

nvidia-cutlass-dsl only provides one python version and only x86

I was working to compile for aarch64:

#!/usr/bin/env bash
echo "Building cutlass $CUTLASS_VERSION"
set -ex

SRC=/opt/cutlass
WHL=/opt/wheels

export MAX_JOBS="$(nproc)"
export CMAKE_BUILD_PARALLEL_LEVEL=$MAX_JOBS
echo "Building with MAX_JOBS=$MAX_JOBS and CMAKE_BUILD_PARALLEL_LEVEL=$CMAKE_BUILD_PARALLEL_LEVEL"


git clone --branch v$CUTLASS_VERSION --depth=1 https://github.com/NVIDIA/cutlass $SRC

cd $SRC
echo "Building cutlass wheel in $SRC"
pip3 wheel . -w $WHL

cd $SRC/python

python3 setup_library.py bdist_wheel --dist-dir $WHL
python3 setup_pycute.py bdist_wheel --dist-dir $WHL

cd $SRC

pip3 install $WHL/nvidia_cutlass*.whl $WHL/pycute*.whl
twine upload --verbose $WHL/nvidia_cutlass*.whl || echo "failed to upload wheel to ${TWINE_REPOSITORY_URL}"
twine upload --verbose $WHL/pycute*.whl || echo "failed to upload wheel to ${TWINE_REPOSITORY_URL}"

python3 -c 'import cutlass; print(cutlass.__version__)'

But it generates me:

and not:
nvidia-cutlass-dsl