Flash Attention/Torch SDPA on Orin Nano Devleoper kit

Hi ,
I am trying to build convernsation ai tool on Nvidea jetson orin nano 8gb
using distil whisper medium

what all the optimization possible for shorter audio like <10 second
is the Flash Attenntion or torch sdpa is avaialble

also will the streaming will help for shorter audios ?

Thanks
Navene

Hi,

Please find below related topics for info:

Thanks.

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.