Hi,
Nvidia DLA is designed specifically for the deep learning use case and is used to offload the GPU’s inference effort.
These engines improve energy efficiency and free up the GPU to run more complex networks or dynamic tasks implemented by the user.
It doesn’t target for performance but energy efficiency.
You can find some performance data in our Deepstream document to compare the results:
https://docs.nvidia.com/metropolis/deepstream/dev-guide/text/DS_Performance.html
Thanks.