Hi jungminash, yes you can run the same model in parallel on both DLA engine. It would run as two independent instances of the same model (i.e. the layers within the model would not be split between DLA’s, each DLA engine would run the model independently).
If all the layers in your model are supported on DLA, when building your TensorRT you can disable GPU fallback by calling IBuilder::allowGPUFallback(false). Otherwise, if there are some layers in your model that aren’t supported on DLA, the TensorRT engine will fail to build unless GPU fallback is enabled with IBuilder::allowGPUFallback(true)