Say I have a classical encoder-decoder segmentation network, U-net like style. Also assume that the model has two outputs: one low-res output and one high-res output. The nature of this model implies that the high-res output is dependent on the low-res output and thus has a higher latency.
In PyTorch it is possible to query the low-res output before the high-res output has been computed. I was wondering if this is possible in TensorRT, so you don’t have to wait for entire computational graph to be computed, if you are just interested in the low-res output.