Problems with shared batch in TRT APIs

Because TrT assumes the same batch throughout the entire network, ops that use fixed (known statically) but different batch dimension for a specific purpose get problematic. Please add support for true variable-fixed batch in TRT.

In ptic this means that first dimension is not always semantically batch.