However, not seeing this happening – when deserializeCudaEngine is called, it seems to go straight to initialize() without configureWithFormat() ever being called.
Any other way to let custom plugin know the input dimensions before it is initialized?
I suppose you’re talking about your own plugin layers?
I agree their documentation is not great in this regard. However, when you’re using deserializeCudaEngine, you read from some serialized data. It’s up to you what you’d like to write to that data, and you can write to it things that configureWithFormat might do. In other words, write all you need to the serialization stream to be able to reinstantiate your plugin later.