How to do inference with fpenet_fp32.trt

I already have read the fpenet documentation from the TLT 3.0 documentation. However, what I need is to infer in Python code. fpenet inference or TLT CV inference Pipeline do not fit my requirements as I need to perform face alignment using the facial landmarks in a real-time face recognition pipeline that is built in Python. Note that I created the .trt engine file from your previous answer.

My problem is that the inference output from the engine file significantly differs from the model.tlt output and is not accurate. I appreciate it if you could help me solve this or find out what the problem is.

Thanks.