To deploy a model, you will also need some memory for input/output/intermediate tensor.
As a result, the real required memory is much more than the model itself.
Would you mind to check the memory status with tegrastats and share with us?
$ sudo tegrastats
Please also try the configure shared below to see if helps.