Save image sequence with Mask R-CNN inference masks in Python

The error above was indeed caused by the missing models. I downloaded them with:
wget https://nvidia.box.com/shared/static/8k0zpe9gq837wsr0acoy4oh3fdf476gq.zip -O models.zip

The problem now is that the video shows only bounding boxes and text labels, but no masks. Why are the masks missing?
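
For reference, this is roughly what I expect to see on each saved frame. It is only a minimal sketch with NumPy, assuming each instance mask can be obtained as a boolean array at frame resolution (I have not confirmed how the pipeline exposes it). The names `overlay_mask` and `save_ppm` are my own helpers, not part of any NVIDIA API, and PPM is used only to keep the example free of extra image libraries.

```python
import numpy as np


def overlay_mask(frame, mask, color=(0, 255, 0), alpha=0.5):
    """Blend `color` into `frame` wherever `mask` is True.

    frame: HxWx3 uint8 RGB image.
    mask:  HxW boolean array (assumed layout, not taken from the pipeline).
    Returns a new uint8 image; the input frame is left unchanged.
    """
    out = frame.astype(np.float32)
    tint = np.asarray(color, dtype=np.float32)
    # Alpha-blend only the masked pixels; the rest keep their original value.
    out[mask] = (1.0 - alpha) * out[mask] + alpha * tint
    return out.round().astype(np.uint8)


def save_ppm(path, frame):
    """Write an HxWx3 uint8 RGB frame as a binary PPM (P6) file."""
    height, width, _ = frame.shape
    with open(path, "wb") as f:
        f.write(f"P6\n{width} {height}\n255\n".encode("ascii"))
        f.write(np.ascontiguousarray(frame).tobytes())


def save_sequence(frames, masks, out_dir):
    """Save one numbered PPM per frame, each with its mask overlaid."""
    for i, (frame, mask) in enumerate(zip(frames, masks)):
        save_ppm(f"{out_dir}/frame_{i:04d}.ppm", overlay_mask(frame, mask))
```

Calling `save_sequence(frames, masks, "out")` with matching lists of frames and masks would write `out/frame_0000.ppm`, `out/frame_0001.ppm`, and so on, provided the `out` directory already exists.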