Conversion of Owl-ViT model to ".engine" file which is used in metropolis GenAI application

akash.parikh · June 13, 2024, 7:36am

Hello all,

I am using mmj_genai application. It is using the google’s Owl-ViT model.

I want to use mmj_genai application to detect custom objects. So first I want to train the Owl-ViT model and then convert it to “.engine” file which is needed for mmj_genai application. I want to know how can I convert the Owl-ViT model to .engine file? In the GitHub they are using checkpoint only. How can I save the model from checkpoint and then convert it to “.engine” file?

kesong · June 13, 2024, 9:12am

There is guide for convert model to engine file (Build the TensorRT engine for the OWL-ViT vision encoder) in:GitHub - NVIDIA-AI-IOT/nanoowl at cfef75a8ad5fb8be0e3beb501a763661a9336d1d.
The model we use in the mmj_genai example is based on OWL-ViT from google. The OWL-ViT github repository has some resources on how to train it. scenic/scenic/projects/owl_vit at main · google-research/scenic · GitHub

akash.parikh · June 13, 2024, 9:50am

I looked at the Nanoowl github. Somehow I missed this. Thanks.

The training steps on scenic github repo is not working though. It is giving error related to DECODER for the dataset (‘lvis’ or ‘coco’ datasets).

akash.parikh · June 17, 2024, 5:24am

Have you ever tried fine-tuning the model?

kesong · June 17, 2024, 6:36am

Please file new topic if you have more question. Thanks!

system · July 1, 2024, 6:37am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.