Loading a pre-trained model into NeMo

Hello,

I’m just getting started with NeMo and I’m trying to load a pre-trained model. The steps I’d like to accomplish are:

  1. Load a pre-trained QA (question answering) model
  2. Load some context
  3. Ask a question and get an answer
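
In code, the end-to-end flow I'm hoping for looks roughly like this. Note that everything past `restore_from()` is a guess on my part — I haven't confirmed what the QA inference API actually looks like, which is partly why I'm asking:

```python
def qa_flow_sketch():
    # Rough sketch of the flow I want -- only restore_from() is taken from
    # the NeMo tutorial; the inference step below is hypothetical.
    from nemo.collections.nlp.models.question_answering.qa_gpt_model import GPTQAModel

    # 1. Load the pre-trained QA model from a .nemo checkpoint
    model = GPTQAModel.restore_from(restore_path='./finmegatron_gpt2_en_uncase.nemo')

    # 2. Load some context and 3. ask a question
    context = "NVIDIA NeMo is a toolkit for building conversational AI models."
    question = "What is NeMo?"

    # Hypothetical call -- this is exactly the part I don't know how to do
    answer = model.inference(context, question)
    return answer
```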

I’m using NeMo and I’m loading a Financial Megatron model, finmegatron345m_gpt2_bpe. I suspect my problem is in the NeMo configuration. I’m using the basic QA config provided on GitHub (qa_conf.yaml) and following this notebook (Question_Answering.ipynb).

I’ve been tweaking the configuration to point at my pre-trained model, but I’m getting a variety of errors; most recently, one saying that I need to load a language model checkpoint.

Traceback (most recent call last):
  File "/home/ubuntu/finance.py", line 14, in <module>
    model = GPTQAModel.restore_from(restore_path='/home/ubuntu/finmegatron_gpt2_en_uncase.nemo', override_config_path='/home/ubuntu/qa_conf.yml')
  File "/home/ubuntu/.pyenv/versions/3.10.9/lib/python3.10/site-packages/nemo/core/classes/modelPT.py", line 330, in restore_from
    instance = cls._save_restore_connector.restore_from(
  File "/home/ubuntu/.pyenv/versions/3.10.9/lib/python3.10/site-packages/nemo/core/connectors/save_restore_connector.py", line 235, in restore_from
    loaded_params = self.load_config_and_state_dict(
  File "/home/ubuntu/.pyenv/versions/3.10.9/lib/python3.10/site-packages/nemo/core/connectors/save_restore_connector.py", line 158, in load_config_and_state_dict
    instance = calling_cls.from_config_dict(config=conf, trainer=trainer)
  File "/home/ubuntu/.pyenv/versions/3.10.9/lib/python3.10/site-packages/nemo/core/classes/common.py", line 507, in from_config_dict
    raise e
  File "/home/ubuntu/.pyenv/versions/3.10.9/lib/python3.10/site-packages/nemo/core/classes/common.py", line 499, in from_config_dict
    instance = cls(cfg=config, trainer=trainer)
  File "/home/ubuntu/.pyenv/versions/3.10.9/lib/python3.10/site-packages/nemo/collections/nlp/models/question_answering/qa_gpt_model.py", line 47, in __init__
    self.language_model = MegatronGPTModel.restore_from(cfg.language_model.lm_checkpoint, trainer=trainer)
  File "/home/ubuntu/.pyenv/versions/3.10.9/lib/python3.10/site-packages/nemo/core/classes/modelPT.py", line 319, in restore_from
    restore_path = os.path.abspath(os.path.expanduser(restore_path))
  File "/home/ubuntu/.pyenv/versions/3.10.9/lib/python3.10/posixpath.py", line 231, in expanduser
    path = os.fspath(path)
TypeError: expected str, bytes or os.PathLike object, not NoneType
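
Reading the traceback, `GPTQAModel.__init__` calls `MegatronGPTModel.restore_from(cfg.language_model.lm_checkpoint, ...)`, and that field is evidently `None` (or missing) in my config. I assume the relevant part of qa_conf.yaml looks something like this — the structure is inferred from the traceback, so take the exact field names with a grain of salt:

```yaml
model:
  language_model:
    lm_checkpoint: null   # presumably this null is what trips the TypeError
```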

My code is super simple:

import pytorch_lightning as pl

from nemo.collections.nlp.models.question_answering.qa_gpt_model import GPTQAModel

pl.seed_everything(42)

# Restore the pre-trained .nemo checkpoint, overriding its config with the
# QA config from the tutorial
model = GPTQAModel.restore_from(
    restore_path='./finmegatron_gpt2_en_uncase.nemo',
    override_config_path='./qa_conf.yml',
)
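
The next thing I was planning to try is setting the checkpoint path in the config explicitly before restoring, something like the sketch below. The `model.language_model.lm_checkpoint` key is inferred from the traceback, and I'm not sure pointing it at the same .nemo file is even the right idea, so this may be completely off:

```python
def restore_with_explicit_checkpoint():
    # Imports kept inside the function because I only run this in my NeMo env
    from omegaconf import OmegaConf
    from nemo.collections.nlp.models.question_answering.qa_gpt_model import GPTQAModel

    cfg = OmegaConf.load('./qa_conf.yml')
    # Guess: point the QA model's inner language model at the same checkpoint
    cfg.model.language_model.lm_checkpoint = './finmegatron_gpt2_en_uncase.nemo'
    return GPTQAModel.restore_from(
        restore_path='./finmegatron_gpt2_en_uncase.nemo',
        override_config_path=cfg,
    )
```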

When reading through the pre-trained model info, it doesn’t say anything about needing to include a language model checkpoint. Is there any simple documentation on just loading up a pre-trained model in NeMo and running an inference pass?

Thanks!