[SUPPORT] Workbench Example Project: Llama 2 Finetune

Hi! This is the support thread for the Llama 2 Finetune Example Project on GitHub. Any major updates we push to the project will be announced here. Further, feel free to discuss, raise issues, and ask for assistance in this thread.

Note: You need to signup for the NeMo Frameworks GA to access the container used in the Project.

Please keep discussion in this thread project-related. Any issues with the Workbench application should be raised as a standalone thread. Thanks!

Is this project in place to augment the SDXL project, or is it specifically set up as a standalone?

Build failed with this error: STEP 1/18: FROM nvcr.io/ea-bignlp/ga-participants/nemofw-training:23.08.03
Trying to pull nvcr.io/ea-bignlp/ga-participants/nemofw-training:23.08.03…
Error: creating build container: initializing source docker://nvcr.io/ea-bignlp/ga-participants/nemofw-training:23.08.03: reading manifest 23.08.03 in nvcr.io/ea-bignlp/ga-participants/nemofw-training: authentication required

Hi Brian,

Check the following:

  • Do you have an NGC api key configured in workbench?
  • Did you sign up for the GA program to get access to this container?

Tyler

This Project is not related to the SDXL example. It should be “stand alone” in that you can use it without needing anything else.

1 Like

Yes and no. I couldn’t find the second item.

Found the proper link to be able to log into the NGC.

I am trying to to run this projects on remote machine… cloned it successfully, but stuck on how and where to download the model? I have credentials (integrations) created in my local machine…I have access to HF model below

but what command and from what machine I need to type to download it?
the instruction in the ReadMe shows below but it’s not working!!
git lfs clone https://huggingface.co//

which I substituted with
git lfs clone https://huggingface.co/hijer/https://huggingface.co/meta-llama/Llama-2-7b-hf

Yes. The way the project is currently written is confusing. Note that your namespace hijer shouldn’t be in the URL.

  1. Then, go to the command line for Workbench, activate the proper location, and open the Project with:
    nvwb activate <location_name>
    nvwb open <project_name>
  2. Then, attach to the Project container on the remote host with
    nvwb attach
  3. Your prompt will change and you are in the container on the remote.
  4. Then you can try the clone command, and you may need to add your HF token when it prompts you.

I will have the person who created the Project respond here with other suggestions.