[SUPPORT] Workbench Example Project: Llama 2 Finetune

edwli · January 9, 2024, 5:43pm

Hi! This is the support thread for the Llama 2 Finetune Example Project on GitHub. Any major updates we push to the project will be announced here. Further, feel free to discuss, raise issues, and ask for assistance in this thread.

Note: You need to sign up for the NeMo Framework GA to access the container used in the Project.

Please keep discussion in this thread project-related. Any issues with the Workbench application should be raised as a standalone thread. Thanks!

brian_d_scott · February 13, 2024, 4:05pm

Is this project in place to augment the SDXL project, or is it specifically set up as a standalone?

brian_d_scott · February 14, 2024, 2:21am

Build failed with this error:

STEP 1/18: FROM nvcr.io/ea-bignlp/ga-participants/nemofw-training:23.08.03
Trying to pull nvcr.io/ea-bignlp/ga-participants/nemofw-training:23.08.03...
Error: creating build container: initializing source docker://nvcr.io/ea-bignlp/ga-participants/nemofw-training:23.08.03: reading manifest 23.08.03 in nvcr.io/ea-bignlp/ga-participants/nemofw-training: authentication required

twhitehouse · February 14, 2024, 2:28am

Hi Brian,

Check the following:

1. Do you have an NGC API key configured in Workbench?
2. Did you sign up for the GA program to get access to this container?
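
If it helps to test both points outside Workbench, here is a rough shell sketch. It assumes Docker is installed (Podman accepts the same login flags) and that your key is exported in a variable named NGC_API_KEY, which is a name chosen for this example and not something Workbench requires. check_ngc_access is a hypothetical helper. NGC registry logins use the literal username $oauthtoken with the API key as the password.

```shell
#!/bin/sh
# Image name copied from the build error in this thread.
IMAGE="nvcr.io/ea-bignlp/ga-participants/nemofw-training:23.08.03"

# check_ngc_access is a hypothetical helper, not part of Workbench.
# NGC_API_KEY is an assumed variable name holding your NGC API key.
check_ngc_access() {
  if [ -z "${NGC_API_KEY:-}" ]; then
    echo "NGC_API_KEY is not set" >&2
    return 2
  fi
  # The username is the literal string $oauthtoken (single-quoted so the
  # shell does not expand it); the API key is read from stdin as the password.
  printf '%s' "$NGC_API_KEY" |
    docker login nvcr.io --username '$oauthtoken' --password-stdin || return 1
  # Pulling requires access to the GA container. Note that this downloads
  # the full image, which is large.
  docker pull "$IMAGE"
}

# Usage (not run automatically):
#   export NGC_API_KEY=<your key>
#   check_ngc_access
```

If the login step fails, the key itself is likely the problem. If the login succeeds but the pull is still refused, the account probably does not have access to the GA container yet.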

Tyler

twhitehouse · February 14, 2024, 2:30am

This Project is not related to the SDXL example. It should be “stand alone” in that you can use it without needing anything else.

brian_d_scott · February 14, 2024, 2:51am

Yes and no. I couldn’t find the second item.

brian_d_scott · February 14, 2024, 2:37pm

Found the proper link to log in to NGC.

raed_hijer · February 20, 2024, 1:36pm

I am trying to run this project on a remote machine. I cloned it successfully, but I’m stuck on how and where to download the model. I have credentials (integrations) created on my local machine, and I have access to the HF model below.

But what command do I need to run to download it, and on which machine?
The instructions in the README show the command below, but it’s not working:
git lfs clone https://huggingface.co//

I replaced it with:
git lfs clone https://huggingface.co/hijer/https://huggingface.co/meta-llama/Llama-2-7b-hf

twhitehouse · February 20, 2024, 1:50pm

Yes. The way the project is currently written is confusing. Note that your namespace hijer shouldn’t be in the URL.

Then, go to the command line for Workbench, activate the proper location, and open the Project with:
nvwb activate <location_name>
nvwb open <project_name>
Then, attach to the Project container on the remote host with:
nvwb attach
Your prompt will change, and you will be inside the container on the remote host.
Then you can try the clone command, and you may need to add your HF token when it prompts you.
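
To make the URL fix concrete: the clone address is just https://huggingface.co/ followed by the model's own owner and name, with nothing from your account in between. Below is a minimal shell sketch, assuming the model is meta-llama/Llama-2-7b-hf (taken from the URL posted above). hf_clone_url is a hypothetical helper written for this example, not something the Project or Workbench provides.

```shell
#!/bin/sh
# hf_clone_url is a hypothetical helper. It turns a repo id of the form
# <owner>/<model> into a clone URL and rejects full URLs, which is the
# mistake that produced the doubled address earlier in the thread.
hf_clone_url() {
  case "$1" in
    *://*)
      echo "expected <owner>/<model>, got a full URL: $1" >&2
      return 1
      ;;
    */*/*)
      echo "expected exactly one slash in: $1" >&2
      return 1
      ;;
    ?*/?*)
      printf 'https://huggingface.co/%s\n' "$1"
      ;;
    *)
      echo "expected <owner>/<model>, got: $1" >&2
      return 1
      ;;
  esac
}

# Usage, inside the Project container after `nvwb attach` (not run here):
#   git lfs clone "$(hf_clone_url meta-llama/Llama-2-7b-hf)"
```

Recent Git LFS releases mark `git lfs clone` as deprecated in favor of plain `git clone`, so if the command prints a deprecation warning, `git clone` with the same URL should behave the same as long as Git LFS is installed.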

I will have the person who created the Project respond here with other suggestions.