Using wandb (Weights & Biases) with the TAO API the same way we use ClearML

TAO 4.0.0

Hi, I’m using TAO with a local k8s cluster. For telemetry I use ClearML (super easy to set up because we can include all the data in the values.yaml of the Helm chart), like this:

# Optional MLOPS setting for ClearML
clearMlWebHost: https://app.clear.ml
clearMlApiHost: https://api.clear.ml
clearMlFilesHost: https://files.clear.ml
clearMlApiAccessKey:  <ACCESS-KEY>
clearMlApiSecretKey: <API-KEY>

and by configuring the spec:

import requests

# Get the default specification schema for training.
endpoint = f"{base_url}/model/{model_ID}/specs/train/schema"
response = requests.get(endpoint, headers=headers, verify=rootca)
specs = response.json()["default"]

# Point the visualizer at our ClearML project.
specs["training_config"]["visualizer"]["clearml_config"] = {
    "project": "my_project",
    "tags": ["training", "tao_toolkit"],
    "task": "training_experiment_1",
}

This works!

But I’d like to try wandb in a similar manner.

I have a wandb server deployed in the local k8s cluster, and to stream telemetry I currently have to push log entries manually by repeatedly running REST API calls to retrieve data from TAO and calling

wandb.log({"key1": value1, "key2": value2, ... })

to send data to wandb local server.
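Roughly, the workaround looks like this (a rough sketch only: the job-status endpoint and the "status"/"result" fields are assumptions about how my TAO deployment reports a running job, not the documented API, and base_url, headers, rootca, model_ID come from the snippets above, with job_ID being whatever the train request returned):

import time

import requests
import wandb

# Log in to the local wandb server first (placeholders, as above).
wandb.login(key="local-API-KEY", host="http://my.local.domain:my_local_domain_port_number", relogin=True)
run = wandb.init(project="my_net", tags=["training", "tao_toolkit"])

# Assumed job-status endpoint; adjust to whatever your deployment exposes.
endpoint = f"{base_url}/model/{model_ID}/job/{job_ID}"

while True:
    response = requests.get(endpoint, headers=headers, verify=rootca)
    job = response.json()
    # Assumed shape: "result" holds the latest metrics, "status" the job state.
    metrics = job.get("result") or {}
    if metrics:
        wandb.log(metrics)
    if job.get("status") in ("Done", "Error"):
        break
    time.sleep(60)

run.finish()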

Is there a way to run

wandb.login(key="local-API-KEY", host="http://my.local.domain:my_local_domain_port_number", relogin=True)

command within the client that is in the k8s pod, so that all I have to do is include

...
specs["training_config"]["num_epochs"] = num_epochs
specs["training_config"]["visualizer"]["enabled"] = True

# Add the wandb_config section.
specs["training_config"]["visualizer"]["wandb_config"] = {
    "project": "my_net",
    "tags": ["training", "tao_toolkit"],
    "notes": "training_experiment_1",
}

Cheers,
Ganindu.

Sorry for the late reply. I will check further internally.

Cheers! Tbh it is not a big problem (only a nice-to-have atm). I am happy with ClearML.

On a more critical front:
I actually found that the API goes unresponsive when training on datasets that are not too small (around 25 GB). I suspect this is because I'm using local volume mounts for PVs rather than storage provisioners and network storage (we have some on the way); if that turns out to be the problem I will let you know. (Apologies for the out-of-context-ness; I will delete the question and ask again once I confirm, as I don't want to use this forum for k8s stuff.) I was expecting to plug our large datasets into AutoML with the TAO API kit, and I need to handle close to 25 GB at a minimum, so that concern takes priority over this issue.