Clarification on Expected Structure of Per-Year .zarr Files for StormCast Training

renato14 · June 2, 2025, 4:54pm

I’m trying to use the existing data_loader_hrrr_era5 (in examples/generative/stormcast/datasets/data_loader_hrrr_era5.py) to train the regression model (e.g. via regression.train). The documentation describes the folder layout under <location>: separate era5/ and <hrrr_dataset_name>/ directories, each containing per-year .zarr files inside their respective train/, valid/, and test/ subdirectories. However, it does not describe what each .zarr file should contain internally.

Specifically, I’d like to know:

For ERA5 per-year .zarr files, how the variables should be structured (dimensions, coordinate names, etc.).

For HRRR per-year .zarr files, what the internal layout must look like and how the variables should be structured (dimensions, coordinate names, etc.).

Without knowing the exact schema, it’s difficult to build a custom Zarr export that the loader can consume. Any examples, minimal specs, or references to how the original datasets were organized (paths + internal variable names) would be extremely helpful. Thanks in advance for any pointers!
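
For reference, this is the on-disk layout as I currently understand it from the documentation, written as a small path-building sketch. It covers only the directory structure, not the internal schema I’m asking about. The `<year>.zarr` file name, the function name `expected_store_paths`, and the example values `/data/stormcast` and `hrrr_example` are my own assumptions and placeholders, not something confirmed by the docs or the loader.

```python
from pathlib import Path

# Split subdirectories named in the documentation.
SPLITS = ("train", "valid", "test")


def expected_store_paths(location, hrrr_dataset_name, years_by_split):
    """Return the per-year store paths implied by the documented folder layout.

    ASSUMPTION: each store is named "<year>.zarr". The documentation only says
    "per-year .zarr files", so the real naming may differ.
    """
    root = Path(location)
    paths = []
    # One directory for ERA5 and one for the HRRR dataset, side by side.
    for source in ("era5", hrrr_dataset_name):
        for split in SPLITS:
            for year in years_by_split.get(split, ()):
                paths.append(root / source / split / f"{year}.zarr")
    return paths


if __name__ == "__main__":
    # Placeholder location, dataset name, and years, for illustration only.
    years = {"train": [2018, 2019], "valid": [2020], "test": [2021]}
    for path in expected_store_paths("/data/stormcast", "hrrr_example", years):
        print(path.as_posix())
```

If this layout or naming is wrong, a correction on that would help as well.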
