Hello everyone,
I am currently fine-tuning the FastPitch and HiFi-GAN models with the NeMo toolkit on a custom dataset, on a system with an NVIDIA GeForce RTX 3090 (24 GB). Despite reducing the batch size, enabling gradient accumulation, and applying other optimizations, I still encounter a “CUDA out of memory” error.
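For reference, this is roughly the command I have been running. The script path, config name, and config keys follow the stock NeMo FastPitch example layout as I understand it and may differ in other NeMo versions; the manifest paths are placeholders:

```bash
# Fine-tuning FastPitch with a smaller batch, gradient accumulation,
# and 16-bit precision to reduce memory pressure:
python examples/tts/fastpitch.py \
  --config-name=fastpitch_align_v1.05 \
  train_dataset=./data/train_manifest.json \
  validation_datasets=./data/val_manifest.json \
  model.train_ds.dataloader_params.batch_size=8 \
  model.validation_ds.dataloader_params.batch_size=8 \
  trainer.accumulate_grad_batches=4 \
  trainer.precision=16
```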
I am considering using a data collator to manage memory more efficiently, for example by padding each batch only to its own longest sample rather than a fixed maximum length, but I cannot find a corresponding option in NeMo. Could anyone guide me on how to implement or customize a data collator within NeMo? Alternatively, are there other strategies within NeMo that I could use to overcome this memory issue? A minimal sketch of the kind of collator I have in mind follows.
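For illustration only: a plain PyTorch `collate_fn` with dynamic padding, which is what I am picturing. The dataset item keys (`audio`, `text_tokens`) are placeholder names I made up, not NeMo's actual batch schema, and I am unsure where such a function would plug into a NeMo `train_ds` config:

```python
import torch
from torch.nn.utils.rnn import pad_sequence


def dynamic_pad_collate(batch):
    """Pad each field only to the longest sample in this batch,
    so short utterances don't waste memory on fixed-size padding.

    Assumes each item is a dict of 1-D tensors under the keys
    'audio' and 'text_tokens' (placeholder names, not NeMo's schema).
    """
    audio = [item["audio"] for item in batch]
    tokens = [item["text_tokens"] for item in batch]

    return {
        "audio": pad_sequence(audio, batch_first=True),
        "audio_lens": torch.tensor([a.size(0) for a in audio]),
        "text_tokens": pad_sequence(tokens, batch_first=True),
        "token_lens": torch.tensor([t.size(0) for t in tokens]),
    }


# With a plain DataLoader this would be:
# loader = torch.utils.data.DataLoader(
#     dataset, batch_size=8, collate_fn=dynamic_pad_collate)
```

Is there a supported way to attach something like this to a NeMo TTS dataset, or does NeMo already pad dynamically so the gain would be elsewhere?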
Thank you for your assistance!
Best regards,
Hasan Maqsood