Parabricks 4.2.0-1 haplotypecaller error - cudaSafeCall() failed - out of memory

avenkatraman · October 17, 2023, 7:19am

Hi

I am running haplotypecaller nvcr.io/nvidia/clara/clara-parabricks:4.2.0-1 on AWS g4dn.12xlarge and I get this error message below. Interestingly, the job exits with Exit Code 0

[PB Info 2023-Oct-17 02:52:14] ProgressMeter -  Current-Locus   Elapsed-Minutes Regions-Processed       Regions/Minute
[PB Debug 2023-Oct-17 02:52:15][src/assembleRegions_GPU.cu:333] The gpu memory used by Stage 0 is 3097207948
[PB Debug 2023-Oct-17 02:52:15][src/assembleRegions_GPU.cu:414] The cpu memory used by Stage 0 is 810178340
[PB Error 2023-Oct-17 02:52:15][src/likehood_test.cu:951] cudaSafeCall() failed: out of memory, exiting.
aSafeCall() failed: out of memory, exiting.
023-Oct-17 02:52:15][src/likehood_test.cu:951] cudaSafeCall() failed: out of memory, exiting.

The command used with haplotypecaller is shown below and the --in-bam file is the file that was output from fq2bam

pbrun haplotypecaller \
    --ref hs38DH.fa \
    --in-bam HA2WPADXX_2.pb.bam \
    --in-recal-file HA2WPADXX_2.pb.BQSR-REPORT.txt \
    --out-variants "HA2WPADXX_2.HAPLOTYPECALLER.pb.vcf" \
    --logfile "HA2WPADXX_2.HAPLOTYPECALLER.log.txt" \
    --run-partition \
    --gpu-num-per-partition 1 \
    --verbose --x3 \
    --num-gpus 4

fq2bam had a --low-memory option. Is there something similar for haplotypecaller? Also, do I need to change/enable --htvc-low-memory and/or --num-htvc-threads to overcome this error?

FWIW, the same --in-bam file works well with deepvariant

pbrun deepvariant \
    --ref hs38DH.fa \
    --in-bam HA2WPADXX_2.pb.bam \
    --out-variants "HA2WPADXX_2.DEEPVARIANT.pb.vcf" \
    --logfile "HA2WPADXX_2.DEEPVARIANT.log.txt" \
    --num-streams-per-gpu 4 \
    --run-partition \
    --gpu-num-per-partition 1 \
    --verbose --x3 \
    --num-gpus 4

Thanks in advance.

avenkatraman · October 23, 2023, 11:30am

Hi Parabricks developers,

Just checking in on the above - any suggestions on how to circumvent this. Thanks in advance.

The haplotypecaller link in my original post does not work - here is the changed link - haplotypecaller - NVIDIA Docs

gburnett · October 23, 2023, 8:23pm

for resolution see: post

system · November 6, 2023, 8:24pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Robin_hood::map overflow in GATK HaplotypeCaller (parabricks 4.0.0) Parabricks ai	6	993	November 13, 2023
Pbrun haplotypecaller running error Parabricks	0	32	December 12, 2024
Parabricks:4.0.0-1 Illegal instruction (core dumped) in haplotypecaller step Parabricks ai	0	871	July 20, 2023
Error on haplotypecaller after minimap2 Parabricks ai	0	709	April 23, 2024
Several sample usage in parabricks haplotypecaller Parabricks ai	0	33	September 4, 2024
Encountering Bugs/Errors with Germline pipeline - Seeking Help! Parabricks ai , fq2bam	5	78	January 17, 2025
Haplotypecaller producing an empty VCF file Parabricks inception	7	498	October 9, 2024
Haplotypecaller batch mode Parabricks ai	2	1011	July 17, 2023
Error/Bug In running HaplotypeCaller Parabricks	0	739	February 20, 2024
[Question]: What does htvc stand for in haplotypecaller - Parabricks 4.2.0-1 Parabricks ai	7	1250	October 23, 2023

Parabricks 4.2.0-1 haplotypecaller error - cudaSafeCall() failed - out of memory

Related topics