A research colleague at our university has acquired a MinION sequencer from Oxford Nanopore Technologies and would like to apply for an NVIDIA research grant for a suitable GPU-based platform for basecalling the .fast5 files produced by the MinION, and perhaps for performing alignments on both MinION and MiSeq sequencing data.
I’d appreciate any constructive input on the suitability of the Clara AGX (or perhaps other NVIDIA) platform for the above purposes, thanks.
Initially, using ONT’s guppy_basecaller software we’d like to process a semi-weekly run consisting of ~400GB from the MinION in ~3 days or less.
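For a rough sense of scale, the target above (~400GB in ~3 days) can be turned into a sustained input rate. This is only a back-of-the-envelope sketch: the function name is made up for illustration, it assumes decimal units (1 GB = 1000 MB), and file size is only a rough proxy, since basecaller speed is normally quoted in samples or bases per second rather than bytes.

```python
def required_throughput_mb_per_s(data_gb: float, days: float) -> float:
    """Sustained input rate (decimal MB/s) needed to process data_gb within days."""
    seconds = days * 24 * 60 * 60  # days -> seconds
    return data_gb * 1000 / seconds  # GB -> MB, then divide by the time budget


rate = required_throughput_mb_per_s(400, 3)
print(f"{rate:.2f} MB/s")  # prints "1.54 MB/s"
```

So the batch target works out to roughly 1.5 MB of fast5 input per second, sustained over the three days.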
Would you be able to describe what tools you use for alignment and any other processing steps you might be running (e.g. assembly, polishing, or variant analysis) and what organisms you are working with?
Additionally, GPUs make it possible to basecall in real time with a MinION (and other ONT hardware), so you would not need to run a batch-based weekly job and would have fasta files available immediately. This also reduces the amount of storage needed. Is that something of interest?
Thanks @Emmett for your reply.
Re organisms, I can only say mammalian for now. Re other processing steps, I’m not sure my colleague is interested in assembly so much as the other types you’ve mentioned. Thus, any software recommendations would be appreciated.
Also, I suspect that the high accuracy requirement for running guppy_basecaller would prevent us from base calling in realtime, true?
You should be able to run the high accuracy basecaller in real time for a MinION on the majority of our GPUs. Based on external benchmarks, a Jetson Xavier AGX should be fast enough for real-time basecalling.
At the moment we release GPU accelerated alignment tools in Parabricks and GenomeWorks, but these are suited for x86 workstations so a traditional GPU (like a V100, A100, etc) would be the best. These libraries contain other common genomics pipeline tools accelerated for the GPU and are continuing to grow.
Thanks. So for running basecalling as well as tools downstream from basecalling, would you recommend, say, A100 over the Jetson Xavier AGX? If so, what features for the x86 workstation?
There unfortunately isn’t a one size fits all for the variety of workloads that exist in this space.
A Jetson AGX would be able to basecall in realtime, which means you no longer need to store fast5 files, and can keep the significantly smaller fasta/fastq files around. This can have large bandwidth and storage savings.
Some secondary analysis steps run well on a Jetson AGX, such as running Kraken2 with one of our reference docker images, while others require 8 GPUs or even cluster setups. I’d suggest reaching out on the Parabricks forums for more information about secondary analysis pipelines using x86.
Hope this helps,
I’m going to weigh in here if I may. Sorry, I don’t frequent these boards much (at all), but I have a lot of experience with Oxford Nanopore sequencing and GPU compute. First a collection of resources that may be useful to you:
OK, so that’s an overview of some hopefully useful bits and bobs. I would now like to comment on a few things. Firstly, you really want to hold on to your fast5 files; doing so allows you to return to them again and again as models improve accuracy and allow the detection of additional base modifications. Real-time basecalling while sequencing is awesome, and something like an RTX 3060 can keep up easily in high accuracy mode (HAC), even in super high accuracy (SUP) with some tweaking. But you will always want to go back and basecall again after the fact. The Jetson Xavier AGX will not keep up with HAC calling in real time (we’ve tried), but in FAST mode it’s great; you can actually run 2x MinIONs. The Clara AGX could easily keep up, but it’s not widely available yet. While the A10 is an amazing card, I would advise spending that amount of money on other things. Something like a 3080 Ti or a 3090 is more than enough for 95% of people (probably 99.9% of people to be honest).
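To illustrate going back to stored fast5 files later, here is a minimal sketch that only builds and prints a guppy_basecaller command line, without running it. The helper name, the directory paths, and the default config (a HAC config for R9.4.1 flow cells) are assumptions for illustration; pick the config that matches your flow cell and kit.

```python
import shlex
from pathlib import Path


def build_guppy_command(fast5_dir, out_dir,
                        config="dna_r9.4.1_450bps_hac.cfg",  # assumed flow cell/kit
                        device="cuda:0"):
    """Return the argument list for re-basecalling a directory of fast5 files."""
    return [
        "guppy_basecaller",
        "--input_path", str(Path(fast5_dir)),
        "--save_path", str(Path(out_dir)),
        "--config", config,
        "--device", device,
    ]


# Hypothetical paths; nothing is executed here, the command is only printed.
cmd = build_guppy_command("runs/run01/fast5", "runs/run01/recall_hac")
print(shlex.join(cmd))
```

Writing each re-call to its own output directory keeps the results from older and newer models side by side for comparison.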
Sorry for the long post, happy to comment more if useful.