Does Parabricks support the GRCh38 RefSeq?

Hi there,

I would like to know if Parabricks supports the GRCh38 reference sequence, as the GRCh38 RefSeq contains not only ATCG+N but also B, K, M, R, S, W, Y bases. I could not find any relevant information in the documentation, and the Homo_sapiens_assembly38.fasta provided by NVIDIA uses UCSC bases (which only uses ATCG+N bases).

GRCh38.p14: GCF_000001405.40_GRCh38.p14_genomic.fna

A: 558619211
B: 2
C: 413530454
G: 413917617
K: 8
M: 8
N: 161611379
R: 29
S: 5
T: 559373567
W: 15
Y: 36
a: 364497992
c: 229022463
g: 231314379
t: 366543471
  • total: 3,298,430,636 (3298430636) bases

hg38.p14.fa

A: 446356635
C: 304761492
G: 305028912
N: 161608333
T: 447005205
a: 476942826
c: 338004625
g: 340408553
n: 3149
t: 479090309
  • total: 3,299,210,039 (3299210039) bases

Hey @tj_tsai, our examples use UCSC but GRCh38 is also supported.

1 Like