Hi there,
I would like to know if Parabricks supports the GRCh38 reference sequence, as the GRCh38 RefSeq contains not only ATCG+N but also B, K, M, R, S, W, Y bases. I could not find any relevant information in the documentation, and the Homo_sapiens_assembly38.fasta
provided by NVIDIA uses UCSC bases (which only uses ATCG+N bases).
GRCh38.p14: GCF_000001405.40_GRCh38.p14_genomic.fna
A: 558619211
B: 2
C: 413530454
G: 413917617
K: 8
M: 8
N: 161611379
R: 29
S: 5
T: 559373567
W: 15
Y: 36
a: 364497992
c: 229022463
g: 231314379
t: 366543471
- total: 3,298,430,636 (3298430636) bases
hg38.p14.fa
A: 446356635
C: 304761492
G: 305028912
N: 161608333
T: 447005205
a: 476942826
c: 338004625
g: 340408553
n: 3149
t: 479090309
- total: 3,299,210,039 (3299210039) bases