| Literature DB >> 31173059 |
Rose Orenbuch1,2, Ioan Filip1, Devon Comito3, Jeffrey Shaman3, Itsik Pe'er2, Raul Rabadan1.
Abstract
MOTIVATION: The human leukocyte antigen (HLA) locus plays a critical role in tissue compatibility and regulates the host response to many diseases, including cancers and autoimmune di3orders. Recent improvements in the quality and accessibility of next-generation sequencing have made HLA typing from standard short-read data practical. However, this task remains challenging given the high level of polymorphism and homology between HLA genes. HLA typing from RNA sequencing is further complicated by post-transcriptional modifications and bias due to amplification.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31173059 PMCID: PMC6956775 DOI: 10.1093/bioinformatics/btz474
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Concordance with 1000 genomes gold-standard HLA typing for 358 RNA-sequencing samples for arcasHLA along with concordance rates for other tools reported in (Buchkovich )
| Gene | OptiType | seq2HLA | PHLAT | HLAProfiler | arcasHLA |
|---|---|---|---|---|---|
| A | 99.6% | 98.6% | 99.4% | 99.9% |
|
| B | 99.4% | 94.8% | 93.4% | 99.0% |
|
| C |
| 95.1% | 94.3% | 99.6% |
|
| DQB1 | — | 96.0% | 96.0% |
|
|
| DRB1 | — | 98.5% | 98.5% | 99.6% |
|
Note: Bold denotes maximized concordance.
Concordance of calls for arcasHLA, OptiType and HISAT-genotype with xHLA for 447 RNA samples from 69 individuals
| Input (#) | RNA (447) | WES (69) | ||
|---|---|---|---|---|
| Tool | arcasHLA | OptiType | OptiType | HISAT |
| A |
| 95.2% | 98.6% | 99.4% |
| B |
| 94.5% | 96.4% | 98.6% |
| C |
| 97.4% | 98.6% | 100.0% |
| DPB1 |
| — | — | — |
| DQB1 |
| — | — | 94.9% |
| DRB1 |
| — | — | 94.2% |
Note: Bold denotes maximized concordance for RNA sequencing results.
Fig. 1.Overview of arcasHLA pipeline from alignment to genotyping. The HLA de Bruijn graph was generated with Velvet (Zerbino and Birney, 2008) and visualized with Bandage (Wick )
Fig. 2.Runtime analysis on 30 randomly selected samples from 1000 Genomes dataset for arcasHLA (extract and genotype steps, and overall runtime) and HLAProfiler
Fig. 3.Concordance rates restricted to Virome samples below a threshold for (a) RIN and (b) log-scaled reads by HLA gene, truncated when the number of samples dropped below 55, approximately one-eighth the total sample size. Panels (c) and (d) show the number of samples remaining