| Literature DB >> 25708870 |
Naoki Nariai, Kaname Kojima, Sakae Saito, Takahiro Mimori, Yukuto Sato, Yosuke Kawai, Yumi Yamaguchi-Kabata, Jun Yasuda, Masao Nagasaki.
Abstract
BACKGROUND: Human leucocyte antigen (HLA) genes play an important role in determining the outcome of organ transplantation and are linked to many human diseases. Because of the diversity and polymorphisms of HLA loci, HLA typing at high resolution is challenging even with whole-genome sequencing data.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25708870 PMCID: PMC4331721 DOI: 10.1186/1471-2164-16-S2-S7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Overview of the HLA-VBSeq pipeline. Given read data, the human reference genome sequence, and the HLA allele reference sequences, HLA-VBSeq estimates the expected read counts on HLA alleles for HLA typing.
Figure 2The generative model of read data in HLA-VBSeq. The read abundance parameter, indicator variable for the HLA allele choice, nucleotide sequence of the first and second pair of read n are represented by , Z, and , respectively.
Prediction accuracy of HLA-VBSeq and existing tools for HLA typing in the 30x simulation data analysis.
| HLA resolution | HLA-VBSeq | PHLAT | HLAminer |
|---|---|---|---|
| 8-digit | 99.94% | - | - |
| 6-digit | 99.95% | 80.80% | - |
| 4-digit | 99.95% | 88.75% | 50.12% |
| 2-digit | 100% | 96.39% | 77.82% |
The accuracies of each method are calculated at 8-digit, 6-digit, 4-digit, and 2-digit resolution from the 30x simulation data. PHLAT only predicts HLA types at 6-digit, 4-digit and 2-digit resolution, and HLAminer only predicts HLA types at 4-digit and 2-digit resolution.
Figure 3Prediction accuracy of HLA-VBSeq and existing tools for HLA typing at 4-digit resolution for 5x, 10x, 20x and 30x simulation data. The accuracies of each method are calculated at 4-digit resolution from 5x, 10x, 20x and 30x simulation data. HLA-VBSeq outperformed others for all the depth of coverage.
Predicted HLA types of the CEU trio samples with HLA-VBSeq.
| Sample | Predicted HLA types | |
|---|---|---|
| NA12878 (child) | ||
| DQA1*01:01:02 | DQA1*05:01:01:02 | |
| DQB1*02:01:01 | DQB1*05:01:01:02 | |
| DRB1*03:01:01:01 | DRB1*01:01:01 | |
| NA12891 (father) | ||
| DQA1*01:02:01:01 | DQA1*05:01:01:02 | |
| DQB1*02:01:01 | DQB1*06:02:01 | |
| DRB1*03:01:01:01 | DRB1*15:01:01:02 | |
| NA12892 (mother) | ||
| DQA1*01:01:02 | DQA1*01:01:02 | |
| DQB1*05:01:01:02 | DQB1*05:01:01:01 | |
| DRB1*01:01:01 | DRB1*01:01:01 | |
Predicted HLA types of HLA-A, -B, and -C loci with HLA-VBSeq were experimentally validated at 4-digit resolution in [10], which is shown in bold text. Predicted HLA types of HLA-DQA1, -DQB1 and -DRB1 loci with HLA-VBSeq were coincided with those with PHLAT at 6-digit resolution except DQA1*01:01:02 (one allele in NA12878 and two alleles in NA12892). PHLAT instead predicted them as DQA1*01:01:01, whose genomic sequence was missing in the IMGT database release 3.15.0., and hence HLA-VBSeq could not predict the HLA allele in our experimental condition.