| Literature DB >> 24935557 |
Albert M Levin1, Indra Adrianto, Indrani Datta, Michael C Iannuzzi, Sheri Trudeau, Paul McKeigue, Courtney G Montgomery, Benjamin A Rybicki.
Abstract
BACKGROUND: The expense of human leukocyte antigen (HLA) allele genotyping has motivated the development of imputation methods that use dense single nucleotide polymorphism (SNP) genotype data and the region's haplotype structure, but the performance of these methods in admixed populations (such as African Americans) has not been adequately evaluated. We compared genotype-based-derived from both genome-wide genotyping and targeted sequencing-imputation results to existing allele data for HLA-DRB1, -DQB1, and -DPB1.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24935557 PMCID: PMC4074844 DOI: 10.1186/1471-2156-15-72
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Comparison of allele imputation accuracy in ACCESS European Americans for class II genes , − , and between HLA*IMP:02* and HIBAG
| | | ||||||
|---|---|---|---|---|---|---|---|
| | | ||||||
| One | 480 | 98.5 | 98.9 | 440 | 91.7 | 99.8 | |
| | Two | 480 | 90.4 | 92.4 | 440 | 91.7 | 94.4 |
| One | 480 | 99.5 | 99.7 | 476 | 99.2 | 99.7 | |
| | Two | 480 | 97.1 | 97.0 | 476 | 99.2 | 97.2 |
| One | 480 | 83.0 | 93.2 | 469 | 97.7 | 93.6 | |
| Two | 480 | 82.1 | 92.7 | 469 | 97.7 | 93.2 | |
Abbreviations: PP denotes, posterior prediction probability; N, count; Accuracy, imputation accuracy.
*HLA*IMP:02 is described in Dilthey et al. 2013.
†HIBAG European ancestry model (published online 8/10/2012).
‡One- and two-field estimates will be very similar for HLA-DPB1 as the first field uniquely identifies the two-field alleles, with the exceptions of *02:01 and *0202, and *04:01 and *04:02.
allele imputation accuracy in ACCESS African Americans for class II genes , − , and
| | | | |||||
|---|---|---|---|---|---|---|---|
| | | | |||||
| HLA*IMP:02* | One | 325 | 93.2 | 268 | 82.5 | 94.8 | |
| | | Two | 325 | 83.8 | 268 | 82.5 | 87.3 |
| | One | 325 | 96.3 | 297 | 91.4 | 97.3 | |
| | | Two | 325 | 93.2 | 297 | 91.4 | 94.6 |
| | One | — | — | — | — | — | |
| | | Two | — | — | — | — | — |
| HIBAG† | One | 325 | 89.5 | 135 | 41.5 | 98.1 | |
| | | Two | 325 | 78.8 | 135 | 41.5 | 94.4 |
| | One | 325 | 96.6 | 170 | 52.3 | 98.8 | |
| | | Two | 325 | 87.1 | 170 | 52.3 | 92.4 |
| | One | 325 | 69.5 | 111 | 34.2 | 89.2 | |
| | | Two | 325 | 69.4 | 111 | 34.2 | 89.2 |
| HIBAG ACCESS‡ | One | 164 | 95.4 | 113 | 68.9 | 99.1 | |
| | | Two | 164 | 89.6 | 113 | 68.9 | 96.0 |
| | One | 188 | 95.4 | 166 | 88.3 | 99.7 | |
| | | Two | 188 | 97.6 | 166 | 88.3 | 99.1 |
| | One | 250 | 87.4 | 147 | 58.8 | 98.3 | |
| | | Two | 250 | 86.4 | 147 | 58.8 | 98.0 |
| BEAGLE ACCESS‡ | One | 164 | 78.7 | — | — | — | |
| | | Two | 164 | 72.3 | — | — | — |
| | One | 188 | 96.0 | — | — | — | |
| | | Two | 188 | 97.9 | — | — | — |
| | One | 250 | 65.2 | — | — | — | |
| Two | 250 | 64.4 | — | — | — | ||
Abbreviations: PP denotes, posterior prediction probability; N, count; Accuracy, imputation accuracy.
*HLA*IMP:02 is described in Dilthey et al. 2013. In this version of the software, reference data for HLA-DPB1 was not available for the African ancestry reference panel.
†HIBAG African ancestry model (published online 8/10/2012).
‡The models trained on the subset of ACCESS African Americans with training samples selected at random (from a total of 325) to match the number of subjects used to construct the published HIBAG African ancestry models: HLA-DRB1 (n = 161), HLA-DQB1 (n = 137), and HLA-DPB1 (n = 75).
§One- and two-field estimates will be very similar for HLA-DPB1 as the first field uniquely identifies the two-field alleles, with the exceptions of *02:01 and *0202, and *04:01 and *04:02.
Figure 1African American allele prediction sensitivity for two-field class II genes by allele frequency using the ACCESS HIBAG models. Sensitivity is equal to the probability that the predicted allele matches the actual genotyped allele prediction (i.e. true positive/(true positive + false negative)).
HIBAG model* two-field allele imputation accuracy in the ACCESS African American gene-specific training sets by posterior probability threshold of the most likely genotype call
| | ||||||
|---|---|---|---|---|---|---|
| >0.10 | 163 | 89.8 | 188 | 97.6 | 249 | 86.3 |
| >0.20 | 160 | 90.3 | 187 | 97.6 | 239 | 87.9 |
| >0.30 | 149 | 92.2 | 184 | 97.8 | 213 | 90.6 |
| >0.40 | 136 | 94.4 | 175 | 98.9 | 174 | 95.4 |
| >0.50 | 113 | 95.8 | 166 | 99.1 | 147 | 98.0 |
| >0.60 | 93 | 97.3 | 156 | 99.4 | 121 | 97.9 |
| >0.70 | 70 | 98.6 | 142 | 99.3 | 87 | 97.1 |
| >0.80 | 53 | 98.1 | 116 | 99.1 | 38 | 96.1 |
| >0.90 | 28 | 96.4 | 74 | 99.3 | 13 | 96.2 |
Abbreviations: N denotes, count; Accuracy, imputation accuracy.
*The models trained on the subset of ACCESS African Americans with training samples selected at random (from a total of 325) to match the number of subjects used to construct the published HIBAG African ancestry models: HLA-DRB1 (n = 161), HLA-DQB1 (n = 137), and HLA-DPB1 (n = 75).
HIBAG model* four-digit HLA allele prediction accuracy in ACCESS African Americans differs by local ancestry at HLA class II genes
| DRB1 | 2 | 240 | 92.9 | 0.005 |
| | 1 | 78 | 80.8 | |
| | 0 | 10 | 80.0 | |
| DQB1 | 2 | 292 | 98.6 | 0.040 |
| | 1 | 76 | 93.4 | |
| | 0 | 8 | 100.0 | |
| DPB1 | 2 | 376 | 89.6 | 8.8*10-4 |
| | 1 | 110 | 75.5 | |
| 0 | 14 | 85.7 |
Abbreviations: N=count; ACC=classification accuracy; P=p-value from a Fishers exact test.
*The models trained on the subset of ACCESS African Americans with training samples selected at random (from a total of 325) to match the number of subjects used to construct the published HIBAG African ancestry models: HLA-DRB1 (n=161), HLA-DQB1 (n=137), and HLA-DPB1 (n=75).
†Number of West African ancestral alleles at each gene estimated by the Local Ancestry in Admixed Populations (LAMP) method.
‡Test for difference in imputation accuracy proportions across individuals with 0, 1, or 2 West African Alleles at locus in question.
Local ancestry differences in two-field allele imputation accuracy resolved by HIBAG models containing equal numbers* of ancestral West African and European chromosomes
| 2 | 91 | 88.5 | 0.12 | |
| | 1 | 75 | 81.3 | |
| | 0 | 9 | 94.4 | |
| | Overall | 175 | 85.7 | |
| 2 | 91 | 95.1 | 1.00 | |
| | 1 | 75 | 95.3 | |
| | 0 | 9 | 100.0 | |
| | Overall | 175 | 95.4 | |
| 2 | 91 | 89.0 | 0.92 | |
| | 1 | 75 | 89.6 | |
| | 0 | 9 | 94.4 | |
| Overall | 175 | 90.6 |
Abbreviations: N denotes, count; Accuracy, imputation accuracy; P, p-value from a Fishers exact test.
*150 individuals (300 chromosomes) from African American with two ancestral West African alleles at each gene as estimated by LAMP and 150 individuals from the ACCESS European American sample (i.e. 300 European chromosomes). The test set for these models included the remaining 175 African Americans who were not a part of the 150 used to build the model.
†Number of West African ancestral alleles at each gene estimated by the Local Ancestry in Admixed Populations (LAMP) method.
‡Test for difference in imputation accuracy proportions across individuals with 0, 1, or 2 West African alleles at each gene.
The effect of adding imputed SNPs to the HIBAG allele imputation accuracy in ACCESS African Americans for class II genes
| | |||||||
|---|---|---|---|---|---|---|---|
| | | | |||||
| ACCESS Subset* - GWAS + 1000GP Imputed dataa | One | 164 | 95.4 | 149 | 90.9 | 97.0 | |
| | Two | 164 | 90.2 | 123 | 75.0 | 95.9 | |
| One | 188 | 99.2 | 185 | 98.4 | 99.5 | ||
| | Two | 188 | 98.1 | 168 | 89.4 | 99.4 | |
| | One | 250 | 87.3 | 160 | 64.2 | 97.5 | |
| | | Two | 250 | 91.8 | 151 | 60.6 | 97.0 |
| ACCESS Subset* - GWAS + 1000GP + TargetSeq Imputed datab | One | 164 | 94.8 | 147 | 89.6 | 97.0 | |
| | Two | 164 | 89.6 | 129 | 78.7 | 95.0 | |
| One | 188 | 99.2 | 186 | 98.9 | 99.2 | ||
| Two | 188 | 97.9 | 171 | 91.0 | 99.4 | ||
Abbreviations: PP denotes, posterior prediction probability; N, count; Accuracy, imputation accuracy.
*The models trained on the subset of ACCESS African Americans genotype data from a)GWAS + 1000 Genome project imputed data, b)GWAS + 1000 Genome project + targeted sequencing imputed data (DRB1 and DQB1 only) with training samples selected at random (from a total of 325) using to match the number of subjects used to construct the published HIBAG African ancestry models: HLA-DRB1 (n=161), HLA-DQB1 (n=137), and HLA-DPB1 (n=75).
†One- and two-field estimates will be very similar for HLA-DPB1 as the first field uniquely identifies the two-field alleles, with the exceptions of *02:01 and *0202, and *04:01 and *04:02.