Konstantin Kozlov, Dmitri Chebotarev, Mehedi Hassan, Martin Triska, Petr Triska, Pavel Flegontov, Tatiana V Tatarinova.
Abstract
The genetic structure of human populations is extraordinarily complex and of fundamental importance to studies of anthropology, evolution, and medicine. As increasingly many individuals are of mixed origin, there is an unmet need for tools that can infer multiple origins. Misclassification of such individuals can lead to incorrect and costly misinterpretations of genomic data, primarily in disease studies and drug trials. We present an advanced tool to infer ancestry that can identify the biogeographic origins of highly mixed individuals. reAdmix can incorporate an individual's knowledge of their ancestors (e.g. having some ancestors from Turkey or a Scottish grandmother). reAdmix is an online tool available at http://chcb.saban-chla.usc.edu/reAdmix/.
Year: 2015 PMID: 26111206 PMCID: PMC4480842 DOI: 10.1186/1471-2164-16-S8-S9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Accuracy of reAdmix ancestry predictions for different mixture scenarios from European populations.
| Scenario | Prior | Correct position (%) | At least one correctly predicted origin (%) | Correct populations (%) | Average distance to correct population (km) |
|---|---|---|---|---|---|
| 50 × 50 | none | 100 | 83 | 16 | 505 |
| 50 × 50 | 1 pop. | 100 | 75 | 31 | 8 |
| 50 × 50 | equal weights | 100 | 81 | 26 | 251 |
| 50 × 25 × 25 | none | 98 | 80 | 1 | 572 |
| 50 × 25 × 25 | 1 pop. | 100 | 61 | 2 | 240 |
| 25 × 25 × 25 × 25 | none | 99 | 79 | 0 | 729 |
| 25 × 25 × 25 × 25 | 1 pop. | 100 | 61 | 0 | 427 |
Percentage of mixed ancestral population is given in the "Scenario" column. "Correct position" is defined as a prediction within 320 km of reported location. "Correct populations" is defined as a geographically correct prediction where the method correctly discriminated between neighboring populations.
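The 320 km criterion for a "correct position" can be illustrated with a great-circle distance check. The following is a minimal sketch, not the authors' implementation: the haversine formula, the Earth radius constant, and the function names `haversine_km` and `is_correct_position` are assumptions introduced here for illustration, since the source does not say how distances were computed.

```python
import math

EARTH_RADIUS_KM = 6371.0      # mean Earth radius (assumed constant)
CORRECT_POSITION_KM = 320.0   # threshold quoted in the table note


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def is_correct_position(predicted, reported, threshold_km=CORRECT_POSITION_KM):
    """True if the predicted (lat, lon) lies within the threshold of the reported one."""
    return haversine_km(*predicted, *reported) <= threshold_km


# Hypothetical coordinates: two degrees of latitude apart is roughly 222 km.
print(is_correct_position((50.0, 10.0), (52.0, 10.0)))
```

Under this sketch, the "Correct position (%)" column would be the share of simulated individuals for which `is_correct_position` holds.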
Accuracy of reAdmix ancestry predictions for different mixture scenarios from European populations with error term, ϵ, to simulate variability of admixture proportions within populations.
| Scenario | Error, ϵ | Correct position (%) | At least one correctly predicted origin (%) | Correct populations (%) | Average distance to correct population (km) |
|---|---|---|---|---|---|
| 50 × 50 | 0.01 | 99 | 72 | 6 | 401 |
| 50 × 50 | 0.03 | 99 | 74 | 5 | 363 |
| 50 × 50 | 0.05 | 99 | 73 | 5 | 386 |
| 50 × 25 × 25 | 0.01 | 99 | 81 | 0 | 588 |
| 50 × 25 × 25 | 0.03 | 99 | 79 | 0 | 553 |
| 50 × 25 × 25 | 0.05 | 98 | 79 | 0 | 557 |
| 25 × 25 × 25 × 25 | 0.01 | 99 | 81 | 0 | 600 |
| 25 × 25 × 25 × 25 | 0.03 | 98 | 78 | 0 | 618 |
| 25 × 25 × 25 × 25 | 0.05 | 98 | 80 | 0 | 623 |
Percentage of mixed ancestral population is given in the "Scenario" column. "Correct position" is defined as a prediction within 320 km of reported location. "Correct populations" is defined as a geographically correct prediction where the method correctly discriminated between neighboring populations.
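One way to picture a mixture scenario with an error term is as a weighted average of population admixture vectors with a small perturbation added to each component. The sketch below is an assumption-laden illustration only: the source does not specify the noise model, so the uniform noise in [-ϵ, ϵ], the clipping at zero, the renormalization, and the name `simulate_mixture` are all hypothetical choices.

```python
import random


def simulate_mixture(pop_vectors, weights, eps=0.0, rng=None):
    """Mix population admixture vectors by `weights`, then perturb by up to +/- eps.

    pop_vectors: list of equal-length lists of admixture proportions (each sums to 1).
    weights:     mixing fractions, e.g. [0.5, 0.25, 0.25] for a 50 x 25 x 25 scenario.
    eps:         half-width of the uniform noise (assumed noise model).
    """
    rng = rng or random.Random()
    k = len(pop_vectors[0])
    # Weighted average of the source populations, component by component.
    mixed = [sum(w * v[i] for w, v in zip(weights, pop_vectors)) for i in range(k)]
    # Add noise, clip negatives to zero, and renormalize so proportions sum to 1.
    noisy = [max(0.0, m + rng.uniform(-eps, eps)) for m in mixed]
    total = sum(noisy)
    return [x / total for x in noisy]


# Hypothetical three-component populations in a 50 x 25 x 25 scenario.
pops = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1], [0.2, 0.2, 0.6]]
print(simulate_mixture(pops, [0.5, 0.25, 0.25], eps=0.03, rng=random.Random(0)))
```

With `eps=0.0` the function returns the plain weighted average, which corresponds to the scenarios reported without an error term.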
Accuracy of reAdmix ancestry reconstruction for different mixture scenarios from European and Native American populations.
| Scenario | Condition | Correct position (%) | At least one correctly predicted origin (%) | Correct populations (%) | Average distance to correct population (km) |
|---|---|---|---|---|---|
| 50 × 50 | none | 98 | 89 | 30 | 329 |
| 50 × 50 | 1 pop. | 99 | 87 | 36 | 2 |
| 50 × 50 | equal weights | 99 | 88 | 36 | 135 |
| 50 × 25 × 25 | none | 86 | 81 | 18 | 1390 |
| 50 × 25 × 25 | 1 pop. | 94 | 72 | 4 | 362 |
| 25 × 25 × 25 × 25 | none | 86 | 85 | 0 | 1484 |
| 25 × 25 × 25 × 25 | 1 pop. | 90 | 71 | 0 | 759 |
Percentage of mixed ancestral population is given in the "Scenario" column. "Correct position" is defined as a prediction within 320 km of reported location. "Correct populations" is defined as a geographically correct prediction where the method correctly discriminated between neighboring populations.
Accuracy of reAdmix ancestry predictions for different mixture scenarios from European and Native American populations with error term, ϵ, to simulate variability of admixture proportions within populations.
| Scenario | Error, ϵ | Correct position (%) | At least one correctly predicted origin (%) | Correct populations (%) | Average distance to correct population (km) |
|---|---|---|---|---|---|
| 50 × 50 | 0.01 | 97 | 83 | 12 | 354 |
| 50 × 50 | 0.03 | 97 | 83 | 9 | 391 |
| 50 × 50 | 0.05 | 98 | 84 | 7 | 357 |
| 50 × 25 × 25 | 0.01 | 88 | 80 | 2 | 1156 |
| 50 × 25 × 25 | 0.03 | 85 | 77 | 2 | 1254 |
| 50 × 25 × 25 | 0.05 | 88 | 81 | 1 | 1147 |
| 25 × 25 × 25 × 25 | 0.01 | 85 | 82 | 0 | 1554 |
| 25 × 25 × 25 × 25 | 0.03 | 85 | 82 | 0 | 1526 |
| 25 × 25 × 25 × 25 | 0.05 | 87 | 82 | 0 | 1441 |
Percentage of mixed ancestral population is given in the "Scenario" column. "Correct position" is defined as a prediction within 320 km of reported location. "Correct populations" is defined as a geographically correct prediction where the method correctly discriminated between neighboring populations.
Performance of reAdmix, mSpectrum, HAPMIX and LAMP using two-way admixed individuals.
| Ethnicity | True (%) | reAdmix (%) | mSpectrum (%) | HAPMIX (%) | LAMP (%) |
|---|---|---|---|---|---|
| European | 20 | 20 | 18.9 | 15.7 | 17.1 |
| African | 80 | 80 | 79.5 | 76.7 | 77.8 |
| Nat. American | 0 | 0 | 1.2 | 0.3 | 1.6 |
| East Asian | 0 | 0 | 0.4 | 1.3 | 3.5 |
| Other | 0 | 0 | 0 | 6 | 0 |
Estimation errors for the two-way admixture were 0.01, 1.70, 8.18, and 5.28 for reAdmix, mSpectrum, HAPMIX, and LAMP, respectively.
Performance of reAdmix, mSpectrum, HAPMIX and LAMP using four-way admixed individuals.
| Ethnicity | True (%) | reAdmix (%) | mSpectrum (%) | HAPMIX (%) | LAMP (%) |
|---|---|---|---|---|---|
| European | 79.3 | 79.2 | 83.5 | 68.1 | 63.2 |
| African | 15 | 15 | 13.5 | 13 | 13.5 |
| Nat. American | 3.5 | 3.5 | 2.6 | 2.6 | 8.9 |
| East Asian | 2.2 | 2.3 | 0.4 | 10.4 | 14.4 |
| Other | 0 | 0 | 0 | 5.9 | 0 |
Estimation errors for the four-way admixture were 0.10, 4.89, 15.24, and 20.96 for reAdmix, mSpectrum, HAPMIX, and LAMP, respectively.
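The source does not state how the estimation errors are defined. The reported values for mSpectrum, HAPMIX, and LAMP are, however, consistent with the Euclidean distance between the true and estimated percentage vectors in the four-way table. The sketch below works on that assumption; the function name `estimation_error` is introduced here for illustration. For reAdmix the rounded table entries give roughly 0.14 rather than the reported 0.10, presumably because the published error was computed from unrounded estimates.

```python
import math

# Percentages from the four-way table, in the order:
# European, African, Nat. American, East Asian, Other.
TRUE = [79.3, 15.0, 3.5, 2.2, 0.0]
ESTIMATES = {
    "reAdmix":   [79.2, 15.0, 3.5, 2.3, 0.0],
    "mSpectrum": [83.5, 13.5, 2.6, 0.4, 0.0],
    "HAPMIX":    [68.1, 13.0, 2.6, 10.4, 5.9],
    "LAMP":      [63.2, 13.5, 8.9, 14.4, 0.0],
}


def estimation_error(true, estimated):
    """Euclidean distance between two ancestry vectors (assumed error metric)."""
    return math.sqrt(sum((t - e) ** 2 for t, e in zip(true, estimated)))


errors = {tool: estimation_error(TRUE, est) for tool, est in ESTIMATES.items()}
for tool, err in errors.items():
    print(f"{tool}: {err:.2f}")
```

The same function applied to the two-way table gives values close to the reported 1.70, 8.18, and 5.28, with small differences attributable to rounding in the table.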
Figure 1. Performance of . Color coding: red - European, green - African, yellow - Native American, blue - East Asian, and white - unassigned.
Figure 2. Flowchart of reAdmix.