| Literature DB >> 26287376 |
Jill A Hollenbach1, Aliya Saperstein2, Mark Albrecht3, Cynthia Vierra-Green4, Peter Parham5, Paul J Norman5, Martin Maiers3.
Abstract
We conducted a nationwide study comparing self-identification to genetic ancestry classifications in a large cohort (n = 1752) from the National Marrow Donor Program. We sought to determine how various measures of self-identification intersect with genetic ancestry, with the aim of improving matching algorithms for unrelated bone marrow transplant. Multiple dimensions of self-identification, including race/ethnicity and geographic ancestry were compared to classifications based on ancestry informative markers (AIMs), and the human leukocyte antigen (HLA) genes, which are required for transplant matching. Nearly 20% of responses were inconsistent between reporting race/ethnicity versus geographic ancestry. Despite strong concordance between AIMs and HLA, no measure of self-identification shows complete correspondence with genetic ancestry. In certain cases geographic ancestry reporting matches genetic ancestry not reflected in race/ethnicity identification, but in other cases geographic ancestries show little correspondence to genetic measures, with important differences by gender. However, when respondents assign ancestry to grandparents, we observe sub-groups of individuals with well- defined genetic ancestries, including important differences in HLA frequencies, with implications for transplant matching. While we advocate for tailored questioning to improve accuracy of ancestry ascertainment, collection of donor grandparents' information will improve the chances of finding matches for many patients, particularly for mixed-ancestry individuals.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26287376 PMCID: PMC4545604 DOI: 10.1371/journal.pone.0135960
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Survey respondent demographics.
| Total | Female | Male | Mean age | |
|---|---|---|---|---|
|
| 1752 | 1324 | 428 | 31 |
|
| ||||
| African American | 27 | 22 | 5 | 30 |
| White | 1414 | 1090 | 324 | 31 |
| Asian or Pacific Islander | 65 | 33 | 32 | 30 |
| Hispanic | 100 | 70 | 30 | 31 |
| Native American | 4 | 4 | 0 | 30 |
| Multi-race | 142 | 105 | 37 | 30 |
|
| ||||
| No | 100 | 61 | 39 | 33 |
| Yes | 1647 | 1259 | 388 | 30 |
|
| ||||
| Neither | 159 | 95 | 64 | 31 |
| One | 145 | 114 | 31 | 31 |
| Both | 1444 | 1113 | 331 | 31 |
|
| ||||
| None | 177 | 111 | 66 | 32 |
| One | 28 | 23 | 5 | 29 |
| Two | 172 | 134 | 38 | 32 |
| Three | 133 | 102 | 31 | 29 |
| Four | 1225 | 944 | 281 | 31 |
Mean genetic ancestry proportions determined via ancestry informative markers and HLA haplotype origin frequencies for subpopulations defined by reported race/ethnicity or geographic ancestry.
| N |
|
| |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| African | European | Asian | Amerindian | African American | European | Asian | Amerindian | Multi-origin | |||
|
|
| ||||||||||
|
| 27 | 0.70 | 0.27 | 0.03 | 0.01 | 63% | 11% | 4% | - | 22% | |
|
| 1443 | 0.00 | 0.99 | 0.00 | 0.00 | - | 82% | - | - | 17% | |
|
| 64 | 0.01 | 0.22 | 0.76 | 0.01 | - | - | 97% | - | 3% | |
|
| 71 | 0.05 | 0.63 | 0.05 | 0.28 | - | 13% | 1% | 52% | 34% | |
|
| 2 | 0.00 | 0.99 | 0.01 | 0.00 | - | 50% | - | - | 50% | |
|
| 2 | 0.01 | 0.00 | 0.97 | 0.02 | - | - | 100% | - | - | |
|
|
| ||||||||||
|
| 15 | 0.71 | 0.25 | 0.03 | 0.02 | 67% | 13% | 7% | - | 13% | |
|
| 1236 | 0.00 | 0.99 | 0.00 | 0.00 | - | 82% | - | - | 17% | |
|
| 63 | 0.01 | 0.21 | 0.77 | 0.01 | - | - | 97% | - | 3% | |
|
| 6 | 0.00 | 0.91 | 0.01 | 0.08 | - | 20% | - | - | 80% | |
|
| 36 | 0.04 | 0.51 | 0.07 | 0.38 | - | - | - | 72% | 28% | |
|
| 17 | 0.19 | 0.71 | 0.03 | 0.07 | 6% | - | 6% | 12% | 77% | |
|
|
|
|
|
|
|
|
|
|
|
| |
|
| 13 | 0.36 | 0.60 | 0.03 | 0.02 | 54% | 8% | - | - | 39% | |
|
| 120 | 0.03 | 0.85 | 0.06 | 0.05 | 3% | 48% | 7% | 6% | 36% | |
|
| 18 | 0.00 | 0.66 | 0.31 | 0.02 | - | 22% | 33% | 6% | 39% | |
|
| 46 | 0.03 | 0.84 | 0.03 | 0.11 | 4% | 37% | - | 13% | 46% | |
|
| 53 | 0.04 | 0.91 | 0.02 | 0.03 | 6% | 66% | 2% | 4% | 23% | |
|
| 5 | 0.01 | 0.71 | 0.23 | 0.05 | - | 20% | 60% | - | 20% | |
|
|
|
|
|
|
|
|
|
|
|
| |
|
| 27 | 0.40 | 0.57 | 0.02 | 0.02 | 41% | 22% | - | - | 37% | |
|
| 318 | 0.02 | 0.91 | 0.03 | 0.04 | 2% | 64% | 4% | 5% | 25% | |
|
| 25 | 0.01 | 0.70 | 0.28 | 0.02 | - | 28% | 36% | 4% | 32% | |
|
| 49 | 0.04 | 0.91 | 0.02 | 0.03 | 2% | 72% | 1% | 3% | 23% | |
|
| 68 | 0.05 | 0.78 | 0.02 | 0.15 | 6% | 40% | - | 22% | 32% | |
|
| 21 | 0.19 | 0.75 | 0.04 | 0.02 | 14% | 48% | 5% | - | 33% | |
|
| |||||||||||
|
| 31 | 0.71 | 0.25 | 0.03 | 0.01 | 68% | 10% | - | - | 23% | |
|
| 1545 | 0.00 | 0.98 | 0.01 | 0.01 | - | 80% | - | - | 18% | |
|
| 70 | 0.01 | 0.24 | 0.74 | 0.01 | - | 7% | 90% | 1% | 1% | |
|
| 73 | 0.05 | 0.63 | 0.05 | 0.28 | 1% | 10% | 1% | 52% | 36% | |
|
| 6 | 0.01 | 0.74 | 0.11 | 0.14 | - | 67% | - | - | 33% | |
|
| 3 | 0.01 | 0.67 | 0.31 | 0.01 | - | 33% | 33% | - | 33% | |
|
| 23 | 0.10 | 0.71 | 0.13 | 0.05 | 9% | 26% | 26% | 4% | 35% | |
* All individuals have two HLA haplotypes. Multi-origin haplotype classification indicates that one of the individual’s haplotypes is closely associated with one continental origin while the other haplotype is associated with a different continental origin.
Correlation between self-identification measures and HLA origins and genetic ancestry.
| Self-identification measure | HLA origins | Genetic ancestry (AIMs) | ||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
|
| ||||||||
| Black or African American | 0.71 (0.68–0.74) | 0.90 (0.89–0.91) | ||||||
| White | 0.74 (0.72–0.77) | 0.77 (0.74–0.79) | ||||||
| Asian | 0.84 (0.82–0.85) | 0.84 (0.82–0.86) | ||||||
| Hispanic or Latino | 0.58 (0.54–0.62) | 0.66 (0.63–0.69) | ||||||
| Native Hawaiian or other Pacific Islander | 0.14 (0.09–0.19) | 0.15 (0.10–0.21) | ||||||
|
| ||||||||
| African or African American | 0.69 (0.66–0.72) | 0.82 (0.80–0.83) | ||||||
| European | 0.72 (0.69–0.75) | 0.75 (0.73–0.77) | ||||||
| Asian | 0.83 (0.81–0.85) | 0.83 (0.81–0.85) | ||||||
| Latin American | 0.54 (0.50–0.58) | 0.69 (0.66–0.72) | ||||||
| Caribbean | 0.13 (0.07–0.18) | 0.20 (0.14–0.25) | ||||||
|
| ||||||||
| African or African American | 0.72 (0.69–0.74) | 0.86 (0.85–0.88) | ||||||
| European | 0.77 (0.75–0.79) | 0.81 (0.79–0.83) | ||||||
| Asian | 0.88 (0.87–0.89) | 0.89 (0.88–0.90) | ||||||
| Latin American | 0.62 (0.59–0.66) | 0.80 (0.78–0.82) | ||||||
| Caribbean | 0.10 (0.05–0.16) | 0.17 (0.120.23) | 0.18 (0.12–0.23) | |||||
The correlation coefficient (r) is given with 95% confidence intervals (parentheses). Only statistically significant (p<0.05 after correction) values are shown.
Fig 1African and Amerindian genetic ancestry proportions (a,c) and HLA origin frequencies (b,d) are correlated with the number of grandparents with reported African and Latin American ancestry, respectively.
Genetic ancestry proportions were estimated from AIMs data using Structure and the HGDP reference set. For this analysis, k = 4, reflecting broad continental ancestry. For each subpopulation defined by the number of respondents’ grandparents with Sub-Saharan African/African American or Latin American ancestry, the percentage of individuals with zero, one or two African or Amerindian HLA haplotypes, respectively, was calculated. Individuals reporting one, two, three or four African-ancestry grandparents: n = 4; 4; 2; 21, respectively. Individuals reporting one, two, three or four Latin American-ancestry grandparents: n = 16; 21; 7; 38, respectively.