Oscar Lao, Peter M Vallone, Michael D Coble, Toni M Diegoli, Mannis van Oven, Kristiaan J van der Gaag, Jeroen Pijpe, Peter de Knijff, Manfred Kayser.
Abstract
The current U.S. population represents an amalgam of individuals originating mainly from four continental regions (Africa, Europe, Asia and America). To study the genetic ancestry and compare with self-declared ancestry we have analyzed paternally, maternally and bi-parentally inherited DNA markers sensitive for indicating continental genetic ancestry in all four major U.S. American groups. We found that self-declared U.S. Hispanics and U.S. African Americans tend to show variable degrees of continental genetic admixture among the three genetic systems, with evidence for a marked sex-biased admixture history. Moreover, for these two groups we observed significant regional variation across the country in genetic admixture. In contrast, self-declared U.S. European and U.S. Asian Americans were genetically more homogeneous at the continental ancestry level. Two autosomal ancestry-sensitive markers located in skin pigmentation candidate genes showed significant differences in self-declared U.S. African Americans or U.S. European Americans, relative to their assumed parental populations from Africa or Europe. This provides genetic support for the importance of skin color in the complex process of ancestry identification.
Year: 2010 PMID: 20886636 PMCID: PMC3051415 DOI: 10.1002/humu.21366
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
Genotyping information autosomal SNPs
| Multiplex A | F/R | PCR primer | SBE primer | Orientation | Length (nt) | μM |
|---|---|---|---|---|---|---|
| rs1048610 | F | AGGCAGGTCTCAGAACAATCC | GTGTGCTGCAGGGACCTTTC | F | 20 | 5 |
| | R | GTTCAGCATCGACATAGGGC | | | | |
| rs1876482 | F | GAGCTGTTGATAGAGCTTTTGTGG | ttttttGGCTGTACCCTCACTATTGGTG | R | 28 | 5 |
| | R | ACGTGACACATAAAGAAAATGCCAT | | | | |
| rs2179967 | F | AAGAGTGTGTTGTATGCTTTGGAAA | ttttttCTTTGGAAATGGGTGTGCAACA | F | 28 | 6 |
| | R | TCCTTCCAGCCCGACTAGAAC | | | | |
| rs1858465 | F | GATTTCAAAAAGTCTACAGATTTGG | tttttACTTCCTCTTTAATACTTCAACTGAGT | R | 32 | 7 |
| | R | TGACTTTGTCAAACTTCCTCTTTAA | | | | |
| rs1371048 | F | CTTAAATAGCCAAATAGCTCTAACT | ttttttttttATTTGAGTATGCTCTGTAGATGCTTC | R | 36 | 5 |
| | R | ACAAACGAAATATTTGAGTATGCT | | | | |
| rs1369290 | F | GAGGCCCTACATGACCTGTC | tttttttttttttttACCACAGGCTCTTGATAAAGTGTCT | F | 40 | 5 |
| | R | GGGCTCCTCTTTCGCTCA | | | | |
| rs1465648 | F | ACCAGAAGGAAAGAGAAAAAGCAC | tttttttttttttttttGAAAAAGCACAGTATCAAGTTTGACTT | F | 44 | 6 |
| | R | AACAAACTACAGCAACAGAATCTTT | | | | |
| rs1391681 | F | GAGTAGTTGCTCATGAAGCTGAAAA | ttttttttttttttttttttttTGTCACCCTTTACAAAACAGTTTGCA | F | 48 | 5 |
| | R | GGGCAGCCAAAAATAAAACAAAACA | | | | |
| rs1461227 | F | ACTGGGAAATTCTCACTGCAACT | tttttttttttttttttttttttttAACTACAACTAGCCCTAGGCTAATCTA | F | 52 | 5 |
| | R | TTGACAGATGGAGACACTGAAGC | | | | |
| rs1907702 | F | CCAACTCCTAATCAAGGCCTAC | ttttttttttttttttttttttttttttttCCTAATCAAGGCCTACAGAGACCTTC | F | 56 | 5 |
| | R | AGGAACATAAAGGAGGCCAGT | | | | |
| rs2052760 | F | ATTCAGAAAAGTGCATGCAGAAATT | ttttttttttttttttttttttttttttttttttATTATCAATGGGTTATTTTTGCCTCA | F | 60 | 5 |
| | R | GAGAGAGAGGAGTGAGAAAGGC | | | | |
| rs1667751 | F | CTGGTTCTTTTCCATCCAGCCTTTA | ttttttttttttttttttttttttttttttttttttttCTTTACAAGCTACAAGACTTACGCCT | F | 64 | 5 |
| | R | GAGATCACCAAGGGAGTAAGTACAG | | | | |

| Multiplex B | F/R | PCR primer | SBE primer | Orientation | Length (nt) | μM |
|---|---|---|---|---|---|---|
| rs1448484 | F | TCTCCTTCCAAGCCTTCTGAAAAAT | tATGAGAGCTGGCAGCTTCC | F | 20 | 6 |
| | R | GCAACCACACAGAACACAGC | | | | |
| rs714857 | F | GAAACTTCCCTAATGGGTCTTGTGA | tttCTTGTGAACCTTGGCTCCCTG | F | 24 | 6 |
| | R | CCTCCCTCACACATAAAACTTCTCA | | | | |
| rs16891982 | F | ATCCAAGTTGTGCTAGACCAGAA | ttttttGAGGAAAACACGGAGTTGATGCA | F | 29 | 5 |
| | R | AGAGGAGTCGAGGTTGGATG | | | | |
| rs1808089 | F | TGTCAGGCCTTACCACTGCATAAGA | ttttttttACAAATGAGTAATGCCGTGGTGG | R | 31 | 5 |
| | R | AAACAACTCAGCGGCACAAA | | | | |
| rs1478785 | F | TCCTGGAGGCTTGAGGGCTA | tttttttttAGGGATGTTCATTTAAAATAACATCGC | F | 36 | 5 |
| | R | GGCTTGCTGGCTTTTTCTAGAT | | | | |
| rs952718 | F | GAGCCTAGATCCTGACTTCCTTG | tttttttttttttAAAATGCAAATTTCACCTTCTTCAAAT | R | 40 | 5 |
| | R | CTGTCACTGGAGATGTCATCTCAT | | | | |
| rs1405467 | F | AATTTGCAACAAAGAGGAAGGGGA | ttttttttttttttttttAAGTAGTCAGCTGAACTCACCTGAT | F | 43 | 5 |
| | R | GAGCAATAAGAGTGACTATGTCTGC | | | | |
| rs1344870 | F | CAATCTCAGTTTTAATTGCCATGT | ttttttttttttttttttttttTCGCTCTTAAGTATGTTTTCTTGGTC | F | 48 | 5 |
| | R | AGGATGTATTGGGGCCTTTC | | | | |
| rs3843776 | F | AGGCCACTGTTGTGGTTTATG | tttttttttttttttttttttttttttTGTTGTGGTTTATGTTTCACTTCGAC | F | 53 | 6 |
| | R | TGAGGGCTCTACAACACTGC | | | | |
| rs721352 | F | TCTGTGCCCAGATGCAAATCCTTA | tttttttttttttttttttttttttttttTGCTTGATGGCTCCACCTATCA | R | 51 | 6 |
| | R | GACCCAGAACTGTGCAGG | | | | |
| rs722869 | F | CCTTCTGCACTTGGGCATATT | tttttttttttttttttttttttttttttttttCAAATCCTTCATTTCACAAATGAAGCT | R | 60 | 5 |
| | R | AGGTAGAGATCTAACAAACCACAGT | | | | |
| rs926774 | F | AATCAAGTTCAGACTTTTGCCTCAT | tttttttttttttttttttttttttttttttttttttAAGCTATTGTAGTGAGGAAGGCTAGA | R | 63 | 7 |
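The SBE primers in the table grow in steps of about four nucleotides within each multiplex, and many carry a run of lowercase `t` at the 5′ end. A minimal Python sketch of how one might sanity-check such a table is given here. It assumes that lowercase letters mark a non-target tail and uppercase letters the target-specific part; that convention is stated for the NRY minisequencing primers but is only an assumption for this table. The function names and the three-row sample are illustrative and not part of the original record.

```python
from typing import Dict, Tuple


def split_sbe_primer(primer: str) -> Tuple[str, str]:
    """Split a primer into its leading lowercase tail and the remainder.

    Assumption: lowercase letters at the 5' end are a non-target tail.
    """
    tail_len = 0
    for base in primer:
        if base.islower():
            tail_len += 1
        else:
            break
    return primer[:tail_len], primer[tail_len:]


# Three SBE primers with their listed lengths, copied from the table.
SBE_PRIMERS: Dict[str, Tuple[str, int]] = {
    "rs1048610": ("GTGTGCTGCAGGGACCTTTC", 20),
    "rs1876482": ("ttttttGGCTGTACCCTCACTATTGGTG", 28),
    "rs16891982": ("ttttttGAGGAAAACACGGAGTTGATGCA", 29),
}


def check_lengths(primers: Dict[str, Tuple[str, int]]) -> Dict[str, bool]:
    """Report, per SNP, whether the sequence length equals the listed length."""
    return {snp: len(seq) == listed for snp, (seq, listed) in primers.items()}


if __name__ == "__main__":
    for snp, (seq, listed) in SBE_PRIMERS.items():
        tail, target = split_sbe_primer(seq)
        print(snp, "tail:", len(tail), "target:", len(target), "listed:", listed)
```

Running the same check over every row would flag any primer whose sequence length disagrees with the Length column.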
MtDNA haplogroups observed among U.S. Americans and their assumed geographic region of origin
| mtDNA haplogroup | Asian | Eurasian | African | Native American |
|---|---|---|---|---|
| A | 1 | |||
| A2 | 1 | |||
| A5 | 1 | |||
| B2 | 1 | |||
| B4a | 1 | |||
| B4b1 | 1 | |||
| B4c | 1 | |||
| B5b | 1 | |||
| C1 | 1 | |||
| D/E/G | 1 | |||
| D/G | 1 | |||
| D1 | 1 | |||
| D4a | 1 | |||
| D4e | 1 | |||
| D4i | 1 | |||
| D4k | 1 | |||
| D5b | 1 | |||
| E2 | 1 | |||
| F1a | 1 | |||
| F1b | 1 | |||
| F2a | 1 | |||
| F3b | 1 | |||
| G | 1 | |||
| H | 1 | |||
| H11 | 1 | |||
| H13a | 1 | |||
| H1a | 1 | |||
| H1b | 1 | |||
| H1c | 1 | |||
| H3a | 1 | |||
| H5 | 1 | |||
| H6 | 1 | |||
| HV0 | 1 | |||
| I | 1 | |||
| J1b | 1 | |||
| J1c | 1 | |||
| J2a | 1 | |||
| K | 1 | |||
| L0a | 1 | |||
| L0a | 1 | |||
| L1b | 1 | |||
| L1c | 1 | |||
| L2a1 | 1 | |||
| L2b | 1 | |||
| L2c | 1 | |||
| L2d | 1 | |||
| L3 | 1 | |||
| L3a | 1 | |||
| L3b | 1 | |||
| L3d | 1 | |||
| L3e1 | 1 | |||
| L3e2 | 1 | |||
| L3e3 | 1 | |||
| L3e4 | 1 | |||
| L3f | 1 | |||
| L3h | 1 | |||
| M10 | 1 | |||
| M35 | 1 | |||
| M7a | 1 | |||
| M7b | 1 | |||
| M8a | 1 | |||
| M9a | 1 | |||
| N1a | 1 | |||
| N1b | 1 | |||
| N9 | 1 | |||
| R* | 0.5 | 0.5 | ||
| T1 | 1 | |||
| T2 | 1 | |||
| U2 | 1 | |||
| U3 | 1 | |||
| U4 | 1 | |||
| U5a | 1 | |||
| U5b | 1 | |||
| U6a | 1 | |||
| U8a | 1 | |||
| W | 1 | |||
| X2 | 1 | |||
| X2a | 1 | |||
Genotyping information NRY SNPs
| Additional | Haplogroup | SNP | Bibliographical source | GenBank | dbSNP accession (if known) | Position on Y chromosome | Forward amplification primer (5′→3′) | Reverse amplification primer (5′→3′) | Concentration in PCR (μM) | Amplicon size (bp) | Minisequencing primers (target-specific sequence in capitals) | Orientation | Concentration in minisequencing reaction (μM) | Primer size (nt) | Mutation: Wildtype/Mutant** |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| hgE | E | M96 | Refs | AC010889 | rs9306841 | 20238386 | GCCAGCCAAGAATGAAGAGA | TGAGCTGTGATGTGTAACTTGG | 0.1 | 143 | GGAAAACAGGTCTCTCATAATA | R | 0.04 | 22 | G/C |
| hgE | E1a | M33 | 2 | AC009977 | | 20199838 | CCGTCATAGGCTGAGACAAGA | CCCCAAGAGAGACAACTGAC | 0.15 | 150 | ccacgtcgtgaaagtctgacaaCAGTTACAAAAGTATAATATGTCTGAGAT | R | 0.06 | 51 | C/G |
| hgE | E1b1 | P2 | 3 | AC010137 | | 20070219 | GAGAATCAGCTCCAGCCATC | TTTTGGATCTTCATGCTGGTT | 0.03 | 100 | gacaaAGGTGCCCCTAGGAGGAGAA | F | 0.2 | 25 | T/C |
| hgE | E1b1a | M2 | 6 | AC011302 | rs3893 | 12606580 | ACGGAAGGAGTTCTAAAATTCAGG | AAAATACAGCTCCCCCTTTATCCT | 0.1 | 147 | cacgtcgtgaaagtctgacaaTTCATTGTTAACAAAAGTCC | R | 0.06 | 41 | G/A |
| hgE | E1b1a4 | M154 | 2 | AC010889 | | 20352065 | AGGCTACAAATTAGTGCGACA | GAGGCACAGATACTTAAACCATTG | 0.06 | 77 | acaaGTTACATGGCCTATAATATTCAGTACA | R | 0.03 | 31 | G/A |
| hgE | E1b1a7 | M191 | 2 | AC004474 | rs2032590 | 13529007 | AAAAATGGAGTGTTTATCAGAGCTT | CCCAGACACACCAAAATATCTC | 0.3 | 122 | gaaagtctgacaaAAAATATCTCATATTTTCAT | R | 0.25 | 33 | A/G |
| hgE | E1b1b | M215 | 2 | AC006376 | rs2032654 | 13977218 | TCAAACTGTTGGTAAATTTTAGAGAAA | CAGAAGCATCAGCTGGAACA | 0.25 | 97 | gtcgtgaaagtctgacaaCAGCTGGAACAGTTAGAAAG | R | 0.15 | 38 | C/T |
| hgE | E1b1b1 | M35 | 2 | AC009977 | rs1179188 | 20201091 | AGGGCATGGTCCCTTTCTAT | TCCATGCAGACTTTCGGAGT | 0.2 | 96 | actgactaaactaggtgccacgtcgtgaaagtctgacaaTCGGAGTCTCTGCCTGTGTC | R | 0.06 | 59 | G/A |
| hgE | E1b1b1a | M78* | 2 | AC010889 | | 20352691 | TGCATTACTCCGTATGTTCGAC | TGGAAGCTTACCATCTTTTTATGA | 0.05* | 132 | aagtctgacaaCTTATTTTGAAATATTTGGAAGGGC | R | 0.02 | 36 | A/C |
| hgE | E1b1b1a1 | V12 | 7 | AC012068 | | 6883099 | CTGAGTTGGATTGTTTTAAGTTGA | TTGGTCTCTCTTCATGTGCTG | 0.15 | 150 | acaaTTGTGTAGATAATTCAAAGT | R | 0.25 | 24 | C/T |
| hgE | E1b1b1a1a | M224 | 2 | AC010889 | | 20352687 | TGCATTACTCCGTATGTTCGAC | TGGAAGCTTACCATCTTTTTATGA | 0.05* | 132 | cgtgaaagtctgacaaAATTGATACACTTAACAAAGATACTTC | F | 0.15 | 43 | A/G |
| hgE | E1b1b1a1b | V32 | 7 | AC012068 | | 6992821 | GCAAATGTTCCATGAATGGTG | CCAGCCAGAGAGGCACTTTA | 0.4 | 111 | CCCaactgactaaactaggtgccacgtcgtgaaagtctgacaaCACACATGTATATACACACC | R | 0.25 | 63 | C/G |
| hgE | E1b1b1a2 | V13 | 7 | AC012068 | | 6902263 | CAACAGTGGAGGACAAAGCA | AAGACCAGCCTGACCAACAT | 0.15 | 106 | cgtcgtgaaagtctgacaaGCTCAAACTTCCCTTG | R | 0.15 | 35 | A/G |
| hgE | E1b1b1a3 | V22 | 7 | AC012068 | | 6919957 | TGGCAATGCCTCAACTTACA | ATTCCCCAAGGTTTCAGAGG | 0.15 | 110 | CaactgactaaactaggtgccacgtcgtgaaagtctgacaaCCAAGGTTTCAGAGGTC | R | 0.15 | 58 | C/G |
| hgE | E1b1b1b | M81 | 2 | AC010889 | rs2032640 | 20351960 | GCACTATCATACTCAGCTACACATCTC | TTGTTTCTTCTTGGTTTGTGTGA | 0.03 | 99 | acaaCTTGGTTTGTGTGAGTATACTCTATGAC | R | 0.03 | 32 | G/A |
| hgE | E1b1b1c | M123 | 2 | AC010889 | | 20223974 | GTTGCCCAGGAATTTGCAT | CACAGAGCAAGTGACTCTCAAAG | 0.15 | 89 | taaactaggtgccacgtcgtgaaagtctgacaaCATTTCTAGGTATTCAGGCGATG | F | 0.1 | 56 | T/G |
| hgE | E1b1b1d | M281 | 4 | AC010889 | rs13447370 | 20223888 | AGCAAAGTTGAGGTTGCACA | TGGGCAACACCAGAATCTAA | 0.15 | 93 | gtgccacgtcgtgaaagtctgacaaGCACAAACTCAGTATTATTAAAC | F | 0.06 | 48 | T/C |
| hgE | E1b1b1e | V6 | 3 | AC012068 | | 6992007 | GATGGCACAGTGTTCGACAG | CTTCTCTCCAAATGCCTGCT | 0.4 | 102 | taggtgccacgtcgtgaaagtctgacaaCCTGCTGCCGCATCTGCA | R | 0.02 | 46 | T/C |
| hgE | E2 | M75 | 2 | AC010889 | rs2032639 | 20349565 | TGACTTGTCAAAAGCCAAAACA | TTGAACAGAGGCATTTGTGA | 0.1 | 123 | taggtgccacgtcgtgaaagtctgacaaGAAAAGACAATTATCAAACCACATCC | F | 0.1 | 54 | C/T |
Supp. Figure S1. Phylogenetic tree of NRY SNPs.
NRY DNA haplogroups observed among U.S. Americans and their assumed geographic region of origin
| NRY haplogroup | Asian | Eurasian | African | Native American |
|---|---|---|---|---|
| A | 1 | |||
| B | 1 | |||
| C | 1 | |||
| D | 1 | |||
| E1a | 1 | |||
| E1b1a*(xE1b1a4,E1b1a7) | 1 | |||
| E1b1a7 | 1 | |||
| E1b1b1*(xE1b1b1a,E1b1b1b,E1b1b1c,E1b1b1d,E1b1b1e) | 0.5 | 0.5 | ||
| E1b1b1a*(xE1b1b1a1,E1b1b1a2,E1b1b1a3) | 0.5 | 0.5 | ||
| E1b1b1a1*(xE1b1b1a1a,E1b1b1a1b) | 0.5 | 0.5 | ||
| E1b1b1a2 | 1 | |||
| E1b1b1a3 | 0.5 | 0.5 | ||
| E1b1b1b | 1 | |||
| E1b1b1c | 0.8 | 0.2 | ||
| E2 | 1 | |||
| G | 1 | |||
| I | 1 | |||
| J*(xJ2) | 1 | |||
| J2 | 1 | |||
| K*(xL,M1,NO,P) | 0.333 | 0.333 | 0.333 | |
| N1c | 1 | |||
| O | 1 | |||
| Q1a | 1 | |||
| R1a | 1 | |||
| R1b1b2 | 1 | |||
| R2 | 1 | |||
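Several haplogroups in the table are split across regions with fractional weights (0.5/0.5, 0.8/0.2, or 0.333 three ways). One plausible way to turn such a weight table into group-level ancestry proportions is to let each carrier contribute its haplogroup's weights and then normalise by the number of individuals. The Python sketch below illustrates that reading only; it is not confirmed as the authors' procedure, and the haplogroup names `hgX` and `hgY`, their weights, and the counts are hypothetical placeholders rather than values from the study.

```python
from collections import defaultdict
from typing import Dict


def continental_proportions(
    counts: Dict[str, int],
    weights: Dict[str, Dict[str, float]],
) -> Dict[str, float]:
    """Average regional weights over all individuals.

    counts  : haplogroup -> number of individuals carrying it
    weights : haplogroup -> {region: weight}, weights per haplogroup sum to 1
    """
    totals: Dict[str, float] = defaultdict(float)
    n_individuals = 0
    for haplogroup, count in counts.items():
        if haplogroup not in weights:
            raise KeyError(f"no regional weights for haplogroup {haplogroup!r}")
        for region, weight in weights[haplogroup].items():
            totals[region] += count * weight
        n_individuals += count
    if n_individuals == 0:
        return {}
    return {region: total / n_individuals for region, total in totals.items()}


# Hypothetical inputs for illustration only (not data from the study).
EXAMPLE_WEIGHTS = {
    "hgX": {"African": 1.0},
    "hgY": {"African": 0.5, "Eurasian": 0.5},
}
EXAMPLE_COUNTS = {"hgX": 3, "hgY": 1}

if __name__ == "__main__":
    print(continental_proportions(EXAMPLE_COUNTS, EXAMPLE_WEIGHTS))
```

With these placeholder inputs, three of four individuals count fully towards one region and the fourth is split evenly, which shows how an ambiguous haplogroup dilutes rather than distorts the group average.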
Figure 1. Genetic ancestry per individual in the global HGDP-CEPH panel as estimated by STRUCTURE using 24 autosomal ASMs (K=4).
Figure 2. Proportions of average continental genetic ancestry in four U.S. American groups of self-declared ancestry based on autosomal DNA, mtDNA and NRY DNA.
Correspondence between self-declared ancestry and STRUCTURE-based genetic ancestry inferred from 24 autosomal ASMs in four major U.S. American self-declared groups
| Self-declared ancestry | Cluster K1 | Cluster K2 | Cluster K3 | Cluster K4 |
|---|---|---|---|---|
| U.S. African | 0% | 2.2% | 1.0% | 96.8% |
| U.S. European | 0% | 19.0% | 80.6% | 0.4% |
| U.S. Hispanic | 2.4% | 77.8% | 15.7% | 4.0% |
| U.S. Asian | 99.9% | 0.1% | 0% | 0% |
Figure 3. Two-dimensional plots of the first, second and third dimensions obtained from an MDS analysis (stress = 0.13) performed with an Identical By State (IBS) distance matrix computed between pairs of individuals. Centroids of the four continental parental populations from HGDP-CEPH are marked by crosses.
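The MDS analysis in Figure 3 starts from a pairwise IBS distance matrix. The record does not give the exact IBS formula used, so the Python sketch below assumes one common definition: genotypes are coded as 0, 1 or 2 copies of a chosen allele, two individuals share 2 − |g₁ − g₂| alleles at a SNP, and the distance is one minus the mean proportion of shared alleles. The function names and the genotype coding are assumptions for illustration, and the MDS step itself is not shown.

```python
from typing import List, Optional, Sequence

# 0, 1 or 2 copies of a chosen allele; None marks a missing genotype.
Genotype = Optional[int]


def ibs_distance(a: Sequence[Genotype], b: Sequence[Genotype]) -> float:
    """Return 1 minus the mean proportion of alleles shared identical by state."""
    shared = 0
    compared = 0
    for ga, gb in zip(a, b):
        if ga is None or gb is None:
            continue  # skip SNPs where either genotype is missing
        shared += 2 - abs(ga - gb)
        compared += 1
    if compared == 0:
        raise ValueError("no SNPs with genotypes in both individuals")
    return 1.0 - shared / (2.0 * compared)


def ibs_matrix(genotypes: Sequence[Sequence[Genotype]]) -> List[List[float]]:
    """Build the symmetric pairwise IBS distance matrix."""
    n = len(genotypes)
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = ibs_distance(genotypes[i], genotypes[j])
            matrix[i][j] = d
            matrix[j][i] = d
    return matrix


if __name__ == "__main__":
    sample = [[0, 0], [2, 2], [1, 1]]  # three hypothetical individuals, two SNPs
    for row in ibs_matrix(sample):
        print(row)
```

A matrix built this way could then be handed to any MDS routine that accepts precomputed dissimilarities.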