| Literature DB >> 23936216 |
Anna Olivieri1, Maria Pala, Francesca Gandini, Baharak Hooshiar Kashani, Ugo A Perego, Scott R Woodward, Viola Grugni, Vincenza Battaglia, Ornella Semino, Alessandro Achilli, Martin B Richards, Antonio Torroni.
Abstract
The current human mitochondrial (mtDNA) phylogeny does not equally represent all human populations but is biased in favour of representatives originally from north and central Europe. This especially affects the phylogeny of some uncommon West Eurasian haplogroups, including I and W, whose southern European and Near Eastern components are very poorly represented, suggesting that extensive hidden phylogenetic substructure remains to be uncovered. This study expanded and re-analysed the available datasets of I and W complete mtDNA genomes, reaching a comprehensive 419 mitogenomes, and searched for precise correlations between the ages and geographical distributions of their numerous newly identified subclades with events of human dispersal which contributed to the genetic formation of modern Europeans. Our results showed that haplogroups I (within N1a1b) and W originated in the Near East during the Last Glacial Maximum or pre-warming period (the period of gradual warming between the end of the LGM, ∼19 ky ago, and the beginning of the first main warming phase, ∼15 ky ago) and, like the much more common haplogroups J and T, may have been involved in Late Glacial expansions starting from the Near East. Thus our data contribute to a better definition of the Late and postglacial re-peopling of Europe, providing further evidence for the scenario that major population expansions started after the Last Glacial Maximum but before Neolithic times, but also evidencing traces of diffusion events in several I and W subclades dating to the European Neolithic and restricted to Europe.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23936216 PMCID: PMC3729697 DOI: 10.1371/journal.pone.0070492
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Phylogenetic tree of haplogroups N1a1b and W.
This schematic representation is based on 196 N1a1b and 223 W mitogenomes whose phylogenetic relationships are illustrated in detail in Figure S1 and Figure S2. The phylogenetic connections between N1a1b and W are also shown. Approximate ages can be inferred from the scale. For haplogroups N1a1b and W, they correspond to the ML ages in Table 1 while previously reported ML ages were employed for nodes N, N1 and N2 [3].
Molecular divergence and age estimates (maximum likelihood and ρ) for haplogroups N1a1b1 and I and their subclades.
| All Nucleotide Substitutions | |||||||||
| Haplogroup |
| ML | SE | Age (ky) | 95% CI (ky) | ρ | σ | Age (ky) | 95% CI (ky) |
| N1a1b | 196 | 10.3 | 0.9 | 28.6 | {23.5; 33.9} | 10.7 | 3.1 | 29.8 | {12.2; 48.5} |
| >N1a1b1 | 4 | 7.7 | 1.1 | 21.1 | {14.9; 27.5} | 8.5 | 3.0 | 23.3 | {6.9; 41.1} |
| >>N1a1b1a | 3 | 7.1 | 0.9 | 19.3 | {14.6; 24.2} | 7.0 | 3.2 | 19.0 | {1.8; 37.8} |
| >>>N1a1b1a1 | 2 | 2.3 | 0.9 | 5.9 | {1.3; 10.6} | 2.0 | 1.0 | 5.2 | {0.1; 10.5} |
| >I | 192 | 7.4 | 0.3 | 20.1 | {18.4; 21.9} | 8.7 | 1.2 | 23.8 | {16.9; 30.9} |
| >>I1 | 70 | 6.1 | 0.3 | 16.3 | {14.6; 18.0} | 8.7 | 3.2 | 23.9 | {6.6; 42.7} |
| >>>I1a | 45 | 4.4 | 0.3 | 11.6 | {9.9; 13.3} | 7.0 | 4.4 | 18.9 | {−4.2; 44.9} |
| >>>>I1a1 | 41 | 1.9 | 0.1 | 4.9 | {4.2; 5.6} | 2.2 | 0.3 | 5.9 | {4.3; 7.4} |
| >>>>>I1a1a | 19 | 1.5 | 0.1 | 3.8 | {3.3; 4.4} | 1.7 | 0.1 | 4.4 | {3.6; 5.1} |
| >>>>>I1a1b | 4 | 0.5 | 0.2 | 1.4 | {0.5; 2.2} | 0.5 | 0.1 | 1.3 | {0.7; 1.9} |
| >>>>>I1a1c | 4 | 1.0 | 0.2 | 2.5 | {1.3; 3.7} | 1.3 | 0.7 | 3.2 | {−0.2; 6.8} |
| >>>>>I1a1d | 2 | 0.7 | 0.2 | 1.8 | {1.0; 2.6} | 1.5 | 0.8 | 3.9 | {0.1; 7.8} |
| >>>I1b | 11 | 5.0 | 0.4 | 13.4 | {11.3; 15.5} | 4.9 | 0.7 | 13.1 | {9.3; 17.0} |
| >>>I1c | 6 | 3.9 | 0.4 | 10.3 | {8.4; 12.2} | 5.3 | 2.7 | 14.3 | {0.3; 29.4} |
| >>>>I1c1 | 5 | 2.7 | 0.3 | 7.2 | {5.4; 9.0} | 3.6 | 1.7 | 9.5 | {0.8; 18.7} |
| >>>>>I1c1a | 4 | 1.5 | 0.3 | 4.0 | {2.5; 5.4} | 1.8 | 0.4 | 4.6 | {2.3; 6.8} |
| >>I5 | 29 | 6.8 | 0.3 | 18.4 | {16.4; 20.3} | 8.2 | 2.4 | 22.6 | {9.1; 36.9} |
| >>>I5a | 24 | 5.9 | 0.4 | 16.0 | {14.0; 17.9} | 6.8 | 1.5 | 18.4 | {10.3; 26.9} |
| >>>>I5a1 | 9 | 3.5 | 0.4 | 9.2 | {7.1; 11.3} | 3.6 | 0.7 | 9.4 | {5.5; 13.4} |
| >>>>I5a2 | 10 | 4.6 | 0.4 | 12.3 | {10.2; 14.4} | 5.4 | 3.3 | 14.5 | {−2.9; 33.6} |
| >>>>>I5a2a | 8 | 0.6 | 0.1 | 1.6 | {1.0; 2.1} | 0.6 | 0.1 | 1.6 | {1.2; 2.0} |
| >>>>I5a3 | 2 | 1.8 | 0.4 | 4.8 | {2.8; 6.8} | 2.0 | 1.0 | 5.2 | {0.1; 10.5} |
| >>>>I5a4 | 3 | 2.1 | 0.4 | 5.6 | {3.5; 7.8} | 2.7 | 1.1 | 7.0 | {1.3; 12.9} |
| >>>I5b | 2 | 3.3 | 0.5 | 8.8 | {6.3; 11.2} | 3.5 | 1.8 | 9.2 | {0.2; 18.8} |
| >>I6 | 4 | 6.8 | 0.4 | 18.4 | {16.2; 20.6} | 8.5 | 3.3 | 23.3 | {5.6; 42.6} |
| >>>I6a | 2 | 2.0 | 0.3 | 5.3 | {3.5; 7.0} | 2.5 | 1.3 | 6.5 | {0.1; 13.2} |
| >>>I6b | 2 | 4.9 | 0.5 | 13.1 | {10.4; 15.8} | 5.5 | 2.8 | 14.8 | {0.3; 30.4} |
| >>I7 | 2 | 3.5 | 0.5 | 9.1 | {6.3; 11.9} | 3.0 | 1.5 | 7.9 | {0.2; 16.0} |
| >>node 152 | 65 | 5.8 | 1.2 | 15.5 | {8.8; 22.5} | 5.5 | 1.7 | 14.9 | {5.8; 24.4} |
| >>>I2'3 | 64 | 4.7 | 0.4 | 12.6 | {10.4; 14.7} | 4.6 | 0.7 | 12.2 | {8.2; 16.2} |
| >>>>I2 | 46 | 2.6 | 0.2 | 6.8 | {6.0; 7.6} | 3.4 | 0.2 | 9.1 | {8.0; 10.2} |
| >>>>>I2a | 10 | 1.8 | 0.2 | 4.7 | {3.8; 5.7} | 2.3 | 0.4 | 6.0 | {3.9; 8.2} |
| >>>>>>I2a1 | 4 | 1.3 | 0.2 | 3.2 | {2.1; 4.4} | 2.0 | 0.8 | 5.2 | {1.4; 9.2} |
| >>>>>I2b | 4 | 0.7 | 0.2 | 1.7 | {0.5; 2.9} | 0.8 | 0.2 | 1.9 | {1.0; 2.9} |
| >>>>>I2c | 6 | 1.8 | 0.2 | 4.7 | {3.6; 5.8} | 1.5 | 0.3 | 3.9 | {2.3; 5.5} |
| >>>>>I2d | 3 | 1.1 | 0.4 | 3.0 | {1.1; 4.8} | 1.0 | 0.3 | 2.6 | {0.9; 4.3} |
| >>>>>I2e | 2 | 1.2 | 0.3 | 3.1 | {1.4; 4.8} | 1.0 | 0.5 | 2.6 | {0.1; 5.2} |
| >>>>I3 | 18 | 4.0 | 0.3 | 10.6 | {8.8; 12.4} | 3.9 | 0.6 | 10.3 | {7.3; 13.4} |
| >>>>>I3a | 10 | 2.8 | 0.2 | 7.4 | {6.1; 8.7} | 2.6 | 0.3 | 6.8 | {5.2; 8.4} |
| >>>>>>I3a1 | 2 | 2.3 | 0.3 | 6.1 | {4.7; 7.5} | 3.5 | 1.8 | 9.2 | {0.2; 18.8} |
| >>>>>I3b | 2 | 1.0 | 0.3 | 2.6 | {1.1; 4.2} | 1.0 | 0.5 | 2.6 | {0.1; 5.2} |
| >>>>>I3c | 3 | 3.5 | 0.3 | 9.4 | {7.6; 11.2} | 6.0 | 2.9 | 16.2 | {0.9; 32.7} |
| >>I4 | 19 | 5.6 | 0.5 | 15.1 | {12.3; 18.0} | 3.7 | 1.1 | 9.9 | {4.3; 15.7} |
| >>>I4a | 17 | 2.4 | 0.2 | 6.4 | {5.4; 7.4} | 2.4 | 0.3 | 6.3 | {5.0; 7.7} |
| >>>>I4a1 | 6 | 2.2 | 0.2 | 5.7 | {4.7; 6.7} | 3.2 | 0.6 | 8.3 | {5.3; 11.5} |
| >>>I4b | 2 | 3.2 | 0.5 | 8.4 | {5.8; 10.9} | 3.5 | 1.8 | 9.2 | {0.2; 18.8} |
Number of mtDNA sequences.
Maximum likelihood molecular divergence.
Using the corrected molecular clock proposed by [21].
Molecular divergence and age estimates (maximum likelihood and ρ) for haplogroup W and its subclades.
| All Nucleotide Substitutions | |||||||||
| Haplogroup |
| ML | SE | Age (ky) | 95% CI (ky) | ρ | σ | Age (ky) | 95% CI (ky) |
| W | 223 | 6.2 | 0.5 | 16.8 | {14.2; 19.5} | 6.8 | 0.8 | 18.4 | {14.1; 22.8} |
| >W1 | 93 | 3.9 | 0.3 | 10.4 | {9.0; 11.9} | 4.0 | 0.4 | 10.7 | {8.5; 12.9} |
| >>W1a | 25 | 0.6 | 0.1 | 1.6 | {1.2; 2.0} | 0.7 | 0.0 | 1.8 | {1.5; 2.0} |
| >>W1b | 15 | 1.1 | 0.2 | 2.8 | {1.9; 3.6} | 1.6 | 0.6 | 4.2 | {1.0; 7.4} |
| >>>W1b1 | 11 | 0.8 | 0.1 | 2.0 | {1.3; 2.6} | 1.0 | 0.1 | 2.6 | {2.0; 3.2} |
| >>W1e | 5 | 1.6 | 0.4 | 4.2 | {2.4; 6.1} | 2.4 | 1.4 | 6.3 | {−1.1; 14.0} |
| >>W1f | 3 | 2.7 | 0.5 | 7.2 | {4.8; 9.6} | 2.7 | 1.6 | 7.0 | {−1.0; 15.4} |
| >>119 node | 27 | 3.3 | 0.2 | 8.6 | {7.4; 9.8} | 4.1 | 0.6 | 11.0 | {7.6; 14.5} |
| >>>W1c | 16 | 3.0 | 0.2 | 7.9 | {6.8; 9.0} | 4.4 | 0.7 | 11.8 | {7.9; 15.8} |
| >>>>W1c1 | 6 | 1.8 | 0.2 | 4.6 | {3.6; 5.7} | 2.3 | 0.6 | 6.1 | {3.2; 9.0} |
| >>W1g | 3 | 0.8 | 1.3 | 2.0 | {−4.6; 8.9} | 0.7 | 0.4 | 1.7 | {−0.5; 4.0} |
| >194 node | 119 | 6.2 | 0.3 | 16.8 | {15.0; 18.7} | 7.4 | 0.9 | 20.3 | {15.5; 25.2} |
| >>W3 | 45 | 5.5 | 0.3 | 14.8 | {13.0; 16.6} | 5.8 | 1.2 | 15.5 | {8.9; 22.5} |
| >>>W3a | 33 | 4.4 | 0.3 | 11.8 | {10.1; 13.6} | 4.8 | 1.0 | 12.7 | {7.4; 18.2} |
| >>>>W3a1 | 29 | 3.7 | 0.2 | 9.8 | {8.7; 10.9} | 3.8 | 0.2 | 10.0 | {8.8; 11.3} |
| >>>>>W3a1a | 5 | 3.5 | 0.2 | 9.1 | {8.0; 10.3} | 5.8 | 1.4 | 15.6 | {8.0; 23.5} |
| >>>>>>W3a1a1 | 2 | 3.2 | 0.2 | 8.3 | {7.1; 9.5} | 7.0 | 3.5 | 19.0 | {0.4; 39.5} |
| >>>>>>W3a1a2 | 2 | 2.4 | 0.3 | 6.3 | {4.6; 8.0} | 3.5 | 1.8 | 9.2 | {0.2; 18.8} |
| >>>>>W3a1b | 5 | 3.3 | 0.2 | 8.7 | {7.5; 9.9} | 4.0 | 1.2 | 10.6 | {4.3; 17.2} |
| >>>>>W3a1c | 3 | 1.4 | 0.3 | 3.7 | {2.2; 5.2} | 1.3 | 0.4 | 3.5 | {1.2; 5.8} |
| >>>>W3a2 | 3 | 2.1 | 1.7 | 5.6 | {−3.0; 14.8} | 2.7 | 1.6 | 7.0 | {−1.0; 15.4} |
| >>>W3b | 11 | 3.8 | 0.3 | 10.0 | {8.2; 11.8} | 3.7 | 0.7 | 9.9 | {6.4; 13.4} |
| >>>>W3b1 | 3 | 0.4 | 0.2 | 1.0 | {0.2; 1.8} | 0.3 | 0.1 | 0.9 | {0.3; 1.4} |
| >>W4 | 14 | 4.6 | 0.4 | 12.2 | {9.9; 14.7} | 4.6 | 0.8 | 12.2 | {7.7; 16.8} |
| >>>W4a | 8 | 3.7 | 0.4 | 9.8 | {7.6; 12.0} | 3.8 | 0.9 | 9.9 | {5.1; 14.8} |
| >>W5 | 29 | 4.6 | 0.3 | 12.2 | {10.5; 14.0} | 5.8 | 2.3 | 15.6 | {3.6; 28.4} |
| >>>W5a | 24 | 3.2 | 0.3 | 8.4 | {6.9; 9.8} | 4.0 | 1.2 | 10.6 | {4.2; 17.2} |
| >>>>W5a1 | 18 | 2.7 | 0.2 | 7.1 | {5.9; 8.4} | 3.4 | 1.0 | 9.1 | {3.7; 14.7} |
| >>>>>W5a1a | 16 | 2.1 | 0.2 | 5.5 | {4.5; 6.6} | 2.4 | 0.3 | 6.4 | {4.9; 7.8} |
| >>>>>>W5a1a1 | 5 | 1.6 | 0.2 | 4.1 | {3.0; 5.2} | 2.2 | 0.6 | 5.7 | {2.6; 8.9} |
| >>>>>>>W5a1a1a | 2 | 0.4 | 0.1 | 1.0 | {0.3; 1.7} | 0.5 | 0.3 | 1.3 | {0.0; 2.6} |
| >>>>W5a2 | 5 | 2.0 | 0.3 | 5.1 | {3.7; 6.5} | 2.0 | 0.5 | 5.2 | {2.7; 7.7} |
| >>>W5b | 4 | 3.5 | 0.4 | 9.3 | {7.3; 11.4} | 4.3 | 1.9 | 11.3 | {1.2; 22.0} |
| >>W6 | 29 | 4.7 | 0.3 | 12.6 | {10.8; 14.5} | 5.4 | 1.1 | 14.4 | {8.3; 20.7} |
| >>>16192 node | 26 | 4.2 | 0.3 | 11.2 | {9.8; 12.6} | 4.3 | 0.4 | 11.6 | {9.5; 13.6} |
| >>>>W6a | 5 | 1.9 | 0.4 | 5.0 | {3.1; 6.8} | 1.6 | 0.6 | 4.2 | {1.3; 7.1} |
| >>>>W6b | 6 | 3.1 | 0.3 | 8.1 | {6.4; 9.7} | 4.3 | 1.1 | 11.5 | {5.6; 17.6} |
| >>>W6c | 2 | 3.6 | 0.4 | 9.6 | {7.7; 11.4} | 5.0 | 2.5 | 13.4 | {0.3; 27.5} |
| >W7 | 4 | 3.7 | 0.7 | 9.9 | {6.0; 14.0} | 2.8 | 1.2 | 7.2 | {1.1; 13.6} |
Number of mtDNA sequences.
Maximum likelihood molecular divergence.
Using the corrected molecular clock proposed by [21].
Figure 2Spatial frequency distribution of haplogroups I and W and the sub-clades I1a and W6.
Note that different frequency scales (%) were used in the maps. The dots in the lower map indicate the geographic location of the population samples included in the survey (Table S3 in File S1).
Figure 3Bayesian Skyline Plots (BSPs) of haplogroups N1a1b and W.
Hypothetical effective population sizes through time are based on the mitogenomes listed in Table S1 and Table S2 (File S1).
List of ancient specimen belonging to mtDNA haplogroups I and W and control-region haplotypes.
| Country | Site | Date (BC) | Cultural Period | Haplogroup | Control-Region Haplotype | Sequence Range | Reference |
| Spain | Paternanbidea, Navarra | 6090–5960 | Neolithic | I | 16129 16233 | HVS-I, HVS-II |
|
| Spain | Cami de Can GrauGranollers, Barcelona | 3500–3000 | Neolithic | I1c1 | (16129) 16223 16264 16270 16311 16319 16362 | HVS-I |
|
| Germany | Kromsdorf | 2600–2500 | Late Neolithic - Bell Beaker culture | I1a1 | 16129 16172 16223 16311 16391 73199 203 204 250 263 | 15995–16429, 34–287 |
|
| Germany | Eulau | 2563–2465 | Late Neolithic - Corded Ware culture | I | 16129 16223 16391 | 15997–16409 |
|
| Germany | Esperstedt | 2050–1800 | Late Neolithic - Unetice culture | I | 16129 16223 16278 16311 16391 | 15997–16409 |
|
| Germany | Esperstedt | 2050–1800 | Late Neolithic - Unetice culture | I | 16086 16129 16223 16391 | 15997–16409 |
|
| Kazakhstan | Vodokhranilische | 1400–1000 | Bronze Age - Kurgan culture | I | 16129 16223 16294 | HVS-I |
|
| Germany | Derenburg Meerenstieg II | 5000 | Early Neolithic - LBK culture | W | 16093 16223 16292 | 15997–16409 |
|
| Germany | Derenburg Meerenstieg II | 5000 | Early Neolithic - LBK culture | W | 16093 16223 16292 | 15997–16409 |
|
| Spain | Cami de Can GrauGranollers, Barcelona | 3500–3000 | Neolithic | W | 16223 16292 16295 16304 | HVS-I |
|
| Germany | Kromsdorf | 2600–2500 | Late Neolithic - Bell Beaker culture | W5a? | 16223 16292 16362 73 189 194 195204 207 263 | 15995–16429, 34–287 |
|
| Germany | Esperstedt | 2700–2000 | Late Neolithic - Corded Ware culture | W6 | 16192 16223 16292 16325 | 15997–16409 |
|
| Germany | Esperstedt | 2050–1800 | Late Neolithic - Unetice culture | W | 16147G 16223 16292 | 15997–16409 |
|
| Kazakhstan | Zevakinskiy | 800–700 | Bronze/Iron Age | W | 16223 16292 | HVS-I |
|
Control-region haplotypes are from the root of haplogroup N.