Alessio Boattini, Begoña Martinez-Cruz, Stefania Sarno, Christine Harmant, Antonella Useli, Paula Sanz, Daniele Yang-Yao, Jeremy Manry, Graziella Ciani, Donata Luiselli, Lluis Quintana-Murci, David Comas, Davide Pettener.
Abstract
Located in the center of the Mediterranean landscape and with an extensive coastline, the territory of what is today Italy has played an important role in the history of human settlements and movements of Southern Europe and the Mediterranean Basin. The territory has been populated since Paleolithic times, and the complexity of human movements during the Neolithic, the Metal Ages and the most recent history of the last two millennia (involving the overlapping of different cultural and demic strata) has shaped the pattern of the modern Italian genetic structure. With the aim of disentangling this pattern and understanding which processes most strongly shaped the distribution of diversity, we have analyzed the uniparentally-inherited markers in ∼900 individuals from an extensive sampling across the Italian peninsula, Sardinia and Sicily. Spatial PCAs and DAPCs revealed a sex-biased pattern indicating different demographic histories for males and females. Besides the genetic outlier position of Sardinians, a North West-South East Y-chromosome structure is found in continental Italy. Such structure is in agreement with recent archeological syntheses indicating two independent and parallel processes of Neolithisation. In addition, date estimates pinpoint the importance of the cultural and demographic events during the late Neolithic and Metal Ages. On the other hand, mitochondrial diversity is distributed more homogeneously, in agreement with older population events that might be related to the presence of an Italian Refugium during the last glacial period in Europe.
Year: 2013 PMID: 23734255 PMCID: PMC3666984 DOI: 10.1371/journal.pone.0065441
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1. Spatial Principal Component Analysis (sPCA) based on frequencies of Y-chromosome haplogroups.
The first two global components, sPC1 (a) and sPC2 (b), are depicted. Positive values are represented by black squares; negative values are represented by white squares; the size of the square is proportional to the absolute value of sPC scores.
Frequencies of Y-Chromosome DAPC cluster for each Italian sPCA-identified group.
| HG | DAPC cluster | Haplotypes NWI | Haplotypes SEI | Haplotypes SAR | Haplotypes TOT | Individuals NWI | Individuals SEI | Individuals SAR | Individuals TOT | Max% (group) |
|---|---|---|---|---|---|---|---|---|---|---|
| | 1 | 8 | 10 | 1 | 19 | 8 | 10 | 1 | 19 | 53% (SEI) |
| | 2 | 6 | 6 | 0 | 12 | 6 | 6 | 0 | 12 | 50% (NWI, SEI) |
| | 3 | 3 | 11 | 1 | 15 | 3 | | 1 | 15 | |
| | 4 | 5 | 6 | 0 | 11 | 5 | 6 | 0 | 11 | 55% (SEI) |
| | 5 | 6 | 6 | 0 | 12 | 6 | 6 | 0 | 12 | 50% (NWI, SEI) |
| | 1 | 18 | 6 | 2 | 26 | | 6 | 2 | 28 | |
| | 2 | 1 | 3 | 0 | 4 | 1 | 3 | 0 | 4 | 75% (SEI) |
| | 3 | 8 | 10 | 8 | 26 | 8 | 10 | 8 | 26 | 38% (SEI) |
| | 4 | 2 | 20 | 0 | 22 | 2 | | 0 | 22 | |
| | 5 | 1 | 15 | 1 | 17 | 1 | | 1 | 18 | |
| | 1 | 0 | 1 | 18 | 19 | 0 | 1 | | 20 | |
| | 2 | 2 | 1 | 12 | 15 | 2 | 1 | | 16 | |
| | 1 | 7 | 9 | 3 | 19 | 7 | 9 | 3 | 19 | 47% (SEI) |
| | 2 | 8 | 18 | 2 | 28 | 8 | 19 | 2 | 29 | 66% (SEI) |
| | 3 | 7 | 11 | 0 | 18 | 7 | 12 | 0 | 19 | 63% (SEI) |
| | 1 | 11 | 4 | 1 | 16 | | 4 | 1 | 17 | |
| | 2 | 13 | 8 | 0 | 21 | 13 | 9 | 0 | 22 | 59% (NWI) |
| | 3 | 6 | 5 | 0 | 11 | 6 | 5 | 0 | 11 | 55% (NWI) |
| | 1 | 16 | 7 | 2 | 25 | 16 | 7 | 2 | 25 | 64% (NWI) |
| | 2 | 21 | 1 | 0 | 22 | | 1 | 0 | 22 | |
| | 3 | 23 | 8 | 2 | 33 | 24 | 10 | 2 | 36 | 67% (NWI) |
| | 4 | 16 | 4 | 2 | 22 | | 5 | 2 | 24 | |
| | 1 | 18 | 1 | 1 | 20 | | 1 | 1 | 20 | |
| | 2 | 18 | 6 | 1 | 25 | | 6 | 1 | 25 | |
| | 3 | 10 | 4 | 0 | 14 | | 4 | 0 | 14 | |
*Number of individuals <10
The absolute numbers of haplotypes and individuals are shown for each DAPC cluster, and the maximum frequency for each cluster is expressed as a percentage (max%). NWI: North-Western Italy; SEI: Southern and Eastern Italy; SAR: Sardinia.
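As a minimal sketch of how the max% column can be read, the snippet below takes the individual counts of one DAPC cluster in the three areas and returns the share held by the best-represented area, listing every area in case of a tie. The function name `max_group_share` and the round-half-up convention are assumptions made for illustration; the source does not describe how the percentages were rounded.

```python
def max_group_share(counts):
    """Return (percent, groups) for the area(s) holding the most individuals.

    `counts` maps an area label (e.g. "NWI", "SEI", "SAR") to the number of
    individuals of one DAPC cluster sampled in that area.
    """
    total = sum(counts.values())
    if total == 0:
        raise ValueError("cluster has no individuals")
    top = max(counts.values())
    # Keep every area that reaches the maximum, so ties are reported together.
    groups = [area for area, n in counts.items() if n == top]
    # Round half up to a whole percentage (assumed convention).
    percent = int(100 * top / total + 0.5)
    return percent, groups


if __name__ == "__main__":
    # Counts taken from two complete rows of the table above.
    print(max_group_share({"NWI": 8, "SEI": 10, "SAR": 1}))
    print(max_group_share({"NWI": 6, "SEI": 6, "SAR": 0}))
```

With the counts 8, 10 and 1 this gives 53% for SEI, and with 6, 6 and 0 it gives 50% shared by NWI and SEI, matching the corresponding table rows.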
Figure 2. Discriminant Analysis of Principal Components (DAPC) for G2a-P15 haplotypes.
Samples are grouped according to their affiliation to the sPCA-identified groups (NWI; SEI; SAR; symbols in the top right table). The table in the bottom left shows the number of haplotypes in each of the five G2a clusters and their geographical distribution in the three Italian areas. DAPC eigenvalues are depicted in the enclosed barplot.
Age estimates (in YBP) of STR and HVS variation for the most common haplogroups in the Italian data set.
| Y-chromosome haplogroup | SD | SE | Age estimate (YBP) | SE |
|---|---|---|---|---|
| | 146.46 | 51.78 | 3662 | 1295 |
| Cluster 3 (SEI 70.3%) | 139.52 | 49.33 | 3488 | 1233 |
| | 600.79 | 212.41 | 15020 | 5310 |
| Cluster 1 (NWI 71.4%) | 144.31 | 51.02 | 3608 | 1276 |
| Cluster 3 | 505.72 | 178.80 | 12643 | 4470 |
| Cluster 4 (SEI 90.9%) | 111.40 | 39.39 | 2785 | 985 |
| Cluster 5 (SEI 88.9%) | 240.62 | 85.07 | 6016 | 2127 |
| | 206.11 | 72.87 | 5153 | 1822 |
| Cluster 1 (SAR 95.0%) | 48.26 | 17.06 | 1207 | 427 |
| Cluster 2 (SAR 81.3%) | 227.81 | 80.54 | 5695 | 2014 |
| | 137.29 | 48.54 | 3432 | 1214 |
| Cluster 2 (NWI 95.5%) | 199.16 | 70.41 | 4979 | 1760 |
| Cluster 4 (NWI 70.8%) | 184.29 | 65.16 | 4607 | 1629 |
| | 129.67 | 45.85 | 3242 | 1146 |
| Cluster 1 (NWI 90.0%) | 250.32 | 88.50 | 6258 | 2213 |
| Cluster 2 (NWI 72.0%) | 185.52 | 65.59 | 4638 | 1640 |
| Cluster 3 (NWI 71.4%) | 148.55 | 52.52 | 3714 | 1313 |
| | 302.55 | 106.97 | 7564 | 2674 |
| Cluster 1 (NWI 70.6%) | 130.05 | 45.98 | 3251 | 1149 |
The standard deviation (SD) estimator (Sengupta et al. 2006) and the ρ statistic calculator (Soares et al. 2009) were used for Y-chromosome and mtDNA haplogroups, respectively. Ages were estimated for the entire haplogroups as well as for each DAPC cluster with at least 10 individuals and frequencies >70% in NWI, SEI, or SAR (except for G-P15, cluster 2, see Methods).
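In the rows of the age table, each age estimate and its SE appear to equal the corresponding SD-estimator value and SE multiplied by 25, which would be consistent with SD values expressed in generations and a generation time of 25 years. That factor is inferred from the numbers only and is not stated in this section, so the sketch below treats it as an assumption. The helper name `to_years` is hypothetical, and `Decimal` is used so that values ending in .5 round up rather than being affected by binary floating point.

```python
from decimal import Decimal, ROUND_HALF_UP

# Assumption: 25 years per generation, inferred from the table values,
# not stated in the text.
YEARS_PER_GENERATION = 25


def to_years(generations: str) -> int:
    """Convert a value in generations (given as a decimal string) to whole years."""
    years = Decimal(generations) * YEARS_PER_GENERATION
    return int(years.quantize(Decimal("1"), rounding=ROUND_HALF_UP))


if __name__ == "__main__":
    # (SD, SE) pairs copied from three rows of the table.
    for sd, se in [("146.46", "51.78"), ("600.79", "212.41"), ("48.26", "17.06")]:
        print(sd, "->", to_years(sd), "YBP, SE", to_years(se))
```

For the first row this reproduces the tabulated 3662 ± 1295 YBP from an SD of 146.46 and an SE of 51.78.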
Figure 3. Spatial Principal Component Analysis (sPCA) based on frequencies of mtDNA haplogroups.
The first two global components sPC1 (a) and sPC2 (b) are depicted. Positive values are represented by black squares; negative values are represented by white squares; the size of the square is proportional to the absolute value of sPC scores.