| Literature DB >> 26130483 |
Stefania Sarno1, Sergio Tofanelli2, Sara De Fanti1, Andrea Quagliariello1, Eugenio Bortolini1, Gianmarco Ferri3, Paolo Anagnostou4,5, Francesca Brisighelli6,7, Cristian Capelli6, Giuseppe Tagarelli8, Luca Sineo9, Donata Luiselli1, Alessio Boattini1, Davide Pettener1.
Abstract
The relationship between genetic and linguistic diversification in human populations has been often explored to interpret some specific issues in human history. The Albanian-speaking minorities of Sicily and Southern Italy (Arbereshe) constitute an important portion of the ethnolinguistic variability of Italy. Their linguistic isolation from neighboring Italian populations and their documented migration history, make such minorities particularly effective for investigating the interplay between cultural, geographic and historical factors. Nevertheless, the extent of Arbereshe genetic relationships with the Balkan homeland and the Italian recipient populations has been only partially investigated. In the present study we address the genetic history of Arbereshe people by combining highly resolved analyses of Y-chromosome lineages and extensive computer simulations. A large set of slow- and fast-evolving molecular markers was typed in different Arbereshe communities from Sicily and Southern Italy (Calabria), as well as in both the putative Balkan source and Italian sink populations. Our results revealed that the considered Arbereshe groups, despite speaking closely related languages and sharing common cultural features, actually experienced diverging genetic histories. The estimated proportions of genetic admixture confirm the tight relationship of Calabrian Arbereshe with modern Albanian populations, in accordance with linguistic hypotheses. On the other hand, population stratification and/or an increased permeability of linguistic and geographic barriers may be hypothesized for Sicilian groups, to account for their partial similarity with Greek populations and their higher levels of local admixture. These processes ultimately resulted in the differential acquisition or preservation of specific paternal lineages by the present-day Arbereshe communities.Entities:
Mesh:
Year: 2015 PMID: 26130483 PMCID: PMC4929864 DOI: 10.1038/ejhg.2015.138
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Figure 1Sampling map showing the location of Arbershe groups, Balkan source and Italian recipient populations. The enlarged box at the top-left details the communities decomposition for each of the two Arbereshe groups. The table at the bottom of the plot outlines the sampling structure and the number of samples analyzed for each population. Population codes as in Supplementary Table S1.
Figure 2Correspondence Analysis (CA) based on Y-chromosome haplogroup frequencies. Population codes as in Supplementary Table S1. Symbols and color codes as in the legends at the top-left. Abbreviations: ARB_CAL, Arbereshe of Calabria; ARB_SIC, Arbereshe of Sicily; SSI, Sicilians and Southern-Italians; ALB, Albanians; GRE, Greeks.
Fitting of observed and expected values of the DHS statistics for STR and SNP genetic systems under a stationary model
| SNP | Tosks | 0.0809 | 0.1957 | 0.3054 | 0.4993 | ||
| 0.0291 | 0.0828 | 0.1441 | 0.2527 | ||||
| Ghegs | 0.0687 | 0.1459 | 0.2165 | 0.4209 | |||
| 0.0255 | 0.0702 | 0.0626 | 0.1348 | ||||
| Tosks–Greek | 0.1187 | 0.2441 | 0.3664 | 0.5592 | |||
| 0.0524 | 0.1239 | 0.1988 | 0.3409 | ||||
| STR | Tosks | 0.6751 | 0.7967 | 0.9252 | 0.9594 | ||
| 0.5726 | 0.7483 | 0.8796 | 0.9140 | ||||
| Ghegs | 0.6211 | 0.7448 | 0.8187 | 0.9379 | |||
| 0.5468 | 0.6845 | 0.8093 | 0.8629 | ||||
| Tosks–Greek | 0.8415 | 0.8973 | 0.9491 | 0.9659 | 0.9815 | ||
| 0.7988 | 0.8624 | 0.9183 | 0.9406 | 0.9619 | |||
| SNP | Tosks | 0.0060 | 0.0236 | 0.0681 | 0.1079 | ||
| 0.0000 | 0.0023 | 0.0188 | 0.0288 | ||||
| Ghegs | 0.0105 | 0.0267 | 0.0652 | 0.0773 | 0.1841 | ||
| 0.0010 | 0.0055 | 0.0229 | 0.0313 | 0.0421 | |||
| Tosks–Greek | 0.0140 | 0.0337 | 0.0845 | 0.1401 | |||
| 0.0021 | 0.0116 | 0.0356 | 0.0564 | ||||
| STR | Tosks | 0.4861 | 0.6105 | 0.7649 | 0.8316 | 0.8987 | |
| 0.3988 | 0.5591 | 0.7088 | 0.7833 | 0.8504 | |||
| Ghegs | 0.4379 | 0.5810 | 0.7038 | 0.7760 | 0.8537 | ||
| 0.3128 | 0.5319 | 0.6475 | 0.7123 | 0.7831 | |||
| Tosks–Greek | 0.6732 | 0.7805 | 0.8810 | 0.9188 | 0.9490 | ||
| 0.5562 | 0.7428 | 0.8472 | 0.8889 | 0.9224 | |||
A percent contribution of local versus putative source population was accepted as the most likely model when the observed value fell within a two SD interval (in bold) of the simulated distribution at time t=20 generations.
Figure 3Admixture-like barplots for the Arbereshe groups (a) and the single-Arbereshe communities (b). The inferred cluster affiliation of each population (mclust algorithm) and the corresponding cluster's color code (1: black; 2: gray; 3: white) are represented by numbers (within colored squares) at the top of each bar. The probability (DAPC-based posterior membership probabilities) of each population belonging to the inferred clusters is represented by vertical bars in the plot. Abbreviations: ARB_CAL, Arbereshe of Calabria; ARB_SIC, Arbereshe of Sicily; CAL, Italians of Calabria; SIC, Italians of Sicily; GRE, Greeks; TOSK, Tosks. Codes for single-Arbereshe communities as in Supplementary Table S1.
Figure 4Plots of observed and expected DHS values for fitting scenarios under a stationary model. Evolutionary scenarios with the three putative source populations (Tosks, Ghegs and a 50% random blend of Tosks and Greeks) were compared for those models (bold in Table 1) showing at least one fit in the simulations results: (a) Calabria SNPs 5%, (b) Sicily SNPs 50% and (c) Calabria STRs 20–30%. The SD intervals of the simulated distributions are represented by vertical bars. Observed DHS values are represented by symbols filled in black or white depending on whether or not they match the simulated intervals. Squares and diamonds stand for Calabrian and Sicilian Arbereshe, respectively.