| Literature DB >> 28512355 |
Stefania Sarno1, Alessio Boattini2, Luca Pagani3,4, Marco Sazzini2, Sara De Fanti2, Andrea Quagliariello2, Guido Alberto Gnecchi Ruscone2, Etienne Guichard2, Graziella Ciani2, Eugenio Bortolini5,6, Chiara Barbieri7, Elisabetta Cilli8, Rosalba Petrilli2, Ilia Mikerezi9, Luca Sineo10, Miguel Vilar11, Spencer Wells11, Donata Luiselli12, Davide Pettener2.
Abstract
The Mediterranean shores stretching between Sicily, Southern Italy and the Southern Balkans witnessed a long series of migration processes and cultural exchanges. Accordingly, present-day population diversity is composed by multiple genetic layers, which make the deciphering of different ancestral and historical contributes particularly challenging. We address this issue by genotyping 511 samples from 23 populations of Sicily, Southern Italy, Greece and Albania with the Illumina GenoChip Array, also including new samples from Albanian- and Greek-speaking ethno-linguistic minorities of Southern Italy. Our results reveal a shared Mediterranean genetic continuity, extending from Sicily to Cyprus, where Southern Italian populations appear genetically closer to Greek-speaking islands than to continental Greece. Besides a predominant Neolithic background, we identify traces of Post-Neolithic Levantine- and Caucasus-related ancestries, compatible with maritime Bronze-Age migrations. We argue that these results may have important implications in the cultural history of Europe, such as in the diffusion of some Indo-European languages. Instead, recent historical expansions from North-Eastern Europe account for the observed differentiation of present-day continental Southern Balkan groups. Patterns of IBD-sharing directly reconnect Albanian-speaking Arbereshe with a recent Balkan-source origin, while Greek-speaking communities of Southern Italy cluster with their Italian-speaking neighbours suggesting a long-term history of presence in Southern Italy.Entities:
Year: 2017 PMID: 28512355 PMCID: PMC5434004 DOI: 10.1038/s41598-017-01802-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Sampling map showing the approximate geographic location of the 23 newly analysed populations. Sampling points are labelled according to the population name or province as in Supplementary Table S1. The box at the bottom of the figure details the sampling location of the two Greek-speaking groups of Calabria: GRI_BOV includes individuals collected in the municipality of Bova, whereas GRI_CAL includes individuals from the other Greek-speaking villages laying in the Aspromonte mountainous area of Bovesia (see also Supplementary Information). The geographical map has been generated with the package RgoogleMaps [v. 1.4.1] (Loecher, M. & Ropkins, K. RgoogleMaps and loa: Unleashing R Graphics Power on Map Tiles. J. Stat. Softw. 63, 1–18 (2015). URL: http://www.jstatsoft.org/v63/i04/) of the software R [v. 3.2.4] (https://www.r-project.org/).
Figure 2Principal component analysis performed on the extended comparison dataset with ancient samples projected onto the first two PCs. (a) Scatterplot of the first and second PCs for 1,469 modern samples from 68 Euro-Mediterranean populations and 263 projected ancient samples. Ancient individuals are labelled and symbol-coded according to their associated culture, as reported in the legend at the bottom of the plot and detailed in Supplementary Table S3. Modern individuals and median population coordinates (enlarged black-bordered circles) are colour-coded based on their geographic or ethnic affiliation, as in the legend at the top-right and in Supplementary Table S2. (b) A magnification of the plot details the position of the newly analysed Southern Italian and Southern Balkan populations within the observed large-scale genetic diversity.
Figure 3Clustering of the 1,366 modern Euro-Mediterranean individuals into 14 genetic-based clusters as inferred by ChromoPainter/fineSTRUCTURE analysis. At the considered hierarchical level of K = 14, each cluster has at least 10 members. For each of the analysed populations, the relative proportions of inferred genetic clusters are summarized by corresponding pie charts. Cluster names are detailed in the legend at the bottom-left of the plot. The geographical map has been plotted using the R software [v.3.2.4] (R: A Language and Environment for Statistical Computing, R Core Team, R Foundation for Statistical Computing, Vienna, Austria (2016) https://www.R-project.org).
Figure 4Differences in IBD-sharing between Southern Italian and Southern Balkan population groups. Vectors of IBD-sharing with the 35 comparison populations reported on the x-axis were pairwise subtracted between pairs of Southern Balkan and/or Southern Italian population groups. The plots at the bottom of the figure provide a schematic representation of the pairwise-subtracting procedure (left: IBD-sharing of a Southern Balkan group with all comparison populations; centre: IBD-sharing of a Southern Italian group with all comparison populations; right: difference in IBD-sharing between the two considered groups). Comparison populations for which we observed differences in IBD-sharing between the two tested groups exceeding the lower (0.10 percentile) or the upper (0.90 percentile) bounds of the distribution are marked by coloured boxes in the table (green: Grp1 > Grp2, red Grp2 > Grp1). Significant differences in IBD-sharing, after the grubbs.test for significance (R software package outliers [v. 0.14] Komsta, L. Outliers: Tests for outliers. R package version 0.14. URL: https://CRAN.R-project.org/package=outliers (2011)), are indicated by corresponding p-values (* P-value < 0.05, **P-value < 0.01, ***P-value < 0.001). Southern Italian and Southern Balkan compared population groups were considered as follow: SSI-Extreme (Apulia-LE, Basilicata-MT and the westernmost province of Sicily-TP); SSI-Central (including the remaining populations of Central-Eastern Sicily and Calabria); Mediterranean Greek-speaking islands (Crete, Cyprus and Anatolian/Dodecanese Greece); Continental Southern Balkan populations (Albania, North-Central Greece and Peloponnesus). The geographical map has been plotted using the R software [v.3.2.4] (R: A Language and Environment for Statistical Computing, R Core Team, R Foundation for Statistical Computing, Vienna, Austria (2016) https://www.R-project.org).
Figure 5Inferred events and times of admixture. All possible trios of the 14 genetic-based clusters identified by ChromoPainter/fineSTRUCTURE analysis were tested for admixture with the f3-statistics and dated with Alder. Admixture events resulting statistically significant for both f3 and Alder are detailed in Supplementary Table S7 and visually summarized in this time-line plot following the same listing order. Each vertical coloured line indicates an admixture event for the corresponding genetic cluster; points and bars-width reflect the estimated dates of admixture and relative SE. Different symbol-codes have been used based on the regional ancestry of the pairs of source populations involved in each admixture event (as in the legend at the top-right). Admixture events involving Southern Italian and Southern Balkan clusters are included in the grey box.