| Literature DB >> 23249956 |
Tõnu Esko1, Massimo Mezzavilla, Mari Nelis, Christelle Borel, Tadeusz Debniak, Eveliina Jakkula, Antonio Julia, Sena Karachanak, Andrey Khrunin, Peter Kisfali, Veronika Krulisova, Zita Aušrelé Kučinskiené, Karola Rehnström, Michela Traglia, Liene Nikitina-Zake, Fritz Zimprich, Stylianos E Antonarakis, Xavier Estivill, Damjan Glavač, Ivo Gut, Janis Klovins, Michael Krawczak, Vaidutis Kučinskas, Mark Lathrop, Milan Macek, Sara Marsal, Thomas Meitinger, Béla Melegh, Svetlana Limborska, Jan Lubinski, Aarno Paolotie, Stefan Schreiber, Draga Toncheva, Daniela Toniolo, H-Erich Wichmann, Alexander Zimprich, Mait Metspalu, Paolo Gasparini, Andres Metspalu, Pio D'Adamo.
Abstract
Population genetic studies on European populations have highlighted Italy as one of genetically most diverse regions. This is possibly due to the country's complex demographic history and large variability in terrain throughout the territory. This is the reason why Italy is enriched for population isolates, Sardinia being the best-known example. As the population isolates have a great potential in disease-causing genetic variants identification, we aimed to genetically characterize a region from northeastern Italy, which is known for isolated communities. Total of 1310 samples, collected from six geographically isolated villages, were genotyped at >145000 single-nucleotide polymorphism positions. Newly genotyped data were analyzed jointly with the available genome-wide data sets of individuals of European descent, including several population isolates. Despite the linguistic differences and geographical isolation the village populations still show the greatest genetic similarity to other Italian samples. The genetic isolation and small effective population size of the village populations is manifested by higher levels of genomic homozygosity and elevated linkage disequilibrium. These estimates become even more striking when the detected substructure is taken into account. The observed level of genetic isolation in Friuli-Venezia Giulia region is more extreme according to several measures of isolation compared with Sardinians, French Basques and northern Finns, thus proving the status of an isolate.Entities:
Mesh:
Year: 2012 PMID: 23249956 PMCID: PMC3658181 DOI: 10.1038/ejhg.2012.229
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Figure 1Map showing approximate location of analyzed Italian samples and revealed population structure in analyzed populations. (a) A geographical map of Italy and shown are the approximate sampling regions. (b) A detailed geographical map of FVG and shown are the approximate location of the six villages. (c) Ancestry proportions of the studied 1008 individuals from 39 European and Near-Eastern populations (including the six FVG village populations) as revealed by the ADMIXTURE program[29] with K=2 to K=10. A stacked column of the K proportions represents each individual, with fractions indicated on the y axis. From all non-FVG populations a subset of 24 randomly drawn individuals (if applicable) was used.
Figure 2Model-based mapping convergence with SPA. Label position indicates the (a) specific PC1 and PC2 coordinate values for each individual and (b) the mean PC1 and PC2 coordinate values for each population. For (a, b), the colors have a following meaning: (1) dark blue color: a homogeneous fraction of the FVG population; a blue color: more general fraction of the FVG population; a red color: other Italian samples; a violet color: Basques; an orange color: Slovenians; and green color: all other populations. For (a, b), the following population abbreviation labels are used: AT, Austrians; BA, French Basques; BG, Bulgarians; BO, Borbera; CA, Carlantino; CL, Clauzetto; CH, Swiss; CZ, Czechs; GR, Germans; ER, Erto; ES, Spaniards; FR, French; HU, Hungarians; IL, Illegio; IT, Italians; JW_A, Ashkenazy Jews; JW_S, Sephardic Jews; OR, Orcadians; RE, Resia; RO, Romanians; SA, Sardinians; SA_, Sauris; SMC, San Martino del Carso; SI, Slovenians; TU, Tuscans. The extra abbreviations: N, northern; S, southern; I, a more homogeneous sub-population; G, a more general sub-population.
Figure 3Predicted cluster membership for each individual from DAPCs. Model optimum was found on K=7 and individuals are assigned to best-fitting cluster. The size of the rectangle scales with the number of assigned samples. In population names: N, northern; S, southern; I, a more homogeneous sub-population; G, a more general sub-population.
Figure 4Genome-wide LD length based on 145 000 SNPs. Each line represents the LD decay averaged across populations and sub-populations and LD (r2) between SNPs shown in 5 kb bins. Sub-populations drawn from same population have the same color coding. A dashed line represents the more general sub-populations and the solid line with rectangles represents the more homogeneous sub-populations respectively. A solid thin line represents the European reference populations (Estonians, Slovenians and Swiss). The LD extent in reference population isolates (Sardinians, French Basques and northern Finns) is also shown with a solid line with rectangles. In population names: I, a more homogeneous sub-population; G, a more general sub-population.
Figure 5The genomic runs of homozygosity (gROH) based on 145 000 SNPs. (a) Distribution of population mean homozygous segments and mean count of gROH>1.5 Mb per sample. (b) Zoom in view for the region indicated with a dashed ellipse on (a). FVG sub-populations are indicated in bold letters. In population names: I, a more homogeneous sub-population; G, a more general sub-population.