Literature DB >> 21637506

Analysis of 31 STR loci in the genetic isolate of Carloforte (Sardinia, Italy).

Renato Robledo1, Ignazio Piras, William Beggs, Carla Calò.   

Abstract

The genotypes of 31 autosomal short tandem repeat loci in the population of Carloforte were analyzed, these representing a linguistic and genetic isolate located on the island of Sardinia (Italy). The markers span the entire length of chromosomes 19, 20, 21 and 22. Allele frequencies and statistical parameters were presented for all loci. Observed heterozygosity ranged from 0.279 to 0.884, and polymorphism information content from 0.552 to 0.886. All but two loci showed Hardy-Weinberg equilibrium after Bonferroni correction. The 31 short tandem repeat loci examined in the present work provide additional data on the genetic structure of the Carloforte population.

Entities:  

Keywords:  Carloforte; Sardinia; autosomal STRs; genotyping; population data

Year:  2009        PMID: 21637506      PMCID: PMC3036058          DOI: 10.1590/S1415-47572009005000057

Source DB:  PubMed          Journal:  Genet Mol Biol        ISSN: 1415-4757            Impact factor:   1.771


Carloforte is the only village located on the small island of San Pietro, off the southwestern coast of Sardinia. The history of the Carloforte population dates back to the 16th century, when a number of families emigrated from Pegli, a small village in Liguria (Italy), now part of the city of Genova, to Tabarka, an island off the coast of Tunisia. During their time in Tabarka, a period of about two centuries, the Pegli community developed a successful business in tuna and coral fishing. At the beginning of the 18th century, due to a decline in business and worsening relationship with the Bey of Tunis and Algiers, the community migrated to Sardinia and settled on the deserted island of San Pietro, where they founded Carloforte in 1738 (Vallebona, 1974). For about 10 generations, these Genovese migrants had very little contact with the mainland populations of both Tunisia and Sardinia, maintaining a separate cultural as well as genetic identity. The cultural aspect is evident in the Pegli dialect, which is still spoken nowadays, making the Carloforte population a linguistic isolate (Vona ). Earlier studies based on matrimonial structure, classical genetic markers and the incidence of a specific disease provided evidence that Carloforte is also a genetic isolate (Vona et al.., 1996; Heath ). In this paper, we present further data on the genetic structure of the Carloforte population by reporting on the distribution of 31 micro-satellite DNA loci. The markers employed were CA di-nucleotide repeats clustered in four linkage groups, representing chromosomes 19, 20, 21 and 22. Historical documents list the names of the first settlers. Individuals selected for the present study were proven to be descendants of the village founders. Moreover, the participants were chosen for not having ancestors in common, at least up to the grandparental generation. Both criteria were ascertained by a complete genealogical search through the detailed family information available from City Hall records (Siniscalco ; Heath ). About 10 mL of peripheral blood were collected from 50 voluntary participants, in vacuum tubes containing EDTA as anticoagulant. Permanent cultures of EBV-transformed B-lymphocytes were prepared and stored anonymously in the Coriell Repository. DNA was extracted by standard laboratory procedures. The study was approved by local ethics committees and all voluntary participants read and signed an informed consent form, in accordance with Declaration of Helsinki guidelines. Individuals were genotyped using the ABI Prism Linkage Mapping Panel, version 2. The markers were amplified as single reactions and products pooled for analysis. About 50 ng of genomic DNA were amplified in a final PCR volume of 15 μL containing 1.5 μL 10x GeneAmp PCR buffer II, 1.5 μL 25 mM MgCl2, 1.5 μL 2.5 mM dNTP mix, 1.0 μL Primer Mix and 0.12 μL AmpliTaq Gold DNA Polymerase, with an ABI GeneAmp PCR System 9700 Thermal Cycler. PCR reactions were amplified by using the following PCR conditions: a 12 min hold at 95 °C was followed by 10 cycles of 15 s at 94 °C, 15 s at 55 °C, 30 s at 72 °C and 20 additional cycles of 15 s at 89 °C, 15 s at 55 °C and 30 s at 72 °C, with a final extension of 10 min at 72 °C. The amplified PCR products were pooled at a ratio of 5 μL of each FAM labeled product and 10 μL of each HEX and NED labeled one. D.I. water was added to a final volume of 100 ul. The pooled PCR product was mixed with loading buffer at a 2:3 ratio for a 5 μL final volume (loading buffer contained 4.5 μL formamide, 1.0 μL blue dextran and 1.5 μL of S400HD-ROX as an internal size standard). The mixture was heated at 95 °C for 3 min and cooled on ice for 3 min, before being separated on an ABI 377 Sequencer, using a 36 cm denaturing polyacrylamide gel for resolution of di-nucleotide repeat products. Separated allele fragments were analyzed using ABI GeneScan and Genotyper software and genotypes scored. Allele frequencies were estimated by gene counting. Both observed and expected heterozygosity were calculated with the GENETIX program (v. 4.0). Deviation from Hardy-Weinberg equilibrium (HWE) was tested by the Markov chain, using the GENEPOP program (v. 4.0). Statistical parameters of population genetics and forensic interest, namely polymorphism information content (PIC), power of discrimination (PD), and power of exclusion (PE), were calculated using Power Stats package v. 1.2 (Promega Corporation, USA). To verify whether the population had incurred a recent bottleneck, data were analysed using the BOTTLENECK program, v 1.2 (Cornuet and Luikart, 1996). This program allows for the evaluation of observed and expected heterozygosity, and determines statistical difference based on equilibrium mutation/genetic drift. This was obtained through simulating the coalescent process of n genes under the two-phase mutation model (TPM), which was proved to better fit micro-satellite analysis (Di Rienzo ). The TPM mainly consists of one-step mutations, with a small percentage (5%-10%) of multi-step changes. Allelic frequencies of the 31 STR loci tested in the population of Carloforte are reported in Table S1. Hardy-Weinberg equilibrium (HWE) and population parameters are shown in Table 1.
Table 1

Heterozygosity, number of alleles detected and HWE evaluation (indicated as probability values) of 31 investigated markers in the Carloforte population.

LocusObserved heteroz.Expected heteroz.N. of allelesHWE
D19S2160.6190.67180.722
D19S8840.6830.867100.020
D19S2260.8290.828140.909
D19S4140.7500.75980.551
D19S2200.7500.798100.197
D19S4200.7270.74190.154
D19S9020.8100.75980.865
D19S5710.7380.79880.009
D19S4180.7270.69770.124
D19S2100.6410.70750.149
D20S8890.7690.825140.742
D20S1150.6140.64640.249
D20S1860.8640.840100.325
D20S1120.7140.753110.338
D20S1950.7250.843110.116
D20S1070.8410.82990.840
D20S1190.6980.76460.337
D20S1780.8780.85180.862
D20S1960.7440.819100.622
D20S1000.6980.768100.220
D20S1730.3160.58160.000
D21S12560.2790.73670.000
D21S19140.6180.86580.001
D21S12520.8640.84280.568
D21S2660.6980.727100.720
D22S4200.5900.73370.069
D22S3150.7380.743100.943
D22S2800.8370.80080.308
D22S2830.8840.895110.828
D22S4230.8640.823110.705
D22S2740.8600.77370.125
The 31 loci showed a high degree of heterozygosity, varying from 27.9% for marker D21S1256 to 88.4% for marker D22S283. The exact test based on Markov chain analysis revealed that 6 out of 31 markers did not meet Hardy-Weinberg equilibrium, whereas after Bonferroni correction, only two of these, D20S173 and D21S1256, showed an excess of observed homozygotes. The high degree of isolation of the Carloforte population, together with the reported deviation from Hardy Weinberg expectation, prompted us to verify the possibility of a recent bottleneck in Carloforte through the BOTTLENECK program. Under the assumption of the two-phase model of mutation (TMP), the Wilcoxon test revealed non-significant heterozygosity (p = 0.459), thereby indicating that, following initial colonization events, there were no further contractions in the size of the Carloforte population, this being in agreement with historical records. Moreover, the mode shape test showed an L-shaped distribution, which is also consistent with the absence of a recent bottleneck (data not shown). The high level of heterozygosity in a genetic isolate like Carloforte, with a long history of endogamous and consanguineous marriages, may at first be surprising. We believe that the sampling strategy accounts for the findings: differences among individuals were maximized by selecting participants with no ancestor in common, at least up to the grandparental generation. This could be very important in association studies, where it is necessary to analyze large numbers of individuals in order to detect statistically significant deviations in allelic distribution between cases and matched controls. In a previous study, on applying the sampling strategy as described above, we genotyped 55 individuals from the Carloforte population at 5 unlinked micro-satellite loci in order to obtain an accurate description of their genomic profile. We showed that allele frequencies at all loci were practically the same down to a subset of 20 individuals (Siniscalco ). Therefore, once a breeding unit is identified through the recording of marriage patterns over the last five generations, a small sampling of contemporary descendants is still representative of its founder group, provided that the individuals selected had their ancestors all born in the same breeding unit, but unrelated for, at least, the last two generations. An obvious consequence of the proposed sampling strategy is a reduction in the cost of association studies, due to a lower number of individuals having to be genotyped. Finally, Table 2 shows parameters of forensic interest. The combined power of both discrimination and exclusion were absolutely discriminating, their values being, respectively, 7.7 x 10-33 and 0.999999999918. It has been reported that genetic isolation could reduce the capacity for paternity exclusion, without significantly affecting the power of discrimination (de Pancorbo ). This could be a critical issue in forensics, for example when setting up a database from a small village or population. In our case, the power of exclusion was still very high, again probably as a result of the sampling strategy. The combined matching probability for the 31 STRs was 0.9999999999923. Therefore, the 31 STR loci examined in the present work, specifically designed for population genetics studies, also turned out to be suitable for general forensic applications.
Table 2

Statistical relevant data for 31 investigated markers in the Carloforte population. MP: matching probability, PD: power of discrimination, PIC: polymorphism information content, PE: power of exclusion.

LocusMPPDPICPE
D19S2160.1620.8380.6160.314
D19S8840.0490.9510.8520.402
D19S2260.0530.9470.8100.654
D19S4140.1120.8880.7230.510
D19S2200.0810.9190.7740.510
D19S4200.1200.8800.7140.472
D19S9020.0920.9080.7350.617
D19S5710.0990.9010.7680.490
D19S4180.1460.8540.6690.472
D19S2100.1430.8570.6660.343
D20S8890.0550.9450.8090.543
D20S1150.2040.7960.5770.308
D20S1860.0690.9310.8200.722
D20S1120.1010.8990.7240.451
D20S1950.0600.9400.8260.468
D20S1070.0630.9370.8060.677
D20S1190.1000.9000.7260.425
D20S1780.0550.9450.8330.751
D20S1960.0620.9380.7950.500
D20S1000.0940.9060.7400.425
D20S1730.2590.7410.5520.070
D21S12560.1580.8420.6920.055
D21S19140.0570.9430.8490.313
D21S12520.0620.9380.8220.722
D21S2660.0950.9050.7050.425
D22S4200.1220.8780.6900.279
D22S3150.0950.9050.7140.490
D22S2800.0970.9030.7730.670
D22S2830.0350.9650.8860.762
D22S4230.0630.9370.8010.722
D22S2740.1200.8800.7390.716

Supplementary Material

The following online material is available for this article: This material is available as part of the online article from http://www.scielo.br/gmb.

Table S1

Allele frequencies of the 31 STR loci in the population of Carloforte.
  5 in total

1.  Genetics, geography, and culture: the population of S. Pietro Island (Sardinia, Italy).

Authors:  G Vona; C M Calò; G Lucia; G E Mameli; V Succa; E Esteban; P Moral
Journal:  Am J Phys Anthropol       Date:  1996-08       Impact factor: 2.868

2.  Description and power analysis of two tests for detecting recent population bottlenecks from allele frequency data.

Authors:  J M Cornuet; G Luikart
Journal:  Genetics       Date:  1996-12       Impact factor: 4.562

3.  A novel approach to search for identity by descent in small samples of patients and controls from the same mendelian breeding unit: a pilot study on myopia.

Authors:  S Heath; R Robledo; W Beggs; G Feola; C Parodo; A Rinaldi; L Contu; D Dana; D Stambolian; M Siniscalco
Journal:  Hum Hered       Date:  2001       Impact factor: 0.444

4.  Population genomics in Sardinia: a novel approach to hunt for genomic combinations underlying complex traits and diseases.

Authors:  M Siniscalco; R Robledo; P K Bender; C Carcassi; L Contu; J C Beck
Journal:  Cytogenet Cell Genet       Date:  1999

5.  Mutational processes of simple-sequence repeat loci in human populations.

Authors:  A Di Rienzo; A C Peterson; J C Garza; A M Valdes; M Slatkin; N B Freimer
Journal:  Proc Natl Acad Sci U S A       Date:  1994-04-12       Impact factor: 11.205

  5 in total
  1 in total

1.  Genetic sub-structuring of Croatian island populations in the Southeastern European context: a meta-analysis.

Authors:  Natalija Novokmet; Ana Galov; Vedrana Škaro; Petar Projić; Jelena Šarac; Dubravka Havaš Auguštin; Pavao Rudan; Dragan Primorac; Damir Marjanović
Journal:  Croat Med J       Date:  2022-06-22       Impact factor: 2.415

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.