| Literature DB >> 27441366 |
Ruth Barral-Arca1,2, Sara Pischedda1,2, Alberto Gómez-Carballa1,2,3, Ana Pastoriza1, Ana Mosquera-Miguel1, Manuel López-Soto4, Federico Martinón-Torres3,5, Vanesa Álvarez-Iglesias1, Antonio Salas1,2.
Abstract
The Iberian Peninsula has been the focus of attention of numerous studies dealing with mitochondrial DNA (mtDNA) variation, most of them targeting the control region segment. In the present study we sequenced the control region of 3,024 Spanish individuals from areas where available data were still limited. We also compiled mtDNA haplotypes from the literature involving 4,588 sequences and 28 population groups or small regions. We meta-analyzed all these data in order to shed further light on patterns of geographic variation, taking advantage of the large sample size and geographic coverage, in contrast with the atomized sampling strategy of previous work. The results indicate that the main mtDNA haplogroups show primarily clinal geographic patterns across the Iberian geography, roughly along a North-South axis. Haplogroup HV0 (where haplogroup U is nested) is more prevalent in the Franco Cantabrian region, in good agreement with previous findings that identified this area as a climate refuge during the Last Glacial Maximum (LGM), prior to a subsequent demographic re-expansion towards Central Europe and the Mediterranean. Typical sub-Saharan and North African lineages are slightly more prevalent in South Iberia, although at low frequencies; this pattern has been shaped mainly by the transatlantic slave trade and the Arab invasion of the Iberian Peninsula. The results also indicate that summary statistics that aim to measure molecular variation, or AMOVA, have limited sensitivity to detect population substructure, in contrast to patterns revealed by phylogeographic analysis. Overall, the results suggest that mtDNA variation in Iberia is substantially stratified. These patterns might be relevant in biomedical studies given that stratification is a common cause of false positives in case-control mtDNA association studies, and should be also considered when weighting the DNA evidence in forensic casework, which is strongly dependent on haplotype frequencies.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27441366 PMCID: PMC4956223 DOI: 10.1371/journal.pone.0159735
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Map showing the location of the main Iberian regions analyzed in the present study.
The pie charts show the frequency values for the main haplogroup categories in these main Iberian regions. Note that the category “Other” in the pie charts denotes an artificial polyphyletic cluster (paragroup). Other haplogroup labels represent the following haplogroup categories: L = L(× M,N); N = N(×R,I,X,W).
Summary statistics of HVS-I sequences in Spain and Portugal (excluding islands) carried out on main geographic regions.
All the computations were carried out on the common sequence segment of the HVS-I that ranges from 16024 to 16362. Population codes are as follows: a) Cantabrian coast = Galicia + Asturias + Cantabria + Basque Country + Navarra; b) Iberian Plateau = Castilla y León + Castilla La Mancha + La Rioja + Extremadura + Madrid + Aragón; c) Mediterranean coast = Murcia + Valencia + Catalonia; d) Andalusia; e) North Portugal = Viana do Castelo + Braga + Vila Real + Porto + Braganca; f) Centre Portugal = Aveiro + Viseu + Guarda + Coimbra + Castelo Branco + Leiria + Lisboa + Santarem + Portalegre; and g) South Portugal = Setubal + Evora + Beja + Faro. Note that sum of sample sizes of population sub-sets do not fully match with the total numbers mentioned above (4,588 from the literature + 3024 newly generated in the present study) because there was not enough geographic information for a few of them (the same applies to Tables 2 and 3).
| Population | k | k/ | S | H ± SE | Π ± SE | M | ||
|---|---|---|---|---|---|---|---|---|
| 1520 | 449 | 0.30 | 152 | 180 | 0.9639±0.0030 | 0.01132±0.00103 | 3.83 | |
| 1336 | 489 | 0.37 | 144 | 164 | 0.9723±0.0029 | 0.01218±0.00200 | 4.10 | |
| 1063 | 430 | 0.41 | 175 | 220 | 0.9677±0.0050 | 0.01421±0.00093 | 4.84 | |
| 1253 | 463 | 0.37 | 175 | 213 | 0.9720±0.0030 | 0.01286±0.00037 | 4.36 | |
| 190 | 110 | 0.58 | 81 | 86 | 0.9460±0.0130 | 0.01288±0.00070 | 4.37 | |
| 253 | 142 | 0.56 | 96 | 103 | 0.9630±0.0080 | 0.01324±0.00060 | 4.49 | |
| 149 | 91 | 0.61 | 72 | 74 | 0.9560±0.0130 | 0.01363±0.00098 | 4.62 | |
| 6021 | 1344 | 0.22 | 232 | 351 | 0.9675±0.0010 | 0.01216±0.00080 | 4.07 | |
| 1591 | 478 | 0.30 | 168 | 207 | 0.9650±0.0030 | 0.01402±0.00095 | 4.74 |
n = Sample Size; k = Number of different haplotypes; S = number of polymorphic (segregating sites); Nmut = total number of mutations; H = haplotype diversity and standard error; π = nucleotide diversity and standard error; M = Average number of nucleotide differences
Summary statistics of HVS-I sequences in Spain and Portugal (excluding islands) carried out on main political regions (“Autonomous Regions”).
All the computations were carried out on the common sequence segment of the HVS-I that ranges from 16024 to 16362.
| Population | k | k/ | S | H ± SE | Π ± SE | M | ||
|---|---|---|---|---|---|---|---|---|
| 1253 | 463 | 0.37 | 175 | 213 | 0.972±0.003 | 0.01286±0.00037 | 4.36 | |
| 78 | 55 | 0.71 | 56 | 56 | 0.969±0.013 | 0.01132±0.00092 | 3.84 | |
| 112 | 72 | 0.64 | 63 | 65 | 0.978±0.008 | 0.00800±0.00074 | 3.81 | |
| 273 | 128 | 0.47 | 89 | 91 | 0.974±0.006 | 0.01314±0.00049 | 4.45 | |
| 343 | 153 | 0.44 | 107 | 114 | 0.965±0.006 | 0.01481±0.00052 | 5.02 | |
| 438 | 133 | 0.30 | 122 | 139 | 0.968±0.004 | 0.01201±0.00082 | 4.07 | |
| 671 | 304 | 0.45 | 124 | 139 | 0.971±0.004 | 0.01222±0.00034 | 4.15 | |
| 256 | 146 | 0.57 | 103 | 106 | 0.976±0.006 | 0.01295±0.00059 | 4.39 | |
| 731 | 417 | 0.44 | 195 | 205 | 0.985±0.002 | 0.01480±0.00055 | 5.27 | |
| 200 | 121 | 0.61 | 84 | 88 | 0.970±0.008 | 0.01215±0.00060 | 4.12 | |
| 92 | 69 | 0.76 | 60 | 64 | 0.980±0.009 | 0.01276±0.00080 | 4.33 | |
| 548 | 256 | 0.47 | 161 | 197 | 0.961±0.006 | 0.01267±0.00090 | 4.30 | |
| 218 | 121 | 0.56 | 87 | 80 | 0.972±0.006 | 0.01190±0.00060 | 4.03 | |
| 134 | 83 | 0.62 | 62 | 63 | 0.962±0.012 | 0.01227±0.00075 | 4.16 | |
| 137 | 56 | 0.41 | 126 | 131 | 0.957±0.009 | 0.01576±0.00341 | 5.34 | |
| 285 | 111 | 0.39 | 77 | 77 | 0.942±0.010 | 0.00940±0.00043 | 3.19 | |
| 20 | 17 | 0.85 | 21 | 21 | 0.982±0.026 | 0.01045±0.00118 | 3.54 |
n = Sample Size; k = Number of different haplotypes; S = number of polymorphic (segregating sites); Nmut = total number of mutations; H = haplotype diversity and standard error; π = nucleotide diversity and standard error; M = Average number of nucleotide differences
Summary statistics of HVS-I sequences in Spain (including islands) carried out on Spanish provinces.
All the computations were carried out on the common sequence segment that ranges from 16024 to 16362.
| Population | K | k/ | S | H ± SE | Π ± SE | M | ||
|---|---|---|---|---|---|---|---|---|
| 7 | 5 | 0.71 | 10 | 10 | 0.905±0.103 | 0.01011±0.00338 | 3.43 | |
| 7 | 7 | 1.00 | 11 | 11 | 1.000±0.076 | 0.01096±0.00173 | 3.71 | |
| 45 | 35 | 0.78 | 44 | 44 | 0.982±0.982 | 0.01241±0.00120 | 4.21 | |
| 51 | 36 | 0.71 | 41 | 42 | 0.971±0.013 | 0.01110±0.00113 | 3.76 | |
| 66 | 43 | 0.65 | 48 | 52 | 0.979±0.009 | 0.01427±0.00119 | 4.84 | |
| 112 | 72 | 0.64 | 63 | 65 | 0.978±0.008 | 0.00800±0.00074 | 3.81 | |
| 35 | 26 | 0.74 | 40 | 42 | 0.950±0.030 | 0.01075±0.00140 | 6.24 | |
| 46 | 41 | 0.89 | 40 | 42 | 0.989±0.989 | 0.01246±0.00095 | 4.22 | |
| 273 | 128 | 0.47 | 89 | 91 | 0.974±0.006 | 0.01314±0.00049 | 4.45 | |
| 280 | 164 | 0.59 | 123 | 140 | 0.973±0.006 | 0.00600±0.00177 | 5.33 | |
| 9 | 9 | 1.00 | 17 | 17 | 1.000±0.052 | 0.01295±0.00249 | 4.39 | |
| 37 | 28 | 0.76 | 40 | 40 | 0.964±0.022 | 0.01154±0.00159 | 3.91 | |
| 44 | 37 | 0.84 | 46 | 49 | 0.982±0.982 | 0.01363±0.01363 | 4.62 | |
| 101 | 66 | 0.65 | 58 | 60 | 0.977±0.008 | 0.01285±0.00087 | 4.36 | |
| 438 | 133 | 0.30 | 122 | 139 | 0.968±0.004 | 0.01201±0.00082 | 4.07 | |
| 22 | 17 | 0.77 | 25 | 25 | 0.974±0.022 | 0.01208±0.00173 | 4.10 | |
| 49 | 39 | 0.80 | 49 | 50 | 0.984±0.010 | 0.01413±0.00173 | 4.79 | |
| 214 | 117 | 0.55 | 128 | 142 | 0.972±0.006 | 0.01324±0.00186 | 4.49 | |
| 40 | 33 | 0.83 | 43 | 43 | 0.964±0.023 | 0.01129±0.00124 | 3.83 | |
| 49 | 40 | 0.82 | 42 | 42 | 0.980±0.013 | 0.01243±0.00108 | 4.21 | |
| 11 | 9 | 0.82 | 16 | 16 | 0.964±0.051 | 0.01148±0.00199 | 3.89 | |
| 39 | 25 | 0.64 | 33 | 33 | 0.950±0.024 | 0.01010±0.00111 | 3.42 | |
| 133 | 81 | 0.61 | 67 | 69 | 0.978±0.007 | 0.01157±0.00068 | 3.92 | |
| 27 | 25 | 0.93 | 32 | 32 | 0.991±0.014 | 0.01434±0.00151 | 4.86 | |
| 66 | 40 | 0.61 | 47 | 48 | 0.961±0.016 | 0.01329±0.00117 | 4.50 | |
| 27 | 21 | 0.78 | 35 | 35 | 0.966±0.025 | 0.01163±0.00157 | 3.94 | |
| 154 | 95 | 0.61 | 80 | 86 | 0.957±0.012 | 0.01242±0.00073 | 4.21 | |
| 162 | 86 | 0.53 | 70 | 74 | 0.939±0.016 | 0.01071±0.00069 | 3.63 | |
| 72 | 46 | 0.64 | 51 | 52 | 0.948±0.020 | 0.01227±0.00112 | 4.16 | |
| 22 | 18 | 0.82 | 28 | 28 | 0.983±0.018 | 0.01037±0.00135 | 3.52 | |
| 218 | 121 | 0.56 | 87 | 80 | 0.972±0.006 | 0.01190±0.00060 | 4.03 | |
| 163 | 99 | 0.60 | 81 | 85 | 0.972±0.008 | 0.01245±0.00068 | 4.22 | |
| 134 | 83 | 0.62 | 62 | 63 | 0.962±0.012 | 0.01227±0.00075 | 4.16 | |
| 137 | 56 | 0.41 | 126 | 131 | 0.957±0.009 | 0.01576±0.00341 | 5.34 | |
| 18 | 16 | 0.89 | 21 | 21 | 0.987±0.023 | 0.01124±0.00139 | 3.81 | |
| 24 | 21 | 0.88 | 32 | 33 | 0.986±0.018 | 0.01219±0.00162 | 4.13 | |
| 13 | 12 | 0.92 | 28 | 28 | 0.987±0.035 | 0.01528±0.00283 | 5.18 | |
| 46 | 37 | 0.80 | 45 | 45 | 0.977±0.015 | 0.01345±0.00117 | 4.56 | |
| 20 | 17 | 0.85 | 21 | 21 | 0.982±0.026 | 0.01045±0.00118 | 3.54 | |
| 182 | 103 | 0.56 | 74 | 79 | 0.976±0.006 | 0.01208±0.00059 | 4.10 | |
| 2 | 2 | 1.00 | 4 | 4 | 1.000±0.500 | 0.01180±0.00590 | 4.00 | |
| 42 | 27 | 0.64 | 32 | 32 | 0.941±0.028 | 0.01059±0.00106 | 3.59 | |
| 133 | 93 | 0.70 | 73 | 74 | 0.984±0.006 | 0.01336±0.00073 | 4.53 | |
| 17 | 12 | 0.71 | 13 | 13 | 0.949±0.037 | 0.00829±0.00147 | 2.81 | |
| 58 | 41 | 0.71 | 51 | 51 | 0.972±0.014 | 0.01171±0.00106 | 3.97 | |
| 20 | 16 | 0.80 | 22 | 22 | 0.974±0.025 | 0.01082±0.00140 | 3.67 | |
| 49 | 37 | 0.76 | 49 | 50 | 0.965±0.019 | 0.01276±0.00138 | 4.32 | |
| 125 | 86 | 0.69 | 106 | 116 | 0.970±0.011 | 0.01490±0.00255 | 5.05 | |
| 30 | 27 | 0.90 | 36 | 38 | 0.993±0.011 | 0.01385±0.00149 | 4.69 | |
| 142 | 113 | 0.80 | 81 | 84 | 0.992±0.003 | 0.01520±0.00078 | 5.15 | |
| 30 | 25 | 0.83 | 32 | 32 | 0.975±0.021 | 0.01134±0.00148 | 3.84 |
n = Sample Size; k = Number of different haplotypes; S = number of polymorphic (segregating sites); Nmut = total number of mutations; H = haplotype diversity and standard error; π = nucleotide diversity and standard error; M = Average number of nucleotide differences
Fig 2Geographic maps of sample sizes (A) and diversity indices (B and C).
Fig 3Geographic maps of the haplogroup frequencies.
See legend of Fig 1 for more information.
Analysis of molecular variance (AMOVA) accounting for main geographic regions and Autonomous Regions (P-value < 0.0000).
| Source of Variation | Percentage of Variation | |
|---|---|---|
| Main geographic regions | Autonomous Regions | |
| Among groups | 0 | 0.15 |
| Among populations within groups | 0.30 | 0.16 |
| Within populations | 99.78 | 99.69 |
Fig 4(A) Dispersal of H14a and H14a2 haplogroups along the Mediterranean coast. (B) Maximum parsimony tree of haplogroups represented by the Iberian mitogenomes sequenced in the present study. The position of the revised Cambridge reference sequence (rCRS) is indicated for reading sequence motifs [53]. Mitochondrial DNA variants are indicated along the branches of the phylogenetic tree. Mutations are transitions unless a suffix A, C, G, or T indicates a transversion. A suffix ‘+’ indicates insertions. Variants underlined represent recurrent mutations in this tree while a prefix ‘@’ indicates a back mutation. Mutational hotspot variants at positions 16182, 16183, and 16519, as well as variation around position 310 and length or point heteroplasmies were not considered for the phylogenetic reconstruction. The color of the circles indicates their geographic origin according to the legend inset. The Iberian complete genomes obtained in the present study are indicated with yellow circles. More details on the geographic or ethnic origin of all the mitogenomes used in this network are provided in S3 Table. Haplogroups in orange boxes represent new branches of the mtDNA phylogeny, and are therefore not present in Phylotree Build 17.
TMRCA for haplogroup H14 and its sub-lineages.
| Haplogroup | Genetic distance | S.E. | Mean (ky) | 95% CI |
|---|---|---|---|---|
| H14 | 8.31 | 1.76 | 22.8 | 13.0–33.1 |
| H14a | 6.23 | 0.85 | 16.8 | 12.2–21.6 |
| H14a2 | 5.67 | 0.91 | 15.2 | 10.3–20.3 |
| H14a2a | 2.74 | 1.23 | 7.2 | 0.9–13.8 |
| H14a2c | 5.67 | 1.26 | 15.2 | 8.4–22.3 |
| H14a2c1 | 1.67 | 1.10 | 4.3 | 0–10.1 |
| H14b | 5.13 | 0.98 | 13.7 | 8.5–19.1 |
| H14b1 | 0.44 | 0.44 | 1.1 | 0–3.4 |
| H14b2 | 3.65 | 1.13 | 9.7 | 3.7–15.8 |
| H14b3 | 3.21 | 1.16 | 8.5 | 2.4–14.7 |
| H14b4 | 3.49 | 1.35 | 9.2 | 2.2–16.6 |