| Literature DB >> 28525980 |
Candela L Hernández1, Jean M Dugoujon2, Andrea Novelletto3, Juan N Rodríguez4, Pedro Cuesta5, Rosario Calderón6.
Abstract
BACKGROUND: The structure of haplogroup H reveals significant differences between the western and eastern edges of the Mediterranean, as well as between the northern and southern regions. Human populations along the westernmost Mediterranean coasts, which were settled by individuals from two continents separated by a relatively narrow body of water, show the highest frequencies of mitochondrial haplogroup H. These characteristics permit the analysis of ancient migrations between both shores, which may have occurred via primitive sea crafts and early seafaring. We collected a sample of 750 autochthonous people from the southern Iberian Peninsula (Andalusians from Huelva and Granada provinces). We performed a high-resolution analysis of haplogroup H by control region sequencing and coding SNP screening of the 337 individuals harboring this maternal marker. Our results were compared with those of a wide panel of populations, including individuals from Iberia, the Maghreb, and other regions around the Mediterranean, collected from the literature.Entities:
Keywords: Gene flow; Human evolution; Iberian Peninsula; North Africa; Phylogeography; Population structure
Mesh:
Substances:
Year: 2017 PMID: 28525980 PMCID: PMC5437654 DOI: 10.1186/s12863-017-0514-6
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Fig. 1Map of the region. a Location of the Andalusian territory in the context of the Mediterranean region. b Municipalities of sampling in Huelva and Granada provinces: 1: El Repilado, 2: Aracena, 3: El Cerro del Andévalo, 4: La Puebla de Guzmán, 5: Valverde del Camino, 6: Villablanca, 7: Niebla, 8: Huéscar, 9: Baza, 10: Montefrío, 11: Loja, 12: Alhama de Granada, 13: Órgiva. The capital cities are highlighted with a square. Basemaps were adapted from ArcGIS Online repository (World Imagery, Esri) and NASA Visible Earth (Spain and Portugal. Credit: Jacques Descloitres, MODIS Land Rapid Response Team, NASA/GSFC. https://visibleearth.nasa.gov/). Both images are freely available for re-publication or re-use
Frequencies of H lineages and sub-lineages in Andalusia (Southern Iberia)
| Haplogroup | W-Andalusia ( | E-Andalusia ( | ||
|---|---|---|---|---|
|
| % |
| % | |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| H1* | 38 | 34.55 | 58 | 25.55 |
| H1a | – | – | 1 | 0.44 |
| H1a1 | 1 | 0.91 | – | – |
| H1b1 | – | – | 5 | 2.20 |
| H1c3 | 4 | 3.64 | – | – |
| H1e1a3 | – | – | 5 | 2.20 |
| H1k | 1 | 0.91 | – | – |
| H1t1a1 | – | – | 1 | 0.44 |
| H1ah1 | 3 | 2.73 | – | – |
| H1ba | – | – | 1 | 0.44 |
| H1bf1 | 1 | 0.91 | 2 | 0.88 |
|
|
|
|
|
|
| H2a1 | 3 | 2.73 | 1 | 0.44 |
| H2a2b | – | – | 2 | 0.88 |
|
|
|
|
|
|
| H3* | 14 | 12.73 | 30 | 13.22 |
| H3c | – | – | 10 | 4.41 |
| H3c2 | 2 | 1.82 | 1 | 0.44 |
|
|
|
|
|
|
| H4a1 | 1 | 0.91 | 12 | 5.29 |
| H4a1a4b1 | 2 | 1.82 | – | – |
|
|
|
|
|
|
| H5* | 4 | 3.64 | 6 | 2.64 |
| H5a | – | – | 5 | 2.20 |
| H5a3a1 | – | – | 5 | 2.20 |
| H5a4a | 1 | 0.91 | 1 | 0.44 |
|
|
|
|
|
|
| H6a | 6 | 5.45 | 9 | 3.96 |
| H6a1a1a | – | – | 4 | 1.76 |
| H6a1a7 | 2 | 1.82 | 2 | 0.88 |
| H6a1b4 | – | – | 1 | 0.44 |
|
|
|
|
|
|
| H7* | 5 | 4.55 | 6 | 2.64 |
| H7h | – | – | 1 | 0.44 |
|
|
|
| – | – |
| H10a1 | 1 | 0.91 | – | – |
| H10e | 2 | 1.82 | – | – |
|
| – | – |
|
|
| H11* | – | – | 4 | 1.76 |
| H11a | – | – | 1 | 0.44 |
|
| – | – |
|
|
|
|
|
|
|
|
| H17* | 1 | 0.91 | 3 | 1.32 |
| H17c | – | – | 2 | 0.88 |
|
|
|
|
|
|
|
| – | – |
|
|
| H20* | – | – | 1 | 0.44 |
| H20a | – | – | 3 | 1.32 |
|
|
|
|
|
|
|
| – | – |
|
|
|
| – | – |
|
|
The first row on the left (in italics) indicates the frequency of the whole Hg H in each population. From there below, frequencies indicate the proportion within the Hg. Main Hg H sub-clades are highlighted in boldface. Asterisks (*) indicate those samples unclassified at the present resolution level or belonging to a specific paragroup
Fig. 2Mismatch distributions for Huelva (a) and Granada (b) and statistical parameters of control region sequences. For both cases, expected distributions are shown in black. K: number of different sequences and percentage of sample size in parentheses; P: number of polymorphic sites; H: haplotype diversity and standard deviation; D: nucleotide diversity; M: mean number of pairwise differences; SSD: sum of squared deviations between the observed and expected mismatch distribution; r: Harpending’s raggedness index; D and Fs are, respectively, the Tajima’s and Fu’s tests of selective neutrality. The last 4 indices are presented with their significance values, as ns (non-significant), **P < 0.01; ***P < 0.001
Fig. 3a, b, c. Interpolated frequency surfaces of clade H and its main sub-clades (H1 and H3). Frequencies (%) are showed in a colour scale. See information about the populations used in Additional files 4 and 5. Map templates were taken from Natural Earth free map repository (http://www.naturalearthdata.com/)
Fig. 4Median-joining networks for sub-Hgs H1 (a) and H3 (b) in Iberia and Northern Africa. HVS-I sequences (nH1 = 708; nH3 = 264) are shown along with some coding-region positions (in bold red). All positions are scored against the rCRS. Sub-Hgs are filled in blue. Transversions are specified with the base change after the mutation. The asterisks show the basal nodes (nps. 3010 for H1; nps. 6776 for H3). Circle sizes are proportional to the haplotype frequency. See Additional file 4 for details on populations used
Cases (above diagonal) and percentages (below diagonal) of different and shared mtDNA haplotypes (non-singletons) for H1 and H3 Hgs between Andalusia, Iberiaa, Morocco and Maghrebb
| Hg H1 | Andalusia | Iberiaa
| Morocco | Maghrebb
|
| Andalusia | – | 22 | 5 | 7 |
| Iberiaa | 84.62 | – | 6 | 13 |
| Morocco | 83.33 | 100.00 | – | 4 |
| Maghrebb | 35.00 | 65.00 | 66.67 | – |
| Hg H3 | Andalusia | Iberiaa
| Morocco | Maghrebb
|
| Andalusia | – | 8 | 2 | 2 |
| Iberiaa | 80.00 | – | 2 | 4 |
| Morocco | 50.00 | 50.00 | – | 2 |
| Maghrebb | 33.33 | 66.67 | 50.00 | – |
without aAndalusia and bMorocco
Coalescence age estimates for the main H-subclades
| Sub-clade |
|
| σ | Age (years) | 95% C.I. |
|---|---|---|---|---|---|
| H1 | 1524 | 0.99 | 0.21 | 16,480 | [9687–23,273] |
| H1 Iberia | 591 | 0.80 | 0.20 | 13,375 | [6892–19,859] |
| H1 North Africa | 191 | 0.63 | 0.11 | 10,478 | [6756–14,200] |
| H2 | 244 | 1.42 | 0.70 | 23,648 | [899–46,397] |
| H3 | 391 | 0.78 | 0.20 | 13,085 | [6442–19,728] |
| H3 Iberia | 236 | 0.57 | 0.15 | 9469 | [4420–14,518] |
| H3 North Africa | 37 | 0.49 | 0.17 | 8113 | [2526–13,700] |
| H4 | 129 | 0.57 | 0.13 | 9437 | [5243–13,632] |
| H5 | 324 | 0.61 | 0.14 | 10,140 | [5686–14,594] |
| H6 | 177 | 0.78 | 0.19 | 13,002 | [6931–19,074] |
| H7 | 120 | 0.66 | 0.19 | 10,979 | [4756–17,202] |
| H10 | 69 | 0.88 | 0.41 | 14,743 | [1286–28,201] |
| H11 | 59 | 1.27 | 0.41 | 21,200 | [7823–34,577] |
| H13 | 120 | 1.00 | 0.20 | 16,677 | [10,154–23,200] |
| H20 | 46 | 1.07 | 0.50 | 17,789 | [1353–34,225] |
Values of the parameter ρ were based on HVS-I sequences detailed in Additional file 4 and transformed into years by using the corrected molecular clock proposed by [24]
Fig. 5Hierarchical Cluster Analysis (HCA) of 71 populations based on their Hg H sub-structure. Only high-resolution studies were considered for this analysis (see populations underlined in Additional file 4). The sub-clades used here are depicted as vectors. Affiliation to each of 4 clusters discussed in the text is colour-coded. Each geographic affiliation has a characteristic symbol (see legend). Populations are coded as in Additional file 4. Our Andalusian populations correspond to numbers 17 (Huelva) and 19 (Granada)
F and F indices obtained by AMOVA for several groups of northern and southern Mediterranean populations
| Fixation Indices | ||||
|---|---|---|---|---|
| Population Groups |
| % total variance |
| % total variance |
| Southern Iberia (Andalusia) (2)a vs Northern Iberia (12)b | 0.01921*** | 1.90 | 0.01022 (n.s.)e | 1.02 |
| Southern Iberia (Andalusia) (2) vs Spanish Basque Area (8)c | 0.01944*** | 1.90 | 0.02117 (n.s.) | 2.12 |
| Iberian Peninsula (18) vs Maghreb (6)d | 0.02035*** | 2.01 | 0.01034 (n.s.) | 1.03 |
| Iberian Peninsula (18) vs Morocco, Algeria (2) | 0.01838*** | 1.83 | 0.00592 (n.s.) | 0.59 |
| Iberian Peninsula (18) vs Italy (14) | 0.02284*** | 2.17 | 0.05207*** | 5.21 |
| Andalusia (2) vs Maghreb (6) | 0.01590*** | 1.57 | 0.01017 (n.s.) | 1.02 |
| Andalusia (2) vs Morocco, Algeria (2) | 0.00224 (n.s.) | 0.22 | 0.02106 (n.s.) | 2.11 |
Results are based on mtDNA clade H composition
All values have been corrected for multiple testing using Bonferroni
aNumber of populations per group
bIncluding from west to east the Spanish northern regions of Galicia, Asturias, Cantabria and Basque Country
cIncluding the Basque Country and the neighboring territory of Navarra
dIncluding Morocco, Algeria, Tunisia and Libya
en.s. not significant; ***P < 0.001