| Literature DB >> 32752197 |
Fatma A Aljasmi1, Ranjit Vijayan2, Naganeeswaran Sudalaimuthuasari3, Abdul-Kader Souid1, Noushad Karuvantevida2, Raja Almaskari2, Hidaya Mohammed Abdul Kader2, Biduth Kundu2, Khaled Michel Hazzouri3, Khaled M A Amiri2,3.
Abstract
In order to assess the genomic landscape of the United Arab Emirates (UAE) mitogenome, we sequenced and analyzed the complete genomes of 232 Emirate females mitochondrial DNA (mtDNA) within and compared those to Africa. We investigated the prevalence of haplogroups, genetic variation, heteroplasmy, and demography among the UAE native population with diverse ethnicity and relatively high degree of consanguinity. We identified 968 mtDNA variants and high-resolution 15 haplogroups. Our results show that the UAE population received enough gene flow from Africa represented by the haplogroups L, U6, and M1, and that 16.8% of the population has an eastern provenance, depicted by the U haplogroup and the M Indian haplogroup (12%), whereas western Eurasian and Asian haplogroups (R, J, and K) represent 11 to 15%. Interestingly, we found an ancient migration present through the descendant of L (N1 and X) and other sub-haplogroups (L2a1d and L4) and (L3x1b), which is one of the oldest evolutionary histories outside of Africa. Our demographic analysis shows no population structure among populations, with low diversity and no population differentiation. In addition, we show that the transmission of mtDNA in the UAE population is under purifying selection with hints of diversifying selection on ATP8 gene. Last, our results show a population bottleneck, which coincides with the Western European contact (1400 ybp). Our study of the UAE mitogenomes suggest that several maternal lineage migratory episodes liking African-Asian corridors occurred since the first modern human emerges out of Africa.Entities:
Keywords: Mitochondrial DNA; Selection; demography; heteroplasmy; next generation sequencing; single nucleotide polymorphism (SNP)
Mesh:
Year: 2020 PMID: 32752197 PMCID: PMC7464197 DOI: 10.3390/genes11080876
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Mitochondrial genomic variation within the UAE population. (A) Circos plot depicting the overall mitochondrial genomic variation. From the outer to inner rings: (1) variant allele frequency, (2) variant type, and (3) gene positions. The inner barplot shows the synonymous and nonsynonymous variants for each mitochondrial gene. (B) Quality control statistics such as depth of coverage (DP), mapping quality (MQ), and quality of variants (QUAL), as well as the number of variants per site. (C) Percentage of heterozygosity as a proxy for heteroplasmy for heterozygote variants depicted on the x-axis. The y-axis is the number of individuals with heteroplasmic locus. (D) The number of variants distribution per mitochondrial gene, RNR, tRNA, and noncoding.
UAE population haplogroup distribution.
| NE ** | NW ** | Mid-NW ** | SW ** | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Group a | F b | F b% | Unknown | Fujairah | RAK * | UAQ * | Dubai | Sharjah | Ajman | Al Ain | Abu Dhabi |
| E | 2 | 0.86 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 |
| F | 1 | 0.43 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| H | 19 | 8.18 | 0 | 5 | 3 | 0 | 1 | 3 | 0 | 7 | 0 |
| HV | 10 | 4.31 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 5 | 2 |
| I | 3 | 1.29 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 2 | 0 |
| J | 25 | 10.77 | 0 | 0 | 5 | 0 | 1 | 2 | 1 | 13 | 3 |
| K | 26 | 11.20 | 0 | 11 | 11 | 0 | 0 | 0 | 0 | 4 | 0 |
| L | 20 | 8.62 | 1 | 3 | 4 | 0 | 0 | 4 | 0 | 5 | 3 |
| M | 28 | 12.06 | 2 | 11 | 5 | 0 | 2 | 0 | 0 | 7 | 1 |
| N | 6 | 2.58 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 4 | 0 |
| R | 35 | 15.08 | 1 | 6 | 7 | 2 | 2 | 2 | 1 | 14 | 0 |
| T | 14 | 6.03 | 0 | 3 | 0 | 0 | 2 | 1 | 0 | 6 | 2 |
| U | 39 | 16.81 | 3 | 4 | 3 | 1 | 2 | 2 | 1 | 16 | 7 |
| W | 1 | 0.43 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| X | 3 | 1.29 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 |
| Total | 232 | 100 | 9 | 43 | 39 | 3 | 15 | 15 | 4 | 86 | 18 |
** Region-wise classification: NE—Northeast region (n = 43), NW—Northwest region (n = 39); Mid-NW—mid-Northwest region (n = 37); SW—Southwest region (n = 104); * RAK—Ras al Khaimah; UAQ—Umm al-Quwain; a Haplogroup; b Frequency.
Figure 2Haplogroup distribution and frequency. (A) Network haplogroup tree depicting the ancestral (L) haplogroups and the out of Africa haplogroups. (B) Total haplogroup frequencies for the UAE population. (C) Stacked bar plots for the frequencies of L3 haplogroup from different Levant and African countries. Stars highlight the presence of L3x1b sub-haplogroups in these countries. (D) A map of Africa and the Arabian Peninsula and a zoom on the UAE map with highlights of the different haplogroup frequencies for the different cities of the UAE.
Figure 3Population structure in UAE population. (A) Maximum likelihood phylogenetic tree depicting the network relationship among the different haplogroups (sub-haplogroups), where they are color-coded by city/region. (B) Admixture plot for K = 2 and K = 7 ancestral population. (C) Discriminant Analysis of Principal Component (DAPC) showing the clustering of the different cities of the UAE population.
Figure 4Population differentiation and diversity. (A) Population differentiation (Fst) per city of the UAE. (B) Population differentiation statistics depicted by Hs, Ht, Gst, G’st, and D. (C) A map showing the expected gradient of diversity decrease along the out of Africa passage. Diversity estimates for Africa in comparisons to other UAE cities as well as L haplogroup (and L3) to M and N haplogroups. (D) Consanguinity statistics plotted as a KING robust kinship matrix for the different individuals, where the degree of relatedness is shown as UN (unrelated), DS (distantly related), C 1(first cousin), HS (half-sibs), PO (parents-offspring), and FS (full-sibs).
Population genetic parameters and the ratio of nonsynonymous to synonymous substitution rate for each mitochondrial gene.
| Genes | M | S | Ps | θ | π | D | dN/dS |
|---|---|---|---|---|---|---|---|
| ATP6 | 232 | 45 | 0.067265 | 0.011244 | 0.002034 | −2.391682 | 0.29 |
| ATP8 | 232 | 17 | 0.087179 | 0.014573 | 0.001415 | −2.325058 | 1.03 |
| COX1 | 232 | 75 | 0.050302 | 0.008409 | 0.001305 | −2.556932 | 0.10 |
| COX2 | 232 | 34 | 0.050595 | 0.008458 | 0.001166 | −2.449534 | 0.15 |
| COX3 | 232 | 40 | 0.053763 | 0.008987 | 0.001757 | −2.323988 | 0.12 |
| CYTB | 232 | 95 | 0.085818 | 0.014346 | 0.002837 | −2.457115 | 0.19 |
| ND1 | 232 | 56 | 0.06041 | 0.010098 | 0.002071 | −2.361573 | 0.12 |
| ND2 | 232 | 63 | 0.0625 | 0.010448 | 0.00161 | −2.533238 | 0.12 |
| ND3 | 232 | 16 | 0.048048 | 0.008032 | 0.003149 | −1.548463 | 0.15 |
| ND4 | 232 | 74 | 0.055306 | 0.009245 | 0.002533 | −2.195525 | 0.04 |
| ND4L | 232 | 31 | 0.050542 | 0.016855 | 0.001867 | −2.440526 | 0.06 |
| ND5 | 232 | 121 | 0.068789 | 0.011499 | 0.002293 | −2.476299 | 0.14 |
| ND6 | 232 | 31 | 0.060429 | 0.010102 | 0.001778 | −2.31697 | 0.08 |
m: number of sequences. S: Number of segregating sites. Ps: S/n (total number of sites). θ: Watterson theta. π: nucleotide diversity. D: Tajima’s D. dN/dS: ratio of nonsynonymous/synonymous substitution rate.
Figure 5Demographic reconstruction of the UAE mitogenome. Extended Bayesian skyline plot of female effective population size and vertical red line to highlight the bottleneck event that is around 1400 ybp.
Population genetic parameters per cities and haplogroups.
| City/Haplogroup | π | D |
|---|---|---|
| Abu Dhabi | 0.0033 ± 0.00310 | −0.9003 ± 0.67193 |
| Al Ain | 0.0022 ± 0.00241 | −1.0783 ± 0.59466 |
| Dubai, Ajman, UQA | 0.0029 ± 0.00264 | −0.8337 ± 0.54551 |
| Fujairah | 0.0023 ± 0.00258 | −1.0999 ± 0.74124 |
| RAK | 0.0021 ± 0.00237 | −0.8034 ± 0.62341 |
| Sharjah | 0.0024 ± 0.00211 | −0.9012 ± 0.53421 |
| UAQ | 0.0021 ± 0.0012 | −0.5231 ± 0.32310 |
| Africa (L3) | 0.004 ± 0.00279 | −1.7485 ± 0.38859 |
| L | 0.0037 ± 0.00302 | −0.8079 ± 0.70818 |
| M | 0.0016 ± 0.00162 | −1.0248 ± 0.56632 |
| N | 0.0021 ± 0.00221 | −0.9080 ± 0.85152 |
π: nucleotide diversity. D: Tajima’s D.