Panayiota Kotsakiozi, Benjamin R Evans, Andrea Gloria-Soria, Basile Kamgang, Martin Mayanja, Julius Lutwama, Gilbert Le Goff, Diego Ayala, Christophe Paupy, Athanase Badolo, Joao Pinto, Carla A Sousa, Arlete D Troco, Jeffrey R Powell.
Abstract
Aedes aegypti, the major vector of dengue, yellow fever, chikungunya, and Zika viruses, remains of great medical and public health concern. There is little doubt that the ancestral home of the species is Africa. This mosquito invaded the New World 400-500 years ago and, later, Asia. However, little is known about the genetic structure and history of Ae. aegypti across Africa, or about the possible origin(s) of the New World invasion. Here, we use ~17,000 genome-wide single nucleotide polymorphisms (SNPs) to characterize a heretofore undocumented complex picture of this mosquito across its ancestral range in Africa. We find signatures of human-assisted migrations, of connectivity across long distances in sylvan populations, and of local admixture between domestic and sylvan populations. Finally, through a phylogenetic analysis combined with the genetic structure analyses, we suggest West Africa, and especially Angola, as the source of the New World invasion, a scenario that fits well with the historical record of the 16th-century slave trade between Africa and the Americas.
Keywords: Aedes aegypti; Africa; SNP‐chip; genetics; migration; population structure
Year: 2018 PMID: 30250667 PMCID: PMC6145026 DOI: 10.1002/ece3.4278
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Population information for the Aedes aegypti samples used in this study
| Continent | Region | Country/island | Locality (abbreviation) | Type | Samples | SNPs | Latitude | Longitude |
|---|---|---|---|---|---|---|---|---|
| Africa | West Africa | Angola | Luanda (Ang) | Domestic | 12 | 16,906 | −9.76667 | 14.26667 |
| | | Burkina Faso | Burkina Faso (BF) | Domestic | 12 | 16,855 | 12.2383 | −1.5616 |
| | | Cameroon | Yaounde Mokolo (YAOMO) | Domestic | 7 | 16,845 | 3.87275 | 11.5012 |
| | | Cameroon | Yaounde MvogAda (YAOMV) | Domestic | 8 | 16,804 | 3.86275 | 11.5259 |
| | | Cameroon | Yaounde Center (CAM) | Domestic | 12 | 16,877 | 3.866667 | 11.5167 |
| | | Cameroon | Yaounde Forest (YAOF) | Sylvan | 8 | 16,758 | 3.87601 | 11.3761 |
| | | Cameroon | Yaounde Village (YAOV) | Peridomestic | 8 | 16,795 | 3.86076 | 11.3937 |
| | | Cameroon | Buffalo camp (CamD) | Peridomestic | 10 | 16,853 | 8.371057 | 13.866 |
| | | Gabon | Franceville (GB) | Domestic | 12 | 16,797 | −1.63324 | 13.583 |
| | | Gabon | Lope Forest (GB_F) | Sylvan | 12 | 16,801 | −0.37896 | 11.5274 |
| | | Gabon | Lope Village (GB_V) | Peridomestic | 12 | 16,701 | −0.37896 | 11.5274 |
| | | Senegal | Sedhiou (Sedh) | Peridomestic | 12 | 16,866 | 14.183 | −12.717 |
| | | Senegal | Goudiry (Goud) | Peridomestic | 12 | 16,903 | 12.707 | −15.5552 |
| | East Africa | South Africa | Johannesburg (AFS) | Domestic | 9 | 16,777 | 27.9006 | −25.9904 |
| | | Uganda | Lunyo (Lun) | Peridomestic | 12 | 16,859 | 0.3267 | 33.8936 |
| | | Uganda | Zika village (ZIKA) | Peridomestic | 14 | 16,811 | 0.12745 | 32.5313 |
| | | Kenya | Kaya Forest (KEN) | Sylvan | 8 | 16,861 | −3.93194 | 39.5961 |
| | | Kenya | Kahawa Sukari (KS) | Peridomestic | 8 | 16,874 | −1.19451 | 36.9456 |
| | | Kenya | Nairobi (NBO) | Domestic | 8 | 16,702 | −1.2833 | 36.8167 |
| | | Reunion island | Reunion Island (RI) | Domestic | 12 | 14,499 | −20.1818 | 57.5171 |
| | | Mauritius island | | Outgroup | 4 | 13,286 | −20.1668 | 57.5147 |
| Asia | | Australia | Cairns (Cairns) | Aaa | 12 | 16,990 | −16.817 | 145.686 |
| | | Georgia | Georgia (Georgia) | Aaa | 10 | 16,927 | 41.9614 | 43.3624 |
| | | Philippines | Philippines (BBG) | Aaa | 8 | 17,005 | 10.2833 | 123.947 |
| | | Tahiti | Tahiti (FP) | Aaa | 12 | 17,000 | −17.531 | −149.56 |
| | | Vietnam | Ho Chi Minh (HCM) | Aaa | 12 | 16,976 | 10.8032 | 106.695 |
| New World | | Brazil | Macapà (AJM) | Aaa | 12 | 16,935 | 0.03542 | −51.071 |
| | | Caribbean | Dominica (Dom) | Aaa | 12 | 16,938 | 15.59166 | −61.4111 |
| | | Colombia | Cali (Cali) | Aaa | 12 | 17,012 | 3.43894 | −76.516 |
| | | Siquirres | Costa Rica (CR) | Aaa | 6 | 16,394 | 9.93848 | −84.095 |
| | | Mexico | Chetumal (CheDC) lab strain | Aaa | 8 | 16,997 | | |
For each population, the sampling locality (with abbreviation), the ecological setting where sampled, the number of mosquitoes analyzed, the average number of SNPs obtained, and location in latitude/longitude for the samples are presented.
Figure 1. Locations of Ae. aegypti sampled from mainland Africa and Reunion Island. Two of the sampling localities, Yaounde and Lope, include 5 and 2 sampling sites, respectively. The multiple sampling points in these localities are less than 3 km apart. The blue sampling site represents Ae. mascarensis, used as an outgroup.
Figure 2. STRUCTURE bar plots for all Ae. aegypti populations and Ae. mascarensis. Population names are reported on the x‐axis. The y‐axis reports the probability (Q‐value) of each individual being assigned to one of the genetic groups identified by fastSTRUCTURE, which are represented by different colors. Each bar represents an individual. Individuals with 100% assignment to one group are identified by a single color. Individuals with mixed ancestry are represented by bars with different percentages of colors. The thick black lines within the plots indicate population limits. Abbreviations: SA: South Africa, BF: Burkina Faso, ANG: Angola, masc: Ae. mascarensis.
Figure 3. STRUCTURE bar plots for all African Ae. aegypti populations. Population names are reported on the x‐axis. For details, see the legend of Figure 2.
Figure 4. Principal components analysis (PCA) on (a) the broad dataset including all the Ae. aegypti populations as well as Ae. mascarensis and (b) only the African populations. The PCA was implemented and plotted in the LEA R package and shows the projection of all individual mosquitoes on the first two PCs. Populations originating from different regions are shown in different colors, as indicated in the inset.
Figure 5. Discriminant analysis of principal components (DAPC) for the African populations as implemented and plotted in the “adegenet” R package. The graph represents the individuals as dots and the groups as inertia ellipses. A bar plot of eigenvalues for the discriminant analysis (DA eigenvalues) is displayed in the inset. The bars in the inset represent the number of discriminant functions retained in the analysis, the first two of which are used in the plot. Population codes are as shown in Table 1.
Analyses of molecular variance (AMOVA) as implemented in Arlequin
| Groups | Source of variation | df | Percentage of variation (%) |
|---|---|---|---|
| Africa/out of Africa | Among groups | 1 | 20.79 |
| | Within groups | 28 | 13 |
| | Within populations | 592 | 66.21 |
| West Africa/East Africa | Among groups | 1 | 1.89 |
| | Within groups | 17 | 12.87 |
| | Within populations | 371 | 85.23 |
| BF/Kenya/Uganda/Angola/SA/Cameroon/Gabon/Senegal | Among groups | 7 | 6.37 |
| | Within groups | 11 | 8.13 |
| | Within populations | 371 | 85.5 |
| Domestic/Peridomestic/Sylvan | Among groups | 2 | 0.05 |
| | Within groups | 13 | 13.82 |
| | Within populations | 371 | 86.13 |
Populations are divided into groups as shown in Table 1.
BF: Burkina Faso; df: degrees of freedom; SA: South Africa.
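The percentages in the AMOVA table can be read as hierarchical fixation indices. The sketch below is not part of the original analysis; it assumes that "Within groups" denotes variation among populations within groups (the usual AMOVA layout) and that the reported percentages are proportional to the variance components, and it applies the standard definitions of Φ_CT, Φ_SC, and Φ_ST. The function name `phi_statistics` is illustrative.

```python
# Percentages of variation copied from the AMOVA table, in the order
# (among groups, within groups, within populations).
AMOVA = {
    "Africa/out of Africa": (20.79, 13.0, 66.21),
    "West Africa/East Africa": (1.89, 12.87, 85.23),
    "BF/Kenya/Uganda/Angola/SA/Cameroon/Gabon/Senegal": (6.37, 8.13, 85.5),
    "Domestic/Peridomestic/Sylvan": (0.05, 13.82, 86.13),
}


def phi_statistics(among_groups, within_groups, within_pops):
    """Return (phi_CT, phi_SC, phi_ST) from three variance components.

    Assumption: the inputs are proportional to the AMOVA variance
    components (here, the reported percentages of variation).
    """
    total = among_groups + within_groups + within_pops
    phi_ct = among_groups / total                       # groups vs. total
    phi_sc = within_groups / (within_groups + within_pops)  # populations within groups
    phi_st = (among_groups + within_groups) / total     # populations vs. total
    return phi_ct, phi_sc, phi_st


for grouping, components in AMOVA.items():
    ct, sc, st = phi_statistics(*components)
    print(f"{grouping}: phi_CT={ct:.4f} phi_SC={sc:.4f} phi_ST={st:.4f}")
```

Under these assumptions, the among-group percentage divided by 100 is Φ_CT, which makes the contrast in the table explicit: the Africa/out of Africa split accounts for far more variation than the Domestic/Peridomestic/Sylvan split.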
Population Differentiation
| | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1: Buffalo camp | | | | | | | | | | | | | | | | | | |
| 2: Yaounde Mokolo | 0.09 | | | | | | | | | | | | | | | | | |
| 3: Yaounde Mvog | 0.05 | 0.07 | | | | | | | | | | | | | | | | |
| 4: Yaounde Center | 0.08 | 0.07 | 0.06 | | | | | | | | | | | | | | | |
| 5: Yaounde Forest | 0.03 | 0.08 | 0.04 | 0.08 | | | | | | | | | | | | | | |
| 6: Yaounde Village | 0.02 | 0.08 | 0.04 | 0.07 | 0.01 | | | | | | | | | | | | | |
| 7: Burkina Faso | 0.03 | 0.07 | 0.04 | 0.07 | 0.04 | 0.03 | | | | | | | | | | | | |
| 8: Luanda Angola | 0.20 | 0.19 | 0.17 | 0.19 | 0.20 | 0.19 | 0.18 | | | | | | | | | | | |
| 9: Goudiry | 0.15 | 0.16 | 0.13 | 0.15 | 0.14 | 0.13 | 0.13 | 0.15 | | | | | | | | | | |
| 10: Sedhiou | 0.13 | 0.15 | 0.12 | 0.15 | 0.11 | 0.11 | 0.11 | 0.20 | 0.16 | | | | | | | | | |
| 11: Johannesburg | 0.22 | 0.26 | 0.23 | 0.24 | 0.22 | 0.21 | 0.22 | 0.22 | 0.27 | 0.27 | | | | | | | | |
| 12: Kahawa Sukari | 0.06 | 0.11 | 0.07 | 0.10 | 0.06 | 0.05 | 0.07 | 0.19 | 0.16 | 0.14 | 0.18 | | | | | | | |
| 13: Kaya Forest | 0.28 | 0.30 | 0.27 | 0.28 | 0.28 | 0.27 | 0.26 | 0.23 | 0.29 | 0.31 | 0.22 | 0.25 | | | | | | |
| 14: Nairobi | 0.18 | 0.20 | 0.19 | 0.20 | 0.18 | 0.17 | 0.17 | 0.17 | 0.22 | 0.23 | 0.07 | 0.15 | 0.20 | | | | | |
| 15: Lope Forest | 0.06 | 0.11 | 0.08 | 0.11 | 0.06 | 0.05 | 0.07 | 0.20 | 0.17 | 0.15 | 0.16 | 0.07 | 0.25 | 0.14 | | | | |
| 16: Lope Village | 0.05 | 0.09 | 0.07 | 0.09 | 0.05 | 0.04 | 0.06 | 0.18 | 0.15 | 0.14 | 0.15 | 0.06 | 0.23 | 0.13 | 0.01 | | | |
| 17: Franceville | 0.08 | 0.13 | 0.10 | 0.12 | 0.09 | 0.08 | 0.09 | 0.19 | 0.19 | 0.17 | 0.13 | 0.08 | 0.23 | 0.12 | 0.05 | 0.04 | | |
| 18: Lunyo | 0.07 | 0.11 | 0.08 | 0.11 | 0.06 | 0.06 | 0.07 | 0.21 | 0.17 | 0.15 | 0.23 | 0.08 | 0.29 | 0.20 | 0.10 | 0.08 | 0.11 | |
| 19: Zika | 0.04 | 0.09 | 0.06 | 0.09 | 0.04 | 0.04 | 0.05 | 0.20 | 0.15 | 0.13 | 0.20 | 0.05 | 0.26 | 0.17 | 0.07 | 0.06 | 0.09 | 0.05 |
Pairwise Fst values between African populations of Ae. aegypti as estimated based on the panel of ~17K SNPs, using Arlequin. All values are significant at significance level 0.05.
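Because the Fst values are reported as a lower triangle, it can help to expand them into a symmetric matrix before summarizing them. The following is a minimal sketch, not the authors' workflow, using only the first eight populations of the table (the Cameroonian sites, Burkina Faso, and Luanda); the helper names `to_symmetric` and `mean_fst_to_others` are illustrative.

```python
# First eight populations of the pairwise Fst table, in table order.
NAMES = [
    "Buffalo camp", "Yaounde Mokolo", "Yaounde Mvog", "Yaounde Center",
    "Yaounde Forest", "Yaounde Village", "Burkina Faso", "Luanda Angola",
]

# Lower triangle copied from the table: row i holds Fst against populations 0..i-1.
LOWER = [
    [],
    [0.09],
    [0.05, 0.07],
    [0.08, 0.07, 0.06],
    [0.03, 0.08, 0.04, 0.08],
    [0.02, 0.08, 0.04, 0.07, 0.01],
    [0.03, 0.07, 0.04, 0.07, 0.04, 0.03],
    [0.20, 0.19, 0.17, 0.19, 0.20, 0.19, 0.18],
]


def to_symmetric(lower):
    """Expand a lower-triangular list of rows into a full symmetric matrix."""
    n = len(lower)
    matrix = [[0.0] * n for _ in range(n)]
    for i, row in enumerate(lower):
        if len(row) != i:
            raise ValueError(f"row {i} should have {i} values, got {len(row)}")
        for j, value in enumerate(row):
            matrix[i][j] = matrix[j][i] = value
    return matrix


def mean_fst_to_others(matrix, i):
    """Average Fst between population i and every other population in the subset."""
    others = [matrix[i][j] for j in range(len(matrix)) if j != i]
    return sum(others) / len(others)


FST = to_symmetric(LOWER)
for i, name in enumerate(NAMES):
    print(f"{name}: mean Fst to the other seven = {mean_fst_to_others(FST, i):.3f}")
```

Within this subset, Luanda has the highest average differentiation from the other populations, which can be read directly from its row in the table.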
Figure 6. Isolation‐by‐distance plots for all pairs of populations from continental Africa. Statistical significance was evaluated through a Mantel test as implemented in the “ade4” R package. The original value of the correlation between the two matrices (geographic distance and genetic distance) is represented by a dot, while the histogram (a) represents the permuted values assuming the absence of spatial structure. Significant spatial structure results in the original value falling outside the reference distribution. The correlation between geographic and genetic distance was plotted using the R package “MASS.” The scatterplot (b) shows one single consistent cloud of points. The colored gradient from light blue to red indicates the density of the points, which are also shown as red points in the background of the graph. The blue dashed line represents the regression line between the geographic and genetic distance.
Figure 7. Maximum likelihood (ML) rooted phylogenetic tree reconstructed using a panel of ~12,000 SNPs. Ae. mascarensis was used as an outgroup, and Aaa samples from the New World and Asia were used to test the distinctiveness of the Aaf and Aaa lineages. Bootstrap values are presented on the nodes; values <70 are not shown.
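The logic of the Mantel test described in the Figure 6 legend can be sketched in a few lines. This is an illustration only, not a reproduction of the ade4 analysis: it uses five populations with coordinates from Table 1 and Fst values from the pairwise Fst table, assumes great-circle (haversine) distances with a mean Earth radius of 6,371 km, and uses a one-sided permutation p-value. All function names are illustrative, and with so few populations the result is not expected to match the published one.

```python
import math
import random

# (latitude, longitude) copied from Table 1 for five populations.
SITES = {
    "Buffalo camp": (8.371057, 13.866),
    "Burkina Faso": (12.2383, -1.5616),
    "Luanda": (-9.76667, 14.26667),
    "Kaya Forest": (-3.93194, 39.5961),
    "Nairobi": (-1.2833, 36.8167),
}
# Pairwise Fst copied from the Fst table, keyed by unordered pair.
FST = {
    ("Buffalo camp", "Burkina Faso"): 0.03, ("Buffalo camp", "Luanda"): 0.20,
    ("Buffalo camp", "Kaya Forest"): 0.28, ("Buffalo camp", "Nairobi"): 0.18,
    ("Burkina Faso", "Luanda"): 0.18, ("Burkina Faso", "Kaya Forest"): 0.26,
    ("Burkina Faso", "Nairobi"): 0.17, ("Luanda", "Kaya Forest"): 0.23,
    ("Luanda", "Nairobi"): 0.17, ("Kaya Forest", "Nairobi"): 0.20,
}


def haversine_km(p, q, radius_km=6371.0):
    """Great-circle distance between two (lat, lon) points in decimal degrees."""
    phi1, phi2 = math.radians(p[0]), math.radians(q[0])
    dlam = math.radians(q[1] - p[1])
    a = math.sin((phi2 - phi1) / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * radius_km * math.asin(math.sqrt(a))


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def upper_triangle(matrix, order):
    """Off-diagonal entries of `matrix` after relabelling rows/columns by `order`."""
    n = len(order)
    return [matrix[order[i]][order[j]] for i in range(n) for j in range(i + 1, n)]


def mantel(d1, d2, n_perm=999, seed=1):
    """Observed matrix correlation and one-sided permutation p-value."""
    idx = list(range(len(d1)))
    x = upper_triangle(d1, idx)
    r_obs = pearson(x, upper_triangle(d2, idx))
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_perm):
        perm = idx[:]
        rng.shuffle(perm)  # relabel populations in the second matrix only
        if pearson(x, upper_triangle(d2, perm)) >= r_obs:
            hits += 1
    return r_obs, (hits + 1) / (n_perm + 1)


names = list(SITES)
geo = [[haversine_km(SITES[a], SITES[b]) for b in names] for a in names]
gen = [[0.0 if a == b else FST.get((a, b), FST.get((b, a))) for b in names] for a in names]
r, p = mantel(geo, gen)
print(f"Mantel r = {r:.3f}, one-sided p = {p:.3f}")
```

Permuting the population labels of one matrix, rather than shuffling individual cells, keeps each matrix internally consistent; this is what makes the permuted correlations a valid reference distribution for the observed value.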