| Literature DB >> 29899497 |
Christine M Jones1, Yoosook Lee2, Andrew Kitchen3, Travis Collier4, Julia C Pringle1, Mbanga Muleba5, Seth Irish6, Jennifer C Stevenson1,7, Maureen Coetzee8,9, Anthony J Cornel2, Douglas E Norris1, Giovanna Carpi10.
Abstract
Anopheles funestus s.s. is a primary vector of malaria in sub-Saharan Africa. Despite its important role in human Plasmodium transmission, evolutionary history, genetic diversity, and population structure of An. funestus in southern and central Africa remains understudied. We deep sequenced, assembled, and annotated the complete mitochondrial genome of An. funestus s.s. for the first time, providing a foundation for further genetic research of this important malaria vector species. We further analyzed the complete mitochondrial genomes of 43 An. funestus s.s. from three sites in Zambia, Democratic Republic of the Congo, and Tanzania. From these 43 mitogenomes we identified 41 unique haplotypes that comprised 567 polymorphic sites. Bayesian phylogenetic reconstruction confirmed the co-existence of two highly divergent An. funestus maternal lineages, herein defined as lineages I and II, in Zambia and Tanzania. The estimated coalescence time of these two mitochondrial lineages is ~500,000 years ago (95% HPD 426,000-594,000 years ago) with subsequent independent diversification. Haplotype network and phylogenetic analysis revealed two major clusters within lineage I, and genetic relatedness of samples with deep branching in lineage II. At this time, data suggest that the lineages are partially sympatric. This study illustrates that accurate retrieval of full mitogenomes of Anopheles vectors enables fine-resolution studies of intraspecies genetic relationships, population differentiation, and demographic history. Further investigations on whether An. funestus mitochondrial lineages represent biologically meaningful populations and their potential implications for malaria vector control are warranted.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29899497 PMCID: PMC5997999 DOI: 10.1038/s41598-018-27092-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Sampling sites, methods, numbers (N) and collection dates for whole genome sequenced specimens.
| Site | N | Country | Coordinates (lat., long.) | Method | Collection Date |
|---|---|---|---|---|---|
| Nchelenge | 6 | Zambia | −9.2869, 28.7590 | Indoor CDC-LT | Feb, 2015 |
| Nchelenge | 5 | Zambia | −9.3247, 28.7819 | Indoor PSC | Apr, 2015 |
| Nchelenge | 6 | Zambia | −9.3042, 28.7822 | Indoor CDC-LT | Feb, 2015 |
| Nchelenge | 6 | Zambia | −9.2926, 28.7539 | Indoor PSC | Apr, 2015 |
| Kilwa Island | 5 | Zambia | −9.2675, 28.4500 | Indoor backpack aspiration | Aug, 2014 |
| Kapolowe | 5 | DRC | −10.9398, 26.9530 | Indoor HLC | Apr, 2015 |
| Lupiro | 5 | Tanzania | −8.383, 36.667 | Indoor backpack aspiration | Jun, 2013 |
| Lupiro | 5 | Tanzania | −8.383, 36.667 | Animal-pen backpack aspiration | Jun, 2013 |
Diversity statistics, neutrality tests, and demographic analysis.
| Total | Lineage I | Lineage II | DRC | Zambia | Tanzania | |
|---|---|---|---|---|---|---|
| # Samples | 43 | 29 | 14 | 5 | 28 | 10 |
| # Haplotypes | 41 | 28 | 13 | 5 | 26 | 10 |
| Hd (sd) | 0.998 (0.006) | 0.998 (0.010) | 0.989 (0.031) | 1.000 (0.126) | 0.995 (0.011) | 1.000 (0.045) |
| π (sd) | 0.00625 (0.00054) | 0.00237 (0.00019) | 0.00500 (0.00033) | 0.00163 (0.00022) | 0.00668 (0.00046) | 0.00611 (0.00139) |
| K | 95.93 | 36.320 | 76.725 | 25.000 | 102.571 | 93.756 |
|
| ||||||
| Tajima’s D | −1.075 | −1.841** | −1.040 | −1.210 | −0.514 | 0.023 |
| Fu and Li’s D* | −1.912* | −2.385* | −0.908 | −1.210* | −1.098 | 0.076 |
| Fu and Li’s F* | −1.915 | −2.606** | −1.086 | −1.312 | −1.069 | 0.071 |
| Fu’s Fs | −4.278 | −6.543 | 0.904 | 0.778 | −0.514 | 0.0525 |
|
| ||||||
| SSD | 0.00947 | 0.0123** | 0.0284** | 0.126** | 0.013** | 0.0365 |
| Raggedness index | 0.00309 | 0.00909* | 0.0156 | 0.300 | 0.00638 | 0.0415 |
|
| ||||||
| SSD | 0.00939** | 0.0135 | 0.0202 | 0.118** | 0.0103** | 0.034 |
| Raggedness index | 0.00309 | 0.00909 | 0.0156 | 0.300 | 0.00638 | 0.0415 |
Samples have been split into two general comparisons: lineage I vs lineage II and DRC vs Tanzania vs Zambia. p-value is indicated by *0.10 > p > 0.05 or **p < 0.05.
Figure 1Map and phylogenestic relationships of 43 An. funestus mitogenomes. (A) Map indicating the collection sites for 43 An. funestus samples, created using ArcGIS v10.5.1 (www.esri.org). (B) Bayesian maximum clade credibility phylogeny of complete mitogenomes from the 43 An. funestus samples of the best fitting model (GTR + G + I, Bayesian skyline plot, and a relaxed molecular clock) inferred using BEAST2. Samples are colored by geographic origin: blue indicates Zambia (N = 28), orange indicates Tanzania (N = 10), green indicates DRC (N = 5). Divergence dates (median estimates and 95% HPD) are given in parenthesis for major nodes. Posterior probabilities > 0.5 are indicated at each node. The timescale is indicated below the tree and is in years before present.
Figure 2Haplotype network of 43 An. funestus mitogenomes. In this TCS network, each node indicates a haplotype, with nodes colored according to origin. The number of mutational steps between nodes are indicated in parentheses beside the line connecting one node to another. One group of samples (all lineage II) did not connect to the main cluster within 95 mutational steps (over a 95% confidence limit for connectivity): these are shown in the box in the lower right. There are two distinct groups within the main cluster (lineage I): one more highly clustered (cluster A), and another with fewer, more-distant nodes (cluster B). Cluster A and B in lineage I are separated by ≥42 mutations. The size of each node indicates the number of samples sharing a specific haplotype.
BaTS (Bayesian Tip-association Significance testing).
| Statistic | p-value |
|---|---|
| Association Index | 0.02** |
| Parsimony Score | 0.01** |
| MC (Zambia) | 0.02** |
| MC (Tanzania) | 0.08 |
| MC (DRC) | 1 |
MC = maximum clade size statistic; measures how closely particular sites are associated with monophyletic clade structure. Strength of p-value is indicated by *0.10 > p > 0.05 or **p < 0.05.