| Literature DB >> 32092056 |
Rosemary Bateta1, Norah P Saarman2, Winnie A Okeyo1,3, Kirstin Dion2, Thomas Johnson2, Paul O Mireji1,4, Sylvance Okoth1, Imna Malele5, Grace Murilla1, Serap Aksoy6, Adalgisa Caccone2.
Abstract
Glossina pallidipes is the main vector of animal African trypanosomiasis and a potential vector of human African trypanosomiasis in eastern Africa where it poses a large economic burden and public health threat. Vector control efforts have succeeded in reducing infection rates, but recent resurgence in tsetse fly population density raises concerns that vector control programs require improved strategic planning over larger geographic and temporal scales. Detailed knowledge of population structure and dispersal patterns can provide the required information to improve planning. To this end, we investigated the phylogeography and population structure of G. pallidipes over a large spatial scale in Kenya and northern Tanzania using 11 microsatellite loci genotyped in 600 individuals. Our results indicate distinct genetic clusters east and west of the Great Rift Valley, and less distinct clustering of the northwest separate from the southwest (Serengeti ecosystem). Estimates of genetic differentiation and first-generation migration indicated high genetic connectivity within genetic clusters even across large geographic distances of more than 300 km in the east, but only occasional migration among clusters. Patterns of connectivity suggest isolation by distance across genetic breaks but not within genetic clusters, and imply a major role for river basins in facilitating gene flow in G. pallidipes. Effective population size (Ne) estimates and results from Approximate Bayesian Computation further support that there has been recent G. pallidipes population size fluctuations in the Serengeti ecosystem and the northwest during the last century, but also suggest that the full extent of differences in genetic diversity and population dynamics between the east and the west was established over evolutionary time periods (tentatively on the order of millions of years). Findings provide further support that the Serengeti ecosystem and northwestern Kenya represent independent tsetse populations. Additionally, we present evidence that three previously recognized populations (the Mbeere-Meru, Central Kenya and Coastal "fly belts") act as a single population and should be considered as a single unit in vector control.Entities:
Year: 2020 PMID: 32092056 PMCID: PMC7058365 DOI: 10.1371/journal.pntd.0007855
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Fig 1Map showing sample sites in Kenya and Tanzania and location of the study region within Africa.
Sampling sites shown with dots and labeled with three letter codes as listed in Table 1. The striped region denotes the 11 sampling sites within the Serengeti ecosystem. The map was created in QGIS v2.12.1 (August 2017; http://qgis.osgeo.org) with free and publicly available data from DIVA-GIS (August 2017; http://www.diva-gis.org).
Sampling sites and estimates of their genetic diversity and assignment.
| Site (date) | Code | Lat. | Long. | N | AR | HO | HE | FIS | FIS p-value | qNW | qSW | qE | qO |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Kapesur | KAP | 0.733 | 34.316 | 30 | 2.06 | 0.42 | 0.51 | 0.18 | 0.00 | 0.93 | 0.00 | 0.00 | 0.07 |
| Ruma | RUM | -0.608 | 34.307 | 30 | 1.81 | 0.38 | 0.42 | 0.10 | 0.02 | 1.00 | 0.00 | 0.00 | 0.00 |
| Governor’s Camp | GVR | -1.309 | 35.034 | 30 | 2.3 | 0.49 | 0.58 | 0.16 | 0.00 | 0.00 | 1.00 | 0.00 | 0.00 |
| Mara Talek | MRT | -1.431 | 35.059 | 30 | 2.26 | 0.54 | 0.56 | 0.04 | 0.09 | 0.03 | 0.97 | 0.00 | 0.00 |
| Fig Tree Camp | FGT | -1.436 | 35.194 | 30 | 2.36 | 0.54 | 0.59 | 0.09 | 0.01 | 0.09 | 0.89 | 0.02 | 0.00 |
| Naibosho | NBS | -1.465 | 35.309 | 30 | 2.29 | 0.51 | 0.57 | 0.10 | 0.00 | 0.00 | 1.00 | 0.00 | 0.00 |
| Marabridge | MRB | -1.556 | 35.025 | 30 | 2.44 | 0.52 | 0.62 | 0.16 | 0.01 | 0.00 | 1.00 | 0.00 | 0.00 |
| Grumeti | GTR | -2.092 | 34.322 | 30 | 2.39 | 0.55 | 0.61 | 0.10 | 0.00 | 0.00 | 1.00 | 0.00 | 0.00 |
| Ikorongo | IKR | -2.026 | 34.692 | 30 | 2.42 | 0.54 | 0.62 | 0.13 | 0.00 | 0.00 | 0.93 | 0.00 | 0.07 |
| Kilimafedha | KLM | -2.299 | 34.901 | 30 | 2.43 | 0.58 | 0.63 | 0.08 | 0.00 | 0.00 | 0.98 | 0.02 | 0.00 |
| Maswa North | MSN | -2.674 | 34.401 | 30 | 2.46 | 0.55 | 0.63 | 0.13 | 0.00 | 0.00 | 1.00 | 0.00 | 0.00 |
| Maswa South | MSS | -3.199 | 34.46 | 30 | 2.43 | 0.58 | 0.62 | 0.07 | 0.03 | 0.01 | 0.98 | 0.02 | 0.00 |
| Ngorongoro | NGK | -3.446 | 34.886 | 30 | 2.35 | 0.54 | 0.59 | 0.08 | 0.01 | 0.00 | 0.97 | 0.03 | 0.00 |
| Nguruman | NGU | -1.977 | 36.117 | 30 | 2.12 | 0.51 | 0.54 | 0.06 | 0.04 | 0.03 | 0.10 | 0.87 | 0.00 |
| Meru National Park | MNP | 0.077 | 38.064 | 30 | 2.5 | 0.58 | 0.65 | 0.11 | 0.00 | 0.00 | 0.00 | 1.00 | 0.00 |
| Kibwezi | KIB | -2.416 | 37.954 | 30 | 2.53 | 0.58 | 0.66 | 0.12 | 0.00 | 0.00 | 0.00 | 1.00 | 0.00 |
| Tsavo west | TSW | -3.027 | 38.218 | 30 | 2.55 | 0.57 | 0.65 | 0.13 | 0.00 | 0.00 | 0.00 | 1.00 | 0.00 |
| Kinango | KIN | -4.108 | 38.874 | 30 | 2.55 | 0.55 | 0.65 | 0.16 | 0.00 | 0.00 | 0.02 | 0.98 | 0.00 |
| Tiribe | SHT | -4.338 | 39.264 | 8 | 2.51 | 0.4 | 0.72 | 0.48 | 0.00 | 0.00 | 0.00 | 1.00 | 0.00 |
| Shimba | SHI | -4.152 | 39.420 | 22 | 2.54 | 0.52 | 0.66 | 0.22 | 0.00 | 0.00 | 0.00 | 1.00 | 0.00 |
| Hindi | HND | -2.117 | 40.791 | 30 | 2.54 | 0.56 | 0.65 | 0.14 | 0.00 | 0.00 | 0.03 | 0.97 | 0.00 |
Sampling information including sampling site (and date of collection), code, latitude (Lat.), longitude (Long.), number of samples (N), mean allelic richness across all 11 loci (AR), observed heterozygosity (HO), expected heterozygosity (HE), inbreeding coefficient (FIS), and results from the STRUCTURE [35,36] clustering analysis of average assignment probability (q) to the northwest, southwest, east, and outlier clusters (qNW, qSW, qE, qO, respectively).
Estimates of effective population size (Ne), bottleneck, and relatedness.
| Site | Ne | Ne 95% CI | p-value (TPM) | p-value | AFD | % UR | % HS | % FS | % PO |
|---|---|---|---|---|---|---|---|---|---|
| KAP | 2.7 | 2.2–3.3 | 0.58 | 0.06 | L-shaped | 70.6 | 13.6 | 4.8 | 11.0 |
| RUM | n/a | 82.6-∞ | 0.29 | L-shaped | 80.5 | 11.3 | 3.0 | 5.3 | |
| GVR | 111.3 | 50 - ∞ | 0.90 | 0.29 | L-shaped | 85.7 | 11.7 | 1.6 | 0.9 |
| MRT | 626.1 | 81.8 - ∞ | 0.86 | 0.29 | L-shaped | 84.6 | 12.0 | 0.9 | 2.5 |
| FGT | 203.4 | 60.5 - ∞ | 0.45 | 0.09 | L-shaped | 85.1 | 12.2 | 1.6 | 1.1 |
| NBS | n/a | 279.7 - ∞ | 0.94 | 0.16 | L-shaped | 86.2 | 11.0 | 1.1 | 1.6 |
| MRB | 50.4 | 32.3–98.1 | 0.84 | 0.42 | L-shaped | 87.1 | 8.7 | 2.3 | 1.8 |
| GTR | 41.8 | 27.1–77.6 | 0.77 | 0.29 | L-shaped | 84.6 | 12.6 | 0.9 | 1.8 |
| IKR | 21.1 | 15.8–29.3 | 0.74 | L-shaped | 83.4 | 13.1 | 0.9 | 2.5 | |
| KLM | n/a | 156.4 - ∞ | 0.45 | L-shaped | 86.4 | 11.7 | 0.9 | 0.9 | |
| MSN | 21.3 | 16.1–29.4 | 0.71 | 0.12 | L-shaped | 84.8 | 12.4 | 1.4 | 1.4 |
| MSS | 94.8 | 43.6–22768.4 | 0.16 | L-shaped | 85.5 | 12.4 | 1.6 | 0.5 | |
| NGK | 111.9 | 48.6-∞ | 0.55 | 0.12 | L-shaped | 85.1 | 11.7 | 0.9 | 2.3 |
| NGU | n/a | 204.5-∞ | 0.84 | 0.10 | L-shaped | 79.0 | 14.0 | 2.7 | 4.3 |
| MNP | 86.6 | 646.1 | 0.23 | L-shaped | 84.6 | 13.8 | 1.1 | 0.5 | |
| KIB | 49.2 | 33.3–85.4 | 0.82 | 0.14 | L-shaped | 89.9 | 9.0 | 0.7 | 0.5 |
| TSW | n/a | 444.8-∞ | 0.45 | L-shaped | 90.1 | 9.2 | 0.7 | 0.0 | |
| KIN | n/a | 2252.5-∞ | 0.65 | 0.12 | L-shaped | 91.3 | 7.6 | 1.1 | 0.0 |
| SHT | n/a | 29.3 - ∞ | 0.54 | 0.14 | L-shaped | 100.0 | 0.0 | 0.0 | 0.0 |
| SHI | n/a | 209.4-∞ | 0.78 | 0.42 | L-shaped | 92.2 | 6.9 | 0.4 | 0.4 |
| HND | 3507.0 | 125.8 - ∞ | 0.82 | 0.10 | L-shaped | 90.0 | 8.3 | 1.2 | 0.5 |
Site, Ne estimates (marked n/a if indistinguishable from infinity), the Ne 95% confidence interval (CI), p-value of tests for bottlenecks under the TPM, and IAM mutation models, allele frequency distribution (AFD), and the percent of each sample that was estimated to be unrelated (UR), half-siblings (HS), full-siblings (FS), and part of a parent/offspring relationship (PS) is also reported. Ne was estimated with the LD method in NeESTIMATOR [51], tests for population bottlenecks were run in BOTTLENECK [52], and relatedness was estimated in ML-Relate [69]. Significant at p-value < 0.05 after Benjamini-Hochberg correction for multiple testing are marked *.
Fig 2Results of the Bayesian clustering analyses based on microsatellite data.
Spatially explicit genetic clustering was performed in the program BAPS v 6 [55,56] Vertical bars indicate the probability of assignment (q-value) of an individual to each cluster (S7 Table). Thin vertical lines separate sampling sites reported along the bottom x-axis, and think vertical lines separate the three major clusters reported along the top x-axis.
Pairwise genetic and geographic distance.
| 154.7 | 241.0 | 254.8 | 260.5 | 268.5 | 266.8 | 314.5 | 310.0 | 343.8 | 379.5 | 438.1 | 469.6 | ||||||||||
| 0.123 | 111.4 | 122.7 | 134.1 | 146.1 | 130.4 | 159.9 | 159.4 | 195.7 | 225.1 | 283.9 | 318.0 | ||||||||||
| 0.224 | 0.239 | 13.9 | 22.7 | 35.2 | 27.6 | 117.8 | 88.5 | 111.2 | 167.5 | 219.9 | 238.5 | ||||||||||
| 0.152 | 0.165 | 0.105 | 15.0 | 28.1 | 14.4 | 110.1 | 77.8 | 98.2 | 156.5 | 207.8 | 225.1 | ||||||||||
| 0.118 | 0.145 | 0.104 | 13.2 | 23.0 | 121.4 | 86.2 | 101.5 | 163.7 | 212.6 | 226.4 | |||||||||||
| 0.238 | 0.280 | 0.113 | 0.102 | 33.1 | 130.1 | 92.8 | 103.3 | 168.3 | 214.9 | 225.5 | |||||||||||
| 0.200 | 0.240 | 0.121 | 0.106 | 0.013 | 98.4 | 64.1 | 83.8 | 142.6 | 193.4 | 211.0 | |||||||||||
| 0.228 | 0.263 | 0.020 | 0.117 | 0.095 | 0.015 | 41.8 | 68.4 | 65.5 | 124.3 | 163.3 | |||||||||||
| 0.215 | 0.244 | 0.093 | 0.086 | 0.017 | 38.3 | 79.1 | 133.1 | 159.6 | |||||||||||||
| 0.199 | 0.229 | 0.009 | 0.091 | 0.080 | 69.6 | 111.6 | 127.7 | ||||||||||||||
| 0.201 | 0.218 | 0.011 | 0.100 | 0.015 | 0.004 | 58.8 | 101.5 | ||||||||||||||
| 0.217 | 0.239 | 0.017 | 0.113 | 0.097 | 0.013 | 54.8 | |||||||||||||||
| 0.241 | 0.259 | 0.016 | 0.116 | 0.108 | 0.011 | 0.014 | 0.014 | 0.012 | 0.014 | 0.013 | |||||||||||
| 217.4 | 217.6 | 400.0 | 472.9 | 449.8 | 524.1 | 305.7 | |||||||||||||||
| 0.156 | 277.9 | 346.0 | 474.6 | 509.3 | 518.0 | 389.5 | |||||||||||||||
| 0.170 | 0.017 | 74.0 | 214.3 | 258.7 | 276.2 | 317.4 | |||||||||||||||
| 0.175 | 0.014 | 0.023 | 140.7 | 186.5 | 205.9 | 303.6 | |||||||||||||||
| 0.136 | 0.030 | 0.017 | 0.033 | 50.3 | 76.5 | 307.7 | |||||||||||||||
| 0.163 | 0.077 | 0.082 | 0.093 | 0.088 | 28.5 | 300.0 | |||||||||||||||
| 0.158 | 0.028 | 0.033 | 0.052 | 285.3 | |||||||||||||||||
| 0.161 | 0.026 | 0.024 | 0.082 | ||||||||||||||||||
Pairwise FST and geographic distance (below and above the diagonal, respectively) between site pairs within the (a) northwest and southwest (shaded grey), and (b) east. Pairwise FST was computed in Arlequin [44] based on Weir and Cockerham 1984 [65]. Non-significant FST values (p > 0.05) are in bold.
Fig 3Comparison of FST among and between clusters, and relationship between FST and geographic distance.
(a) Genetic differentiation (FST) computed in Arlequin [44] based on Weir and Cockerham 1984 [65] within and among clusters. Box plots show the mean, 1st and 3rd quartile, 95% quantiles (whiskers) and outliers (dots). Student’s t-tests indicated that average FST was significantly lower in the Southwest than the East (p = 0.0273, marked *), and significantly higher between-cluster comparisons than in the southwest or east (p < 0.0001, marked ***). (b) Genetic versus geographic distance using FST/(1- FST) to correct for finite population sizes [66] plotted for the northwest (star), southwest (downward pointing triangles), east (upward pointing triangles), and between clusters (grey circles), with linear line of best fit with R2 and p-values for Mantel tests for isolation by distance [66,68] performed in the “isolation by distance” web service v3.23 [67].
Fig 4First generation migrants among sampling sites.
Migrants detected using the software GENECLASS [70] indicated with arrows pointed in the direction of movement. Sites were grouped together if less than 50 km apart. Each dot represents a sampling site labeled with site codes (Table 1). The dashed outlines denote the three genetic clusters identified in BAPS v 6 [55,56]. The map was created in QGIS v2.12.1 (August 2017; http://qgis.osgeo.org) with free and publicly available data available from DIVA-GIS (August 2017; http://www.diva-gis.org).