| Literature DB >> 21408087 |
Susan M Rottschaefer1, Michelle M Riehle, Boubacar Coulibaly, Madjou Sacko, Oumou Niaré, Isabelle Morlais, Sekou F Traoré, Kenneth D Vernick, Brian P Lazzaro.
Abstract
The three-gene APL1 locus encodes essential components of the mosquito immune defense against malaria parasites. APL1 was originally identified because it lies within a mapped QTL conferring the vector mosquito Anopheles gambiae natural resistance to the human malaria parasite, Plasmodium falciparum, and APL1 genes have subsequently been shown to be involved in defense against several species of Plasmodium. Here, we examine molecular population genetic variation at the APL1 gene cluster in spatially and temporally diverse West African collections of A. gambiae. The locus is extremely polymorphic, showing evidence of adaptive evolutionary maintenance of genetic variation. We hypothesize that this variability aids in defense against genetically diverse pathogens, including Plasmodium. Variation at APL1 is highly structured across geographic and temporal subpopulations. In particular, diversity is exceptionally high during the rainy season, when malaria transmission rates are at their peak. Much less allelic diversity is observed during the dry season when mosquito population sizes and malaria transmission rates are low. APL1 diversity is weakly stratified by the polymorphic 2La chromosomal inversion but is very strongly subdivided between the M and S "molecular forms." We find evidence that a recent selective sweep has occurred at the APL1 locus in M form mosquitoes only. The independently reported observation of a similar M-form restricted sweep at the Tep1 locus, whose product physically interacts with APL1C, suggests that epistatic selection may act on these two loci causing them to sweep coordinately.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21408087 PMCID: PMC3050937 DOI: 10.1371/journal.pbio.1000600
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 8.029
Figure 1Schematic representation of proteins encoded by APL1A, APL1B, and APL1C genes.
The two major structural variants of APL1A are shown separately. APL1A alleles are characterized by the deletion of the PANGGL region. APL1A alleles carry an early stop codon that eliminates the coiled-coil domain.
Population genetic parameter estimates at the APL1 locus in four collections.
| Collection | n | bp | πtot
| θtot
| TajD | πsyn
| πnon
|
|
| |||||||
| Bancoumana dry | 19 | 1,669 | 0.019 | 0.032 | −1.734 | 0.033 | 0.016 |
| Bancoumana rainy | 9 | 1,665 | 0.048 | 0.054 | −0.587 | 0.086 | 0.039 |
| Toumani-Oulena | 12 | 1,541 | 0.084 | 0.074 | 0.668 | 0.119 | 0.066 |
| Makouchetoum | 8 | 1,541 | 0.088 | 0.078 | 0.678 | 0.119 | 0.070 |
| All pooled | 48 | 1,537 | 0.075 | 0.065 | 0.541 | 0.114 | 0.059 |
|
| |||||||
| Bancoumana dry | 19 | 2,005 | 0.014 | 0.017 | −0.685 | 0.017 | 0.015 |
| Bancoumana rainy | 12 | 2,077 | 0.030 | 0.032 | −0.213 | 0.046 | 0.030 |
| Toumani-Oulena | 16 | 2,067 | 0.042 | 0.043 | −0.079 | 0.074 | 0.036 |
| Makouchetoum | 12 | 1,968 | 0.039 | 0.047 | −0.832 | 0.070 | 0.033 |
| All pooled | 59 | 1,902 | 0.034 | 0.046 | −0.921 | 0.057 | 0.031 |
|
| |||||||
| Bancoumana dry | 15 | 2,569 | 0.006 | 0.009 | −1.308 | 0.010 | 0.005 |
| Bancoumana rainy | 10 | 2,569 | 0.028 | 0.028 | −0.055 | 0.059 | 0.021 |
| Toumani-Oulena | 16 | 2,410 | 0.027 | 0.027 | −0.002 | 0.050 | 0.020 |
| Makouchetoum | 12 | 2,393 | 0.030 | 0.029 | 0.231 | 0.064 | 0.020 |
| All pooled | 53 | 2,393 | 0.031 | 0.025 | 0.556 | 0.061 | 0.024 |
The Bancoumana dry season collection is almost entirely M form mosquitoes, the Toumani-Oulena and Makouchetoum collections are almost entirely S form mosquitoes, and the Bancoumana rainy collection is a mixture of M and S form. These same parameter estimates are given separately for M form and S form mosquitoes in Table S1 and for APL1A and APL1A alleles in Table S2.
Number of alleles sequenced.
Locus size, in base pairs, excluding insertions and deletions.
Average number of differences per pair of alleles, per nucleotide.
Watterson's estimator of the population genetic parameter 4Neμ.
Tajima's D test statistic.
Average number of difference per pair of alleles, per nucleotide, synonymous sites only.
Average number of difference per pair of alleles, per nucleotide, nonsynonymous sites only.
Figure 2Population differentiation among A. gambiae collections at the APL1 locus as estimated by KST*.
Statistical significance determined by permutation of alleles among subpopulation pairs [48]. Mosquitoes sampled during the 2005 rainy season from the humid Toumani-Oulena (TM) and Makouchetoum (MK) regions are undifferentiated. These populations are mildly differentiated from a collection drawn from Bancoumana in the 2005 rainy season (BC rainy). All collections are highly differentiated from a collection drawn from Bancoumana in the 2003 dry season (BC dry). The analysis presented in this figure pools all mosquitoes by site and date of collection and does not take into account 2La karyotype or M/S molecular form.
Figure 3Number of observations of mosquitoes with each 2La inversion karyotype in each molecular form (M and S) over each sample collection.
Population frequencies (in percentages) are given in the margins of each table. “Unk.” indicates that 2La karyotype was not determined.
Figure 4Plot of nucleotide diversity and skew in the site frequency spectrum as a function of physical distance from the APL1 locus.
M form mosquitoes exhibit a sharp drop in polymorphism at the APL1 locus relative to S form mosquitoes, plotted as the ratio of nucleotide diversity (π) in the M form to diversity in the S form. The M form mosquitoes also exhibit an enhanced skew toward rare variants, indicated by negative values of Fu and Li's F* [41]. The data are consistent with a recent selective sweep at APL1 in the M form only.
Genetic diversity and haplotype homozygosity in the M and S molecular forms at the APL1 locus and flanking regions are indicative of selective maintenance of diversity at APL1 in the S molecular form and a recent selective sweep at APL1 within the M form.
| Position | M Form | EW | S Form | |||||||||
| bp | π |
| n | # haps | bp | π |
| n | # haps | EW | ||
| −30 kb | 1,181 | 0.014 | −0.660 | 10 | 10 | 0.100 | 1,181 | 0.014 | −1.667 | 10 | 10 | 0.100 |
| −20 kb | 981 | 0.011 | −0.362 | 10 | 10 | 0.100 | 981 | 0.018 | −0.260 | 10 | 10 | 0.100 |
| −10 kb | 1,144 | 0.014 | −0.372 | 10 | 10 | 0.100 | 1,144 | 0.014 | −0.523 | 10 | 10 | 0.100 |
| −5 kb | 1,162 | 0.008 | −2.086 | 10 | 10 | 0.350 ( | 1,162 | 0.010 | 0.584 | 10 | 10 | 0.120 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| +5 kb | 1,198 | 0.013 | −0.500 | 10 | 9 | 0.120 | 1,198 | 0.020 | −0.021 | 9 | 9 | 0.111 |
| +10 kb | 1,135 | 0.019 | −0.905 | 10 | 9 | 0.120 | 1,135 | 0.018 | −0.616 | 10 | 10 | 0.100 |
| +20 kb | 1,119 | 0.011 | −0.573 | 10 | 10 | 0.100 | 1,119 | 0.016 | −0.602 | 10 | 10 | 0.100 |
| +40 kb | 1,313 | 0.034 | 0.642 | 10 | 9 | 0.120 | 1,313 | 0.046 | −0.178 | 10 | 9 | 0.120 |
Nucleotide diversity (π) at APL1 is greatly reduced in the M form relative to the S form with a strong skew toward rare variants (indicated by negative values of F*) that is not observed in the S form (see also Figure 4). Nucleotide diversity at APL1 is slightly reduced relative to flanking loci within the M form but is greatly elevated relative to flanking loci in the S form. Haplotype diversity is prominently depressed at the APL1 locus, as indicated by high values of the EW statistic [42] that are significantly incompatible with neutral evolution. No such decrease in haplotype diversity is observed in flanking regions progressively distant from the APL1 locus in the M form or at any of the S form loci. Most of the haplotypes in M form mosquitoes differ by only one or a few nucleotide substitutions at the APL1 locus, whereas S form mosquitoes show deeper genealogical structure (see Figure S2).
Locus size, in base pairs, excluding insertions and deletions.
Average number of differences per pair of alleles, per nucleotide.
Skew in the site frequency spectrum, with negative values indicating an excess of rare variants.
Number of alleles sequenced.
Number of distinct haplotypes observed.
Haplotype homozygosity, calculated as the sum of squared observed haplotype frequencies. The use of the EW statistic to contrast the APL1 genes to the flanking regions is very conservative for the detection of selective sweeps in our experimental framework, as many more alleles were sampled at the APL1 locus and the physical region surveyed is larger in the APL1 genes than in flanking regions, both of which allow greater opportunity for recombination to generate distinct haplotypes. p values are given only for loci that depart significantly from the neutral expectation.