| Literature DB >> 25552603 |
Susan M Rottschaefer1, Jacob E Crawford2, Michelle M Riehle3, Wamdaogo M Guelbeogo4, Awa Gneme4, N'Fale Sagnon4, Kenneth D Vernick5, Brian P Lazzaro6.
Abstract
Natural selection is expected to drive adaptive evolution in genes involved in host-pathogen interactions. In this study, we use molecular population genetic analyses to understand how natural selection operates on the immune system of Anopheles coluzzii (formerly A. gambiae "M form"). We analyzed patterns of intraspecific and interspecific genetic variation in 20 immune-related genes and 17 nonimmune genes from a wild population of A. coluzzii and asked if patterns of genetic variation in the immune genes are consistent with pathogen-driven selection shaping the evolution of defense. We found evidence of a balanced polymorphism in CTLMA2, which encodes a C-type lectin involved in regulation of the melanization response. The two CTLMA2 haplotypes, which are distinguished by fixed amino acid differences near the predicted peptide cleavage site, are also segregating in the sister species A. gambiae ("S form") and A. arabiensis. Comparison of the two haplotypes between species indicates that they were not shared among the species through introgression, but rather that they arose before the species divergence and have been adaptively maintained as a balanced polymorphism in all three species. We additionally found that STAT-B, a retroduplicate of STAT-A, shows strong evidence of adaptive evolution that is consistent with neofunctionalization after duplication. In contrast to the striking patterns of adaptive evolution observed in these Anopheles-specific immune genes, we found no evidence of adaptive evolution in the Toll and Imd innate immune pathways that are orthologously conserved throughout insects. Genes encoding the Imd pathway exhibit high rates of amino acid divergence between Anopheles species but also display elevated amino acid diversity that is consistent with relaxed purifying selection. These results indicate that adaptive coevolution between A. coluzzii and its pathogens is more likely to involve novel or lineage-specific molecular mechanisms than the canonical humoral immune pathways.Entities:
Keywords: C-type lectin; JAK-STAT; balancing selection; genetics of immunity; innate immunity; population genetics
Mesh:
Substances:
Year: 2014 PMID: 25552603 PMCID: PMC4349087 DOI: 10.1534/g3.114.014845
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Average pairwise divergence in immune and control groups
| n | Ktotal | KS | KA | KA/KS | |
|---|---|---|---|---|---|
| All data | |||||
| Immune | 20 | 0.0335 | 0.0652 | 0.0120 | 0.1747 |
| Control | 17 | 0.0319 | 0.0545 | 0.0043 | 0.0772 |
| IMD pathway | |||||
| Immune | 8 | 0.0336 | 0.0613 | 0.0121 | 0.1968 |
| Control | 8 | 0.0302 | 0.0526 | 0.0041 | 0.0675 |
| TOLL pathway | |||||
| Immune | 7 | 0.0332 | 0.0713 | 0.0087 | 0.1090 |
| Control | 6 | 0.0335 | 0.0565 | 0.0032 | 0.0555 |
| Exclude IMD pathway | |||||
| Immune | 12 | 0.0335 | 0.0679 | 0.0119 | 0.1600 |
| Control | 10 | 0.0322 | 0.0557 | 0.0043 | 0.0835 |
Number of genes considered.
Average per-gene divergence at all sites between A. coluzzii and A. merus
Average per-gene silent divergence between A. coluzzii and A. merus
Average per-gene replacement divergence between A. coluzzii and A. merus
Average per-gene KA/KS ratio between A. coluzzii and A. merus.
Average pairwise genetic diversity in immune and control groups
| n | πtotal | πs | πa | ||
|---|---|---|---|---|---|
| All data | |||||
| Immune | 19 | 0.0140 | 0.0283 | 0.0048 | −0.434 |
| Control | 19 | 0.0127 | 0.0252 | 0.0016 | −0.764 |
| V = 102, | V = 128, | V = 172, | V = 141, | ||
| IMD pathway | |||||
| Immune | 8 | 0.0182 | 0.0385 | 0.0066 | −0.512 |
| Control | 8 | 0.0134 | 0.0297 | 0.0018 | −0.601 |
| V = 27, | V = 34, | V = 36, | V = 20, | ||
| TOLL pathway | |||||
| Immune | 6 | 0.0116 | 0.0231 | 0.0031 | −0.353 |
| Control | 6 | 0.0141 | 0.0239 | 0.0014 | −0.834 |
| V = 4, | V = 9, | V = 17, | V = 19, |
Number of genes considered, 18–20 alleles sampled per gene.
Average per-gene genetic diversity calculated for all sites.
Average per-gene genetic diversity calculated for synonymous sites.
Average per-gene genetic diversity calculated for nonsynonymous sites.
Average per-gene Tajima’s D calculated using silent sites.
Multilocus MK-test model comparison
| Model | Description | Par | log( | 2∆log( | χ2 | AICc | Akaike Weight | αa | αb |
|---|---|---|---|---|---|---|---|---|---|
| All loci | |||||||||
| M0 | α = 0 | 38 | −615.83 | 1335.9 | 0.806 | [0] | [0] | ||
| M1 | α ∼ (all loci) | 39 | −615.51 | 0.64 | 0.4237 | 1339.0 | 0.169 | [0.07] | [0.07] |
| M2 | α ∼ (control, immune) | 40 | −615.48 | 0.06 | 0.8065 | 1342.8 | 0.025 | 0.04 | 0.09 |
| Imd Pathway | |||||||||
| M0 | α = 0 | 18 | −235.44 | 522.1 | 0.003 | [0] | [0] | ||
| M1 | α ∼ (all loci) | 19 | −228.01 | 14.86 | 0.0001 | 511.3 | 0.695 | [−0.64] | [−0.64] |
| M2 | α ∼ (control, immune) | 20 | −226.72 | 2.58 | 0.1069 | 513.0 | 0.302 | −0.20 | −0.91 |
| Toll pathway | |||||||||
| M0 | α = 0 | 14 | −181.22 | 403.2 | 0.305 | [0] | [0] | ||
| M1 | α ∼ (all loci) | 15 | −178.55 | 5.34 | 0.0208 | 402.1 | 0.520 | [−0.5] | [−0.5] |
| M2 | α ∼ (control, immune) | 16 | −177.37 | 2.36 | 0.1245 | 404.3 | 0.175 | −1.09 | −0.24 |
The Akaike information criterion corrected for sample size.
The likelihood of the model, given the relative support for each of the models tested.
Estimate of the proportion of adaptive substitutions in the control genes. Square brackets indicate where α is constrained by the model.
Estimate of the proportion of adaptive substitutions in the immune genes. Square brackets indicate where α is constrained by the model.
Figure 1Distribution of Tajima’s D across all 37 genes. Tajima’s D was calculated for each gene using silent sites and plotted as a histogram showing deviation from neutral expectations. Loci are ordered based on the value of D to draw attention to the contrast between CTLMA2 and the rest of the dataset.
Figure 2Sliding window analysis of Tajima’s D in CTL4 and CTLMA2. CTL4 and CTLMA2 are located directly adjacent on the chromosome and were PCR amplified and cloned as a single fragment. Tajima’s D was calculated using silent sites in a sliding window along the entire sequenced region using a 200-bp window with a 25-bp step size. The dashed line indicates the expected value of D under a neutral equilibrium model. The schematic below the plot shows the exon structure and direction of transcription for CTL4 and CTLMA2. Positive values of D in the 5′ region of CTLMA2 indicate an excess of intermediate frequency polymorphism in this region.
Figure 3Amino acid alignment of CTLMA2 sequences of A. coluzzii collected in Burkina Faso, along with previously published sequences from A. gambiae and A. arabiensis collected in Kenya (Obbard ). Sites matching the top sequence are indicated with a dot (•). Dashes (–) note unavailable sequence because we sequenced a different amplicon than was available for A. gambiae or A. arabiensis
Population genetic statistics for CTLMA2 haplotype clades and surrounding loci
| n | S | π | D | |
|---|---|---|---|---|
| CTLMA2 all | 20 | 55 | 0.022 | 1.254 |
| Clade A | 10 | 30 | 0.010 | −0.465 |
| Clade B | 10 | 25 | 0.007 | −1.737 |
| CTL4 | 20 | 44 | 0.013 | −0.263 |
| AGAP005327 | 19 | 54 | 0.010 | −1.107 |
Number of alleles sampled.
Number of segregating sites.
Average number of pairwise differences per site.
Tajima’s D calculated using silent sites.
Figure 4Neighbor-joining tree of CTLMA2 sequences from A. coluzzii (blue circles), A. gambiae (black triangle), A. arabiensis (red squares), A. quadriannulatus (open circle), A. merus (open triangle), and A. christyi (open square). Bootstrap support was determined using 1000 bootstrap replicates, and nodes with >50% bootstrap support are shown.
D among species within and between CTLMA2 haplotype clades and in two nearby loci
| Clade A | Clade A | 0.009 |
| Clade B | Clade B | 0.011 |
| Clade A | Clade B | 0.034 |
| Clade B | Clade A | 0.031 |
| AGAP005540 | AGAP005540 | 0.016 |
| APL2 | APL2 | 0.017 |
Sequences from published data (Obbard ).