| Literature DB >> 25865270 |
David Weetman1, Sara N Mitchell2, Craig S Wilding3, Daniel P Birks1, Alexander E Yawson4,5, John Essandoh6, Henry D Mawejje7, Luc S Djogbenou8,9, Keith Steen1, Emily J Rippon1, Christopher S Clarkson1, Stuart G Field10, Daniel J Rigden11, Martin J Donnelly1,12.
Abstract
Functionally constrained genes are ideal insecticide targets because disruption is often fatal, and resistance mutations are typically costly. Synaptic acetylcholinesterase (AChE) is an essential neurotransmission enzyme targeted by insecticides used increasingly in malaria control. In Anopheles and Culex mosquitoes, a glycine-serine substitution at codon 119 of the Ace-1 gene confers both resistance and fitness costs, especially for 119S/S homozygotes. G119S in Anopheles gambiae from Accra (Ghana) is strongly associated with resistance, and, despite expectations of cost, resistant 119S alleles are increasing significantly in frequency. Sequencing of Accra females detected only a single Ace-1 119S haplotype, whereas 119G diversity was high overall but very low at non-synonymous sites, evidence of strong purifying selection driven by functional constraint. Flanking microsatellites showed reduced diversity, elevated linkage disequilibrium and high differentiation of 119S, relative to 119G homozygotes across up to two megabases of the genome. Yet these signals of selection were inconsistent and sometimes weak tens of kilobases from Ace-1. This unexpected finding is attributable to apparently ubiquitous amplification of 119S alleles as part of a large copy number variant (CNV) far exceeding the size of the Ace-1 gene, whereas 119G alleles were unduplicated. Ace-1 CNV was detectable in archived samples collected when the 119S allele was rare in Ghana. Multicopy amplification of resistant alleles has not been observed previously and is likely to underpin the recent increase in 119S frequency. The large CNV compromised localization of the strong selective sweep around Ace-1, emphasizing the need to integrate CNV analysis into genome scans for selection.Entities:
Keywords: Ace-1 G119S; Acetylcholinesterase; gene duplication; malaria mosquito; purifying selection
Mesh:
Substances:
Year: 2015 PMID: 25865270 PMCID: PMC4447564 DOI: 10.1111/mec.13197
Source DB: PubMed Journal: Mol Ecol ISSN: 0962-1083 Impact factor: 6.185
Fig. 1EHH analysis showing LD decay with increasing distance from the 119 codon core position (marked as the origin on the x-axis) in Anopheles gambiae from Accra. Negative numbers are SNPs in the telomeric (T) direction and positive numbers in the centromeric (C) direction plotted on a physical scale with a total span of approximately 2000 bp. For the 119G wild-type data (filled circles), the 95% CI were estimated by bootstrapping. Only a single haplotype was associated with the 119S resistance mutation (open circles), and therefore, EHH = 1 for all SNP positions. Inset panel: long-range haplotype bifurcation plot for 119G (119S shows no bifurcation) illustrating patterns of recombination in each direction, with orientation as in main panel. The core is marked by the dark circle; each SNP is represented by a node; and a recombination (or possibly mutation) event is represented by a bifurcation. The diameter of the circle at each SNP node is proportional to the number of individuals with the same long-range haplotype at that position.
Association of G119S with bendiocarb resistance in females from Accra, Ghana
| Alive | Pop freq | Alive | Pop freq | |
|---|---|---|---|---|
| (A) 2008 Genotype | ||||
| 119G/G | 0 | 33 | 1 | 94 |
| 119G/S | 88 | 30 | 3 | 24 |
| 119S/S | 48 | 10 | 1 | 1 |
| Total N | 136 | 73 | 5 | 119 |
| Probability | 5.5 × 10−11 | 0.004 | ||
(A) Frequency of each Ace-1 G119S genotype in the insecticide-unexposed Accra population (mixed suburbs as in sequence data) and in bioassay survivors from the same collection sites for A. gambiae and A. coluzzii. (B) Genotypes of females from the Madina suburb of Accra, alive and dead following bendiocarb bioassays. Footnotes show probability of (no) association from Fisher's exact tests and (in B) the allelic odds ratio (OR) for association of G119S with bioassay survivorship.
Fig. 2Frequency (with 95% CI) of the Ace-1 119S-resistant allele in Anopheles gambiae from the suburb of Madina in Accra over a period of approximately 4 years. The linear trend represented by the regression is highly significant (Cochrane–Armitage = 14.2, P = 0.0002), whereas the nonlinear component is not ( = 0.88, P = 0.64).
Fig. 3Variation in diversity at microsatellites distributed approximately symmetrically over 2 Mb around Ace-1 G119S in all 2008 Accra samples of Anopheles gambiae. (A) Heterozygosity ±95% CI for 119S (resistant) homozygotes (red triangles) and 119G (wild-type) homozygotes (blue diamonds) plotted against an ordinal scale, numbered from the Ace-1 G119S position in a telomeric (T) to centromeric (C) orientation. The purple line without markers is a deterministic model prediction for a hard selective sweep calculated from wild-type heterozygosities and realistic parameter estimates for the selection coefficient, mutation and recombination rates. *microsatellites detected as significant outliers by their (poor) fit to model prediction (B) Ratio of expected heterozygosities at each locus between groups: ln(HR/HS) plotted against a physical scale. The dashed line shows the 95% significance threshold for a two-tailed Z-test. The purple line without markers shows the equivalent model prediction to that in A; loci identified as outliers in A are shown as unfilled points.
Fig. 4Microsatellite differentiation, measured as diversity-corrected FST between 119S/S (resistant) and 119G/G (susceptible) homozygote Anopheles gambiae. All loci showed significant differentiation (after correction for multiple testing) apart from some of the outermost loci, shown as unfilled bars. Dark red bars show model-fit outliers (see Fig.3). The medians for FST’ across loci on each side of G119S are shown by dashed lines; loci at −103 and −5 are excluded for symmetry because there is no paired microsatellite location on the opposing side.
Deviations from Hardy–Weinberg equilibrium (measured by FIS) and possible sources of scoring errors at microsatellite loci
| 119S/S | 119G/G | |||||
|---|---|---|---|---|---|---|
| Locus (kb) | Null freq | Null freq | ||||
| −2120 | −0.21 | 0.571 | −0.17 | 0.050 | ||
| −1030 | −0.01 | 0.348 | −0.03 | 0.057 | ||
| −554 | −0.16 | 1.000 | 0.22 | 0.066 | ||
| −372 | 0.00 | 0.513 | 0.09 | 0.427 | ||
| −322 | −0.07 | 1.000 | 0.12 | 0.533 | ||
| −199 | 0.11 | 0.196 | 0.15 | 0.288 | ||
| −151 | 0.05 | 0.445 | −0.04 | 0.910 | ||
| −103 | 0.55 | 0.190 | 0.67 | 0.450 | ||
| −73 | −0.12 | 1.000 | 0.00 | 0.931 | ||
| −50 | −0.79 | 0.12 | 0.416 | |||
| −24 | 0.34 | 0.008 | 0.057 | 0.25 | 0.166 | |
| −14 | 0.65 | 0.090 | 0.63 | 0.273 | ||
| −5 | −0.01 | 1.000 | 0.46 | 0.215 | ||
| 15 | 0.64 | 0.198 | 0.12 | 0.036 | ||
| 29 | −0.75 | 0.32 | 0.004 | 0.192 | ||
| 58 | −0.85 | 0.12 | 0.063 | |||
| 75 | −0.33 | 0.36 | 0.374 | |||
| 150 | 0.54 | 0.199 | 0.36 | 0.301 | ||
| 193 | −0.09 | 1.000 | −0.01 | 0.021 | ||
| 271 | −0.11 | 1.000 | 0.18 | 0.488 | ||
| 388 | 0.29 | 0.009 | 0.227 | 0.44 | 0.288 | |
| 606 | 0.05 | 0.922 | 0.25 | 0.034 | 0.167 | |
| 1000 | −0.02 | 0.626 | 0.14 | 0.041 | ||
| 2040 | 0.10 | 0.334 | 0.28 | 0.229 | ||
Loci are ordered by distance from Ace-1 G119S with FIS values and probabilities (for the null hypothesis of FIS = 0), shown in bold where significant after Bonferroni correction for multiple testing. Null allele frequency estimates are shown where suggested as probable by Micro-Checker. Significant cases of heterozygote excess are offset to the left;
loci exhibiting significantly poor fit to the hard sweep model (see Fig.3).
Fig. 5G119S polymorphism and Ace-1 CNV estimation in Anopheles gambiae. (A) Samples from the 2008 Accra collection used for microsatellite genotyping and gene sequencing; (B) archived samples from several sites within Greater Accra sampled in 2002. Blue circle = 119G/G; red triangle = 119S/S; yellow diamond = 119G/S. The x-axis shows the dye balance ratio, which indicates the ratio of fluorescent signal from 119 serine (resistant): 119 glycine (susceptible) alleles at the qPCR endpoint. The y-axis shows Ace-1 copy number estimated as ΔΔCT values calculated for products from three Ace-1 exons (± standard deviation).