| Literature DB >> 35939523 |
Matthew L Aardema1,2, Michael G Campana3, Nicole E Wagner4, Francisco C Ferreira3,4, Dina M Fonseca4.
Abstract
Understanding patterns of diversification, genetic exchange, and pesticide resistance in arthropod disease vectors is necessary for effective population management. With the availability of next-generation sequencing technologies, one of the best approaches for surveying such patterns involves the simultaneous genotyping of many samples for a large number of genetic markers. To this end, the targeting of gene sequences of known function can be a cost-effective strategy. One insect group of substantial health concern are the mosquito taxa that make up the Culex pipiens complex. Members of this complex transmit damaging arboviruses and filariae worms to humans, as well as other pathogens such as avian malaria parasites that are detrimental to birds. Here we describe the development of a targeted, gene-based assay for surveying genetic diversity and population structure in this mosquito complex. To test the utility of this assay, we sequenced samples from several members of the complex, as well as from distinct populations of the relatively under-studied Culex quinquefasciatus. The data generated was then used to examine taxonomic divergence and population clustering between and within these mosquitoes. We also used this data to investigate genetic variants present in our samples that had previously been shown to correlate with insecticide-resistance. Broadly, our gene capture approach successfully enriched the genomic regions of interest, and proved effective for facilitating examinations of taxonomic divergence and geographic clustering within the Cx. pipiens complex. It also allowed us to successfully survey genetic variation associated with insecticide resistance in Culex mosquitoes. This enrichment protocol will be useful for future studies that aim to understand the genetic mechanisms underlying the evolution of these ubiquitous and increasingly damaging disease vectors.Entities:
Mesh:
Year: 2022 PMID: 35939523 PMCID: PMC9387926 DOI: 10.1371/journal.pntd.0010689
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Comparison of read-mapping between enriched and unenriched libraries.
We report the average number of read pairs after quality trimming, the number of read pairs that properly mapped to the full genome, and the percent of the reads which mapped to the full genome. The standard deviations are given in parentheses. Also included are the number of read pairs that mapped to the target bait regions, the percentage of the properly paired reads that mapped to the target regions, and the percentage of the target bait regions with a coverage of three or more reads (≥3×). For individual sample statistics see S3 Table.
| Post-trimming Read Pairs | Properly Paired Reads Mapped to the Full Genome | % all reads mapped | Properly Paired Mapped Reads (Target Regions) | % Properly paired reads mapped to target regions | % of target regions with coverage ≥3x | |
|---|---|---|---|---|---|---|
|
| 2,504,426 (1,289,201) | 2,036,689 (1,198,782) | 79.8% (11.8%) | 274,708 (146,076) | 13.82% (2.27%) | 52.6% (14.27%) |
|
| 3,000,000 (0*) | 2,439,858 (88,964) | 81.3% (3.0%) | 18,562 (872) | 0.76%. (0.01%) | 10.65% (1.19%) |
For each unenriched dataset, the number of reads used was down sampled to three million after read trimming and quality filtering. See text for more details.
Fig 1Results of Principal Component Analyses.
Shown are the first and second principal components (PC1 & PC2) for all the Cx. pipiens complex samples (left panel) and just the Cx. quinquefasciatus samples (right panel). These analyses were performed with neutral, segregating variants. Taxonomic and population memberships were based on prior designations and collection location respectively.
Fig 2All complex ADMIXTURE results.
Shown are the percent ancestry assignments (Q) for K values 2 through 7 based on our analysis of admixture. Sample designations are given on the left along with taxonomic designations.
Fig 3Cx. quinquefasciatus ADMIXTURE results.
Shown are the percent ancestry assignments (Q) for K values 2 through 7 based on our analysis of admixture. Sample designations are given on the left along with population designations.
Relative genetic diversity within taxa across 916 neutral, bi-allelic, segregating SNPs.
Given are the taxonomic designations (including a category for known hybrid samples), sample size for each taxon, the mean number of heterozygous sites observed per sample with standard error, and the corresponding mean pairwise sample heterozygosity value with standard error.
| Taxon | Sample Size (n) | Mean Number of Heterozygous Sites (SE) | Mean Sample Pairwise Heterozygosity (π) (SE) |
|---|---|---|---|
| known hybrids | 3 | 60.7 (7.9) | 0.066 (0.009) |
| 3 | 25.0 (11.0) | 0.027 (0.012) | |
| 4 | 62.3 (7.2) | 0.068 (0.008) | |
| 2 | 83.5 (5.5) | 0.091 (0.006) | |
| 23 | 20.9 (2.1) | 0.023 (0.002) | |
| 1 | 77 | 0.084 |
taxon identification was based on examination of male genitalia, geographical source, or in some cases expression of autogeny, as well as prior examination with panels of microsatellite loci (please refer to the text)
Relative genetic diversity within populations of Cx. quinquefasciatus across 540 segregating, neutral, bi-allelic SNPs.
Given are the population designation, sample size for each population, the mean number of heterozygous sites observed per sample with standard error, and the corresponding mean pairwise sample heterozygosity value with standard error.
| Population | Sample Size (n) | Mean Number of Heterozygous Sites | Mean Sample Pairwise Heterozygosity |
|---|---|---|---|
| east Asia | 5 | 81.0 (7.9) | 0.150 (0.015) |
| Samoa | 4 | 38.0 (6.9) | 0.070 (0.013) |
| Hawaii | 6 | 55.5 (8.9) | 0.103 (0.017) |
| North America & Caribbean | 4 | 46.5 (8.5) | 0.086 (0.016) |
| Brazil | 2 | 10.5 (6.5) | 0.019 (0.012) |
| Nigeria | 2 | 48.0 (11.0) | 0.089 (0.020) |
Pairwise Fst values between taxa.
Given are both unweighted and weighted measures for each pair of taxa (excluding known hybrid samples). Taxonomic designations were determined prior to this study (see text for more details).
| TAXON | ||||||
|---|---|---|---|---|---|---|
|
| 0.5744 | 0.2752 | 0.4989 | 0.4065 | Unweighted Fst Values | |
|
| 0.7468 | 0.1594 | 0.3166 | 0.2585 | ||
|
| 0.5724 | 0.3192 | -0.1026 | 0.2967 | ||
|
| 0.7167 | 0.5050 | 0.0276 | 0.2656 | ||
|
| 0.7827 | 0.5475 | 0.5634 | 0.6415 | ||
| Weighted Fst Values | ||||||
Pairwise Fst values between Cx. quinquefasciatus populations.
Given are both unweighted and weighted measures for each pair of populations. Population designations were assigned based on collection location (see text for more details).
| POPULATION | east Asia | Samoa | Hawaii | North America & Caribbean | Brazil | Nigeria | |
|---|---|---|---|---|---|---|---|
|
| 0.0514 | 0.0593 | 0.0051 | -0.2280 | 0.0161 | Unweighted Fst Values | |
|
| 0.1611 | 0.0168 | 0.1016 | -0.0514 | 0.1233 | ||
|
| 0.1326 | 0.1026 | -0.0265 | -0.1741 | 0.0578 | ||
|
| 0.1104 | 0.2277 | 0.0327 | -0.2329 | 0.0342 | ||
|
| -0.0668 | 0.1622 | 0.0234 | -0.0638 | -0.1288 | ||
|
| 0.1100 | 0.2387 | 0.1626 | 0.1389 | 0.0497 | ||
| Weighted Fst Values | |||||||
Fig 4Maximum-likelihood phylogeny of samples.
A maximum-likelihood analysis of all samples using a transversional model of mutation with a gamma distribution of rate heterogeneity TVM + Γ; Tavaré, 1986 [71]. 100 bootstrap replicates of the analysis were performed and the bootstrap support for major nodes are shown in red. The colors correspond to the different taxonomic designations.
Summary of Insecticide Resistance-Associated Allele Frequencies.
Given are the genomic position of examined SNPs that were previously found to correlate with resistance (chromosome and base position), gene ID (in the annotated Cx. quinquefasciatus genome), gene name, amino acid change, number of chromosomes examined (e.g., # of samples the variant is called at position * 2), and the frequencies of the susceptible and resistant alleles.
| Chromosome | Position | Gene ID | Gene Name | Amino Acid Change | Number of Chromosomes Examined (n = 72) | Susceptible Allele Frequency | Resistance Allele Frequency | Reference |
|---|---|---|---|---|---|---|---|---|
| supercont3.35 | 864866 | CPIJ002538 | CYP6AG12: cytochrome P450 6AG12 | H293L | 72 | A:0.986 | T:0.014 | [ |
| supercont3.104 | 245887 | CPIJ005956 | CYP6BZ2: cytochrome P450 6BZ2 | R213L, R213Q | 72 | G:0.250 | T:0.417, A:0.333 | [ |
| supercont3.106 | 33980 | CPIJ006034 | ACE-1: acetylcholinesterase | G247S | 56 | G:0.929 | A:0.071 | [ |
| supercont3.106 | 46973 | CPIJ006034 | ACE-1: acetylcholinesterase | T682A | 72 | A:0.917 | G:0.083 | [ |
| supercont3.196 | 232328 | CPIJ008566 | CYP6Z15: cytochrome P450 6Z15 | E243A | 70 | A:0.857 | C:0.143 | [ |
| supercont3.228 | 585169 | CPIJ009085 | CYP6AG13: cytochrome P450 6AG13 | N211D | 52 | A:0.827 | G:0.173 | [ |
| supercont3.510 | 164957 | CPIJ014218 | CYP9M10: cytochrome P450 9M10 | F245I | 66 | A:0.864 | T:0.136 | [ |