| Literature DB >> 34996998 |
Iliyas Rashid1,2,3, Melina Campos1, Travis Collier1, Marc Crepeau1, Allison Weakley4, Hans Gripkey1, Yoosook Lee5, Hanno Schmidt6, Gregory C Lanzaro7.
Abstract
Using high-depth whole genome sequencing of F0 mating pairs and multiple individual F1 offspring, we estimated the nuclear mutation rate per generation in the malaria vectors Anopheles coluzzii and Anopheles stephensi by detecting de novo genetic mutations. A purpose-built computer program was employed to filter actual mutations from a deep background of superficially similar artifacts resulting from read misalignment. Performance of filtering parameters was determined using software-simulated mutations, and the resulting estimate of false negative rate was used to correct final mutation rate estimates. Spontaneous mutation rates by base substitution were estimated at 1.00 × 10-9 (95% confidence interval, 2.06 × 10-10-2.91 × 10-9) and 1.36 × 10-9 (95% confidence interval, 4.42 × 10-10-3.18 × 10-9) per site per generation in A. coluzzii and A. stephensi respectively. Although similar studies have been performed on other insect species including dipterans, this is the first study to empirically measure mutation rates in the important genus Anopheles, and thus provides an estimate of µ that will be of utility for comparative evolutionary genomics, as well as for population genetic analysis of malaria vector mosquito species.Entities:
Mesh:
Substances:
Year: 2022 PMID: 34996998 PMCID: PMC8742016 DOI: 10.1038/s41598-021-03943-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Whole genome sequencing coverage results for each sample categorization.
| Species | Sample type | N | Coverage | |||
|---|---|---|---|---|---|---|
| 25th | Median | 75th | Mean | |||
| Female parent | 1 | 16 | 21 | 25 | 20.4 | |
| Putative male parent | 4 | 24 | 31 | 40 | 32.4 | |
| Focal offspring | 10 | 22.1 | 26 | 32 | 25.7 | |
| Bait offspring | 19 | 8.9 | 13 | 17 | 13.1 | |
| Female parent | 1 | 30 | 36 | 41 | 35.7 | |
| Putative male parent | 2 | 22 | 28 | 34 | 28.2 | |
| Focal offspring | 13 | 27 | 32 | 38 | 32.2 | |
| Bait offspring | 17 | 11 | 14 | 17 | 14.4 | |
Number of samples (N) and whole genome coverage results for parents and offspring types (focal and bait) for A. coluzzii and A. stephensi.
25th 25th percentile, 75th 75th percentile.
Figure 1Male parent identification of A. coluzzii and A. stephensi offspring. Boxplots of relatedness values between each potential male parent and all offspring. Both species had a unique female parent and 4 potential male parents for A. coluzzii and 2 for A. stephensi. ***p-value < 0.0001; Wilcoxon rank-sum test.
Mendelian inheritance for male parent identification.
| Species | Parent pair (female, male) | Homozygous for different allele | Number of sites | Heterozygous genotypes in all offspring (percentage) |
|---|---|---|---|---|
Ac-F0-F01 Ac-F0-M01 | Ref × Alt Alt × Ref | 9038 8227 | 6962 (77.03%) 6584 (80.03%) | |
Ac-F0-F01 Ac-F0-M02 | Ref × Alt Alt × Ref | 40,095 39,555 | 8 (0.02%) 2 (0.01%) | |
Ac-F0-F01 Ac-F0-M03 | Ref × Alt Alt × Ref | 33,316 28,923 | 3 (0.01%) 15 (0.05%) | |
Ac-F0-F01 Ac-F0-M04 | Ref × Alt Alt × Ref | 18,591 14,986 | 5 (0.03%) 3 (0.02%) | |
As19-STE2-PF1 As19-STE2-PM1 | Ref × Alt Alt × Ref | 19,740 29,616 | 16,523 (83.70%) 25,100 (84.75%) | |
As19-STE2-PF1 As19-STE2-PM2 | Ref × Alt Alt × Ref | 30,949 43,070 | 44 (0.14%) 42 (0.10%) |
The Mendelian inheritance-based approach for paternity analysis reveals the number of sites at which all offspring are heterozygous for different pairs of parents where parents are homozygous for different alleles.
Candidate mutation detection summary.
| Filtering site steps | ||
|---|---|---|
| Total variants | 6,150,332 | 2,556,842 |
| Biallelic SNPs | 5,141,452 (83.6%) | 2,189,630 (85.6%) |
| Reference homozygous sites DP ≥ 10 | 298,527 (4.9%) | 45,338 (1.8%) |
| Sites without alternate allele in bait offspring | 277,677 (4.5%) | 35,959 (1.4%) |
| Sites called in every focal offspring | 256,261 (4.2%) | 33,203 (1.3%) |
| One or two heterozygous allele in focal | 170 | 49 |
| 27% < AAF < 73% on a heterozygous site | 8 | 6 |
The number of sites filtered out at each step of the candidate mutation detection protocol.
DP depth, AAF alternate allele frequency.
Candidate de novo mutations detected in A. coluzzii and A. stephensi by whole genome sequencing and validated by Sanger re-sequencing.
| Chromosome, position | Ref | Alt | Focal offspring | Allele depth | Mean depth | Confirmed | Location | ||
|---|---|---|---|---|---|---|---|---|---|
| Ref | Alt | Parents | Focal | ||||||
| 2L, 21501855 | A | G | Ac-F1-M01 | 16 | 6 | 35 | 28.1 | No | – |
| 3L, 10978191 | G | A | Ac-F1-M04 | 8 | 9 | 31.5 | 26.9 | Yes | Intron |
| 3L, 30182152 | A | C | Ac-F1-M05 | 15 | 8 | 31.5 | 28.2 | Yes | Intergenic |
| 3L, 32059779 | C | T | Ac-F1-F09 | 11 | 7 | 26.5 | 17.8 | Yes | Intergenic |
| X, 1692026 | C | T | As19-STE2-F23 | 18 | 8 | 22 | 18.6 | Yes | Intergenic |
| X, 14920753 | C | T | As19-STE2-M15 | 0 | 12 | 27 | 17.8 | Yes | Intron |
| 2, 29186821 | T | A | As19-STE2-M09 | 15 | 19 | 39.5 | 30.8 | Yes | Intergenic |
| 3, 6181251 | G | C | As19-STE2-M02 | 13 | 14 | 26.5 | 26.3 | Yes | Intron |
| 3, 56020205 | G | T | As19-STE2-F14 | 10 | 9 | 35.5 | 27.2 | Yes | Intergenic |
Genomic position, reference (Ref.) and alternate (Alt.) alleles in focal offspring and allele depth. Read depth mean for parents and focal offspring. All candidates were re-sequenced by Sanger sequencing for validation.
Available direct estimates of mutation rate in vertebrates and invertebrates.
| Species | Mutation rate | Genome size (Mb) | References |
|---|---|---|---|
| 1.1 × 10–8 | 3232 | [ | |
| 1.2 × 10–8 | 3309 | [ | |
| 0.94 × 10–8 | 2797 | [ | |
| 1.22 × 10–8 | 3084 | [ | |
| 4.5 × 10–9 | 2350 | [ | |
| 5.4 × 10–9 | 2671 | [ | |
| 4.6 × 10–9 | 1118 | [ | |
| 3.4 × 10–9 | 247 | [ | |
| 3.6 × 10–9 | 433 | [ | |
| 2.8 × 10–9 | 148 | [ | |
| 2.9 × 10–9 | 269 | [ | |
| 2.1 × 10–9 | 210 | [ | |
| 1.0 × 10–9 | 280 | This study | |
| 1.4 × 10–9 | 240 | This study | |
Mutation rate and genome size of different species of vertebrates and invertebrates.
Figure 2Mutation rate estimation and number of generations per year. Plot of number of generations per year (x-axis) against mutation rate estimations (y-axis) for six species of mammals, green circles (Canis lupus, Mus musculus, Chlorocebus sabaeus, Gorilla gorilla, Pan troglodytes and Homo sapiens), a bird, violet circle (Ficedula albicollis), the two species of mosquitoes estimated in this study, blue circles and five non-mosquito insects (Chironomus riparius, Drosophila melanogaster, Apis mellifera, Bombus terrestris, Heliconius melpomene) cyan circles. Error bars correspond to 95% CI values when provided.
Figure 3Experimental and analytical designs. Schematics of experimental and analytical procedures for direct mutation rate estimation which includes a series of parameters to exclude false positive mutations and account for false negatives. (1) Sampling & sequencing designs: experimental female and male mating, and whole genome sequencing of parents and offspring at high and low coverage levels. (2) Father identification: two methods are represented (relatedness and Mendelian heritage). (3) De novo mutation detection: candidate de novo mutation is absent from both parents. (4) Manual curation: Integrated Genome Viewer (IGV) for mapping visualization of reads of candidate mutation list. (5) Sanger validation: PCR amplification and re-sequencing of candidate mutation. (6) False Negative Rate (FNR): 1000 synthetic mutations were inserted in each focal offspring; reads are processed and the de novo mutation detection program performed as in (3).