| Literature DB >> 20333199 |
Benjamin Dickins1, Anton Nekrutenko.
Abstract
Experimental evolution in rapidly reproducing viruses offers a robust means to infer substitution trajectories during evolution. But with conventional approaches, this inference is limited by how many individual genotypes can be sampled from the population at a time. Low-frequency changes are difficult to detect, potentially rendering early stages of adaptation unobservable. Here we circumvent this using short-read sequencing technology in a fine-grained analysis of polymorphism dynamics in the sentinel organism: a single-stranded DNA phage PhiX174. Nucleotide differences were educed from noise with binomial filtering methods that harnessed quality scores and separate data from brief phage amplifications. Remarkably, a significant degree of variation was observed in all samples including those grown in brief 2-h cultures. Sites previously reported as subject to high-frequency polymorphisms over a course of weeks exhibited monotonic increases in polymorphism frequency within hours in this study. Additionally, even with limitations imposed by the short length of sequencing reads, we were able to observe statistically significant linkage among polymorphic sites in evolved lineages. Additional parallels between replicate lineages were apparent in the sharing of polymorphic sites and in correlated polymorphism frequencies. Missense mutations were more likely to occur than silent mutations. This study offers the first glimpse into "real-time" substitution dynamics and offers a robust conceptual framework for future viral resequencing studies.Entities:
Keywords: experimental evolution; next-generation sequencing; substitution analysis
Year: 2009 PMID: 20333199 PMCID: PMC2817424 DOI: 10.1093/gbe/evp029
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FAnalytical schema depicting filters employed (blue) and statistical analyses undertaken (green). All analyses were nonparametric. TP0: time point zero; DRP: deviant read proportion.
FHistograms showing the frequencies of genome positions with different deviant read proportions across time point zero samples (A) and experimental samples (B). Gray bars indicate frequencies from significantly polymorphic sites only, and blue bars or portions of bars indicate the additional numbers of positions removed by the binomial filter (see text). Figure 2 is continued on the following page.
FCircular histograms of deviant read proportions across the genome in lineages B and C. The height of each spoke represents deviant read proportions at each genome position from 0% (at the inner ring) to the threshold value (for highly polymorphic sites) of ∼1.51% (at the outer ring; for exact frequencies of highly polymorphic sites, see table 1). Colors represent time points with red = time point 1, green = time point 3, and blue = time point 4.
Highly Polymorphic Sites Identified by the Maximum Deviation Filter
Effect of Filter on Number of Sites in Each Sample with A) Threshold-Based and B) Binomial Filters
FScatterplots showing the deviant read proportions of polymorphic sites in both lineages at given time points. (A) Highly polymorphic sites detected in one or the other or both samples are shown; (B): significantly polymorphic sites detected only in both samples are shown.
Summary of Linkage Analysis
Proportions of Mutations Identified as Significantly Polymorphic by Binomial Filter
| Sample | Percentage Significant Transitions | Percentage Significant Transversions | Ti/Tv Ratio | Percentage Significant Missense | Percentage Significant Silent | NS/S Ratio |
| A1 | 34.74 | 46.74 | 0.743**** | 43.56 | 42.20 | 1.032 |
| A2 | 36.63 | 47.73 | 0.767**** | 45.05 | 43.29 | 1.041 |
| A3 | 32.10 | 39.77 | 0.807**** | 38.48 | 36.39 | 1.057• |
| A4 | 34.18 | 45.27 | 0.755**** | 42.64 | 39.55 | 1.078** |
| B1 | 7.87 | 37.73 | 0.209**** | 29.07 | 23.17 | 1.255**** |
| B3 | 10.75 | 43.72 | 0.246**** | 34.31 | 27.85 | 1.232**** |
| B4 | 9.39 | 32.91 | 0.285**** | 26.8 | 19.70 | 1.362**** |
| C1 | 5.07 | 20.16 | 0.251**** | 16.18 | 12.10 | 1.337**** |
| C3 | 5.83 | 24.21 | 0.241**** | 19.49 | 13.74 | 1.419**** |
| C4 | 7.87 | 25.97 | 0.303**** | 21.18 | 16.51 | 1.283**** |
NOTE.—Sample: sample identity; percentage significant transitions/transversions/missense/silent: percentage of mutations of the indicated type identified as significant by the binomial filter, that is, the probability of significant change; Ti/Tv ratio: probability ratio of transitions to transversions; NS/S ratio: ratio for missense to silent changes. Significance probabilities indicated by: •P < 0.10, **P < 0.01, ****P < 0.0001.
FBox and whisker plots showing ranked deviant read proportions associated with different classes of polymorphism in each experimental sample. (A) Transitions and transversions contrasted; (B): missense and silent mutations contrasted. Ranking is carried out within each polymorphism class but between samples.
Region-Specific Analysis of Significant Proportions of Missense and Silent Changes
| Region | Overlapping | Nonoverlapping | |||||
| Gene | Time Point | Percentage Significant Missense | Percentage Significant Silent | NS/S Ratio | Percentage Significant Missense | Percentage Significant Silent | NS/S Ratio |
| A | 1 | 21.11 | 16.12 | 1.310* | 23.94 | 18.48 | 1.295*** |
| 3 | 24.25 | 19.01 | 1.276* | 29.07 | 22.28 | 1.305**** | |
| 4 | 21.41 | 15.50 | 1.382* | 26.21 | 19.63 | 1.336**** | |
| B | 1 | 20.39 | 18.44 | 1.105 | — | — | — |
| 3 | 23.76 | 20.44 | 1.162 | — | — | — | |
| 4 | 21.10 | 16.22 | 1.301* | — | — | — | |
| D | 1 | 21.54 | 21.28 | 1.012 | 21.06 | 16.39 | 1.285 |
| 3 | 26.42 | 26.41 | 1.000 | 24.38 | 16.80 | 1.451* | |
| 4 | 23.90 | 24.10 | 0.992 | 21.31 | 14.34 | 1.485 | |
| E | 1 | 23.62 | 14.80 | 1.596** | — | — | — |
| 3 | 28.50 | 19.90 | 1.432** | — | — | — | |
| 4 | 25.16 | 20.15 | 1.249• | — | — | — | |
NOTE.—Gene: reading frame analyzed; time point: time points analyzed (lineages merged); region(overlapping/nonoverlapping): region of gene analyzed (for exact specification, see text); percentage significant missense/silent: percentage of mutations of indicated type identified as significant by the binomial filter, that is, probability of significant change; NS/S ratio: ratio for missense to silent changes. Significance probabilities indicated by •P < 0.10, *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001.