| Literature DB >> 28182717 |
Jeffrey R Kugelman1, Michael R Wiley1, Elyse R Nagle1, Daniel Reyes1, Brad P Pfeffer1, Jens H Kuhn2, Mariano Sanchez-Lockhart1, Gustavo F Palacios1.
Abstract
Individual RNA viruses typically occur as populations of genomes that differ slightly from each other due to mutations introduced by the error-prone viral polymerase. Understanding the variability of RNA virus genome populations is critical for understanding virus evolution because individual mutant genomes may gain evolutionary selective advantages and give rise to dominant subpopulations, possibly even leading to the emergence of viruses resistant to medical countermeasures. Reverse transcription of virus genome populations followed by next-generation sequencing is the only available method to characterize variation for RNA viruses. However, both steps may lead to the introduction of artificial mutations, thereby skewing the data. To better understand how such errors are introduced during sample preparation, we determined and compared error baseline rates of five different sample preparation methods by analyzing in vitro transcribed Ebola virus RNA from an artificial plasmid-based system. These methods included: shotgun sequencing from plasmid DNA or in vitro transcribed RNA as a basic "no amplification" method, amplicon sequencing from the plasmid DNA or in vitro transcribed RNA as a "targeted" amplification method, sequence-independent single-primer amplification (SISPA) as a "random" amplification method, rolling circle reverse transcription sequencing (CirSeq) as an advanced "no amplification" method, and Illumina TruSeq RNA Access as a "targeted" enrichment method. The measured error frequencies indicate that RNA Access offers the best tradeoff between sensitivity and sample preparation error (1.4-5) of all compared methods.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28182717 PMCID: PMC5300104 DOI: 10.1371/journal.pone.0171333
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Study sample diagram.
Fig 2Total error by sample preparation method.
(A) The mean read depth per position from two DNA controls (PLASMID and P_AMP) and five RNA sample preparation methods (n = 2 of replicate experiments). (B) Errors calculated as error/site/copy (plasmid or transcript) are presented as the average of duplicate experiments as in A. Intra-host single nucleotide variants present in the original plasmid were removed from the calculation of the error (reference positions: 4,697 and 4,725). Student t-tests were performed to demonstrate differences between the mean error per site per copy in the control plasmid (* = p<0.05) and each sample preparation method. (C) The error calculated in B is converted to fold change over the control (“PLASMID”). Error bars in all panels represent standard deviation of the mean.
Fig 3Percentage of errors by type of acquired diversity determined during sample preparation.
(A) Percentage of error attributed to synonymous vs. non-synonymous variants. (B) Percentage of errors attributed to transitions vs. transversions. (C) Percentage of errors attributed to insertions or deletions vs. Intra-host single nucleotide variants (iSNV).
Fig 4Sources of error per sample preparation method.
(A) Intra-host single nucleotide variants (iSNVs) that were present in multiple samples at greater than 1% of population are shown in a heat map format to visualize patterned diversity acquired during sample preparation. The total number of samples containing iSNVs in greater than 1% of population are summarized in the column “#> 1% of Pop” in green. “Codon” column (second column from right) provides both the nucleotide and protein translation of the site. (B) The detected mean error rates of iSNVs greater than 0. 2% of population (error/site/copy) are stratified by presence in the plasmid (Origin), detection after transcription/reverse transcription (Transc) or preparation, and preparation/sequencer error (Prep). (C) Error profiles are expressed as the number of iSNVs per percent of population obtained from each of the 5 sample preparation methods.