| Literature DB >> 29759067 |
Anna Heintz-Buschart1,2,3, Dilmurat Yusuf4,5, Anne Kaysen4,6, Alton Etheridge7, Joëlle V Fritz4,6, Patrick May4, Carine de Beaufort4,6, Bimal B Upadhyaya4, Anubrata Ghosal4,8, David J Galas7, Paul Wilmes9.
Abstract
BACKGROUND: Sequencing-based analyses of low-biomass samples are known to be prone to misinterpretation due to the potential presence of contaminating molecules derived from laboratory reagents and environments. DNA contamination has been previously reported, yet contamination with RNA is usually considered to be very unlikely due to its inherent instability. Small RNAs (sRNAs) identified in tissues and bodily fluids, such as blood plasma, have implications for physiology and pathology, and therefore the potential to act as disease biomarkers. Thus, the possibility for RNA contaminants demands careful evaluation.Entities:
Keywords: Artefact removal; Contaminant RNA; Exogenous RNA in human blood plasma; RNA sequencing; Spin columns
Mesh:
Year: 2018 PMID: 29759067 PMCID: PMC5952572 DOI: 10.1186/s12915-018-0522-7
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
Fig. 1Workflow of the initial screen for and validation of exogenous sRNA sequences in human plasma samples
Sequences of non-human sRNAs found in plasma preparations, synthetic sRNA templates, primers and annealing temperatures
| Name | RNA sequence | Average counts per million in 10 plasma samples | Potential origin of sequence | Primer sequence | Annealing temperature |
|---|---|---|---|---|---|
| sRNA 1 | (CU)AACAGACCGAGGACUUGAA(U) | 133,700 | algae | AACAGACCGAGGACTTGAA | 57 °C |
| sRNA 2 | ACGGACAAGAAUAGGCUUCGGCU | 8000 | fungi or plants | ACGGACAAGAATAGGCTTC | 54 °C |
| sRNA 3 | GCCUUGGUUGUAGGAUCUGU | 8200 | plants | GCCTTGGTTGTAGGATCTGT | 57 °C |
| sRNA 4 | GCCAGCAUCAGUUCGGUGUG | 6800 | bacteria | CAGCATCAGTTCGGTGTG | 57 °C |
| sRNA 5 | GAGAGUAGGACGUUGCCAGGUU | 3900 | bacteria | AGTAGGACGTTGCCAGGTT | 57 °C |
| sRNA 6 | UUGAAGGGUCGUUCGAGACCAGGACGUUGAUAGGCUGGGUG | 3400 | bacteria | GAAGGGTCGTTCGAGACC | 57 °C |
| UCCUGUACUGAGCUGCCCCGAG | human | –* | 60 °C |
* hsa-miR486-5p specific assay from Quanta BIOSCIENCES
Fig. 2Detection of non-human sRNA species in column eluates and their removal from columns: a qPCR amplification of six non-human sRNA species in extracts from human plasma and qPCR control (water). b Detection of the same sRNA species in mock extracts without input to extract columns and water passed through extraction columns (‘eluate’). c Levels of the same sRNA species in mock extracts without and with DNase treatment during the extraction. d Relative levels of sRNA remaining after pre-treatment of extraction columns with bleach or washing ten times with water, detected after eluting columns with water. All: mean results of three experiments, measured in reaction duplicates; error bars represent one standard deviation; data points are available in Additional file 2: Tables S7–S10. Experiments displayed in panels b and d were performed on the same batch of columns, a and c on independent batches
Fig. 3Detection of contaminant sequences in published sRNA sequencing datasets of low biomass samples. Datasets are referenced by NCBI bioproject accession or first author of the published manuscript. n number of samples in the dataset, E extraction kit used (if this information is available), Q regular miRNeasy (QIAGEN), T TRIzol (Thermo Fisher), P mirVana PARIS RNA extraction kit (Thermo Fisher), V mirVana RNA extraction kit with phenol, Rpm reads per million. Error bars indicate one standard deviation
Fig. 4Confirmed and potential contaminant sequences in eluates of regular and ultra-clean RNeasy spin columns: a Levels of contaminant sequences in eluates of two batches of regular and four batches of ultra-clean spin columns, based on qPCR; ultra-clean batches 1 and 2 are cleaned-up versions of regular batch 2 and ultra-clean batches 3 and 4 are cleaned-up versions of regular batch 3; error bars indicate one standard deviation; data points are available in Additional file 2: Table S11. b and c Numbers of different further potential contaminant sequences on the regular and ultra-clean spin columns from two different batches. d Total levels of further potential contaminant sequences, based on sRNA sequencing data normalised to spike-in levels. Cpm counts per million
Fig. 5Titration experiment: Detection of contaminants in sRNA preparations of human plasma using different input volumes and extraction columns. a Detected levels of the six contaminant sRNA sequences in sRNA sequencing data of preparations using 0 to 1115 μL human plasma and regular or ultra-clean RNeasy spin columns. b Detailed view of the data displayed in a for 100 μL of human plasma as input to regular and ultra-clean RNeasy spin columns. Cpm counts per million. Error bars indicate one standard deviation; data points are available in Additional file 2: Table S12
Fig. 6Summary: Recommendations for artefact-free analysis of sRNA by sequencing