| Literature DB >> 29703133 |
Zachary T Herbert1, Jamie P Kershner2, Vincent L Butty3, Jyothi Thimmapuram4, Sulbha Choudhari4, Yuriy O Alekseyev5,6, Jun Fan7, Jessica W Podnar8, Edward Wilcox9, Jenny Gipson10, Allison Gillaspy10, Kristen Jepsen11, Sandra Splinter BonDurant12, Krystalynne Morris13, Maura Berkeley1, Ashley LeClerc5, Stephen D Simpson13, Gary Sommerville1, Leslie Grimmett1, Marie Adams14, Stuart S Levine15.
Abstract
BACKGROUND: Ribosomal RNA (rRNA) comprises at least 90% of total RNA extracted from mammalian tissue or cell line samples. Informative transcriptional profiling using massively parallel sequencing technologies requires either enrichment of mature poly-adenylated transcripts or targeted depletion of the rRNA fraction. The latter method is of particular interest because it is compatible with degraded samples such as those extracted from FFPE and also captures transcripts that are not poly-adenylated such as some non-coding RNAs. Here we provide a cross-site study that evaluates the performance of ribosomal RNA removal kits from Illumina, Takara/Clontech, Kapa Biosystems, Lexogen, New England Biolabs and Qiagen on intact and degraded RNA samples.Entities:
Keywords: ABRF; Illumina; NGS; RNAseq; Transcriptomics; rRNA depletion
Mesh:
Substances:
Year: 2018 PMID: 29703133 PMCID: PMC6389247 DOI: 10.1186/s12864-018-4585-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Design of the ribosomal depletion study: Schematic of the sample processing is shown. A single sample of UHR RNA with SIRV spike-ins was kept intact or heat degraded followed by addition of the ERCC spike-in. The two samples were then distributed to the participating sites where they were run as technical duplicates for each kit. All graphics were either produced by the authors or are public domain images that are no longer under copyright
Fig. 2Properties of the rRNA depleted libraries: a) Fraction of reads mapping to nuclear rRNA shown. Site number indicated by color. Intact samples are shown as circles, degraded samples are shown as diamonds. Kit abbreviations: RZ = RiboZero Gold, LX = Lexogen RiboCop, Q = Qiagen GeneRead rRNA Depletion, NE = NEBNext rRNA Depletion, K=Kapa RiboErase, CR = Clontech Ribogone, CZ = SMARTer Pico total RNA. b) Reads were mapped to exons in UCSC known gene and scored based on strand of alignment. c) Fraction of reads mapping to mt rRNA shown as in A. *- RiboZero site 3 used standard RiboZero instead of RiboZero Gold
Fig. 3Protein Coding Gene Detection in rRNA Depleted Libraries: a) Non-mtRNA reads were mapped to the ENSENBL annotation and grouped by transcript type. Fraction of reads associated with transcript types > 1% shown. Data sets ordered by site then intact/degraded status within each kit top to bottom. b) Number of genes detected at >1RPKM (dark blue) and > 0.1RPKM (light blue) shown for each replicate. Genes ordered the same as in A but left to right. c) Changes in RNA detection compared to Illumina RiboZero. Hierarchical clustering genes with fold changes > 2 and Benjamini corrected p-values < 0.001 are shown (union of all comparisons). d) Count of genes with fold changes > 2 and Benjamini corrected p-values < 0.001 are shown for each kit as compared to Illumina RiboZero. Increased detection shown in red, decreased detection shown in green. e) Distribution of read lengths for transcripts detected at higher (red) or lower (green) rate relative to RiboZero. f) Distribution of GC% for transcripts detected at higher (red) or lower (green) rate relative to RiboZero
Fig. 4LincRNA Detection in rRNA Depleted Libraries: a) Mean fraction of non-rRNA reads assigned to lincRNAs based on the ENSEMBL annotation. b) Number of lincRNAs detected over specified RPM levels. c) Fraction of lincRNA mapped reads assigned to the top 4 lincRNAs detected for each sample. Data sets ordered by site then intact/degraded status within each kit left to right. Average RPM counts for each lincRNA was calculated and the top four lincRNAs were shown keeping remaining lincRNAs in ‘Others’ category. d) Overlap of lincRNAs detected by the three core library prep methodologies: ribosomal pulldown (RLQ), RNAse H (NCK), and ZapR (CZ). Average RPM counts for each lincRNA for all samples in each of the three core library methods (RLQ = RZ, LX, Q; NCK = NE, CR, K; CZ = CZ) was calculated and lincRNAs with average RPM > 0 were compared among the methods
Fig. 5Effect of rRNA Depletion Chemistry on Spike-In Controls: a) Effect of degradation of SIRVs on the ratio of SIRV reads to ERCC reads in each replicate. Percent of reads mapping to SIRVs out of total reads mapping to Spike-ins in shown. Data sets ordered by intact/degraded status followed by site within each kit left to right. b) Relative ratio of reads mapping to SIRV1. Fraction of reads mapping to each isoform of SIRV1 are shown for each replicate. Expected fractions shown as dark horizontal lines. Light horizontal lines show 2-fold changes in fraction observed (log scale). Each SIRV1 isoform is shown in a different color. Replicates ordered as in a. c) Boxplot of normalized transcripts per million (TPM) for subsets of SIRVs. SIRVs present at 4×, 1× and 1/4× in the pool were randomized and normalized TPM scores for all sites for two sets of 10 SIRVs were plotted across different chemistries using intact samples. 5th, 25th,Median, 75th and 95th percentile are shown. d) Boxplot as in c except red boxplot highlights shortest quintile (len < 480 nt) and green includes longest quintile (len > 2200). e) Boxplot as in c except red boxplot highlights lowest GC quintile (< 36% GC) and green includes highest quintile (> 44.5% GC). f) Boxplot as in c except red highlights random set of 10 transcripts included at 4× while green includes set of 10 transcripts at 1/4×
User Experience with Different rRNA Depletion Chemistries: Study participants were surveyed for their opinions on the ease of use of each kit they tested
| Kit | Site | Used kit before | Comfort levela |
|---|---|---|---|
| RZ | 1 | Yes | 3-4 |
| 2 | Yes | 5 | |
| 3 | Yes | 5 | |
| 4 | Yes | 5 | |
| LX | 1 | No | 5 |
| 2 | No | 5 | |
| 3 | No | 3 | |
| 4 | No | 3 | |
| Q | 1 | No | 5 |
| 2 | No | 4 | |
| 3 | No | 4 | |
| 4 | No | 5 | |
| NE | 1 | No | 2-3 |
| 2 | Yes | 4 | |
| 3 | No | 4 | |
| 4 | No | 5 | |
| CR | 1 | No | 4 |
| 2 | No | 4 | |
| 3 | No | 3-4 | |
| 4 | No | 5 | |
| K | 1 | No | 4 |
| 2 | Yes | 3-4 | |
| 3 | No | 5 | |
| 4 | No | 3-4 | |
| CZ | 1 | No | 4 |
| 2 | No | 4 | |
| 3 | Yes | 3-4 | |
| 4 | No | 2 |
aOn a scale of 1-5 with 1 being not at all comfortable and 5 being very comfortable