| Literature DB >> 27765019 |
Jaime I Davila1, Numrah M Fadra2, Xiaoke Wang3, Amber M McDonald3, Asha A Nair2, Barbara R Crusan3, Xianglin Wu3, Joseph H Blommel3, Jin Jen3,4, Kandelaria M Rumilla3, Robert B Jenkins3, Umut Aypar3, Eric W Klee2,3, Benjamin R Kipp3, Kevin C Halling3.
Abstract
BACKGROUND: RNA-seq is a well-established method for studying the transcriptome. Popular methods for library preparation in RNA-seq such as Illumina TruSeq® RNA v2 kit use a poly-A pulldown strategy. Such methods can cause loss of coverage at the 5' end of genes, impacting the ability to detect fusions when used on degraded samples. The goal of this study was to quantify the effects RNA degradation has on fusion detection when using poly-A selected mRNA and to identify the variables involved in this process.Entities:
Keywords: Fusion detection; Poly-A pulldown; RNA degradation; RNA-seq
Mesh:
Substances:
Year: 2016 PMID: 27765019 PMCID: PMC5072325 DOI: 10.1186/s12864-016-3161-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Identification of the BCR-ABL and ABL-BCR fusions in chemically degraded RNA from a KU812 cell line using the TruSeq RNA-seq protocol. a Supporting reads per million reads for BCR-ABL and ABL-BCR at different RIN values (10, 7, 5, 3). b Diagram of ABL and BCR genes and the BCR-ABL and ABL-BCR fusion products showing the different distances of each fusion product from the 3′ end. c Coverage level per million reads as a function of the distance from the 3′ end for the ABL-BCR fusion at different levels of degradation. A loess trend line is depicted for each sample. d Coverage level per million reads as a function of the distance from the 3′ end for the BCR-ABL fusion at different levels of degradation. A loess trend line is depicted for each sample
Fig. 2Median coverage profile as a function of distance from 3′ end of mRNA and read decay across all protein-coding genes and genes involved in fusions for samples with different levels of degradation. a Median coverage per million reads (in log scale) as function of the distance from the 3′ end for UHR sample at different RIN values (8.6, 8.4, 7.6, 5.9, 4.9 and 3.9). Linear trend lines and 95 % confidence intervals are denoted in gray. b Median read decay rate per kilobase for all genes as a function of RIN for chemically degraded samples (UHR and U251 cell line). Notice that U251 had replicates. Linear trend line is shown in blue and 95 % confidence intervals are shown in gray. c Cumulative histogram of the distance of gene fusion breakpoints from the 3′ end for fusions in the COSMIC database. d Median read decay rate per kilobase for fusion related genes as function of RIN for a set of 20 fresh tissue specimens. Linear trend line is shown in blue and 95 % confidence intervals are shown in gray
Fig. 3Estimation of fusion sensitivity as a function of distance of breakpoint from the 3′ end. a Diagram showing the strategy used to estimate the probability of detecting a fusion (i.e. sensitivity) at different distances from the 3′ end by enumerating the proportion of genes having a coverage ≥10x. A indicates that there are more than 10 reads at that particular position for that gene while a X indicates that there are not. b Sensitivity as a function of the distance from the 3′ end for the UHR sample at varying levels of degradation. Loess trend is shown for each sample and 95 % confidence intervals are shown in gray. c Estimated sensitivity (log scale) for five different fusions present in UHR that occur at different distances from the 3′ end at different RIN values. If the fusion was detected at the particular degradation level it is shown as a square and if it is not detected it is shown as a circle. Linear trend line is shown for each fusion and 95 % confidence intervals are shown in gray