| Literature DB >> 34470578 |
Annelien Morlion1,2, Celine Everaert1,2, Justine Nuytens1,2, Eva Hulstaert1,2,3, Jo Vandesompele1,2, Pieter Mestdagh1,2.
Abstract
Long non-coding RNAs (lncRNAs) are a heterogeneous group of transcripts that lack protein coding potential and display regulatory functions in various cellular processes. As a result of their cell- and cancer-specific expression patterns, lncRNAs have emerged as potential diagnostic and therapeutic targets. The accurate characterization of lncRNAs in bulk transcriptome data remains challenging due to their low abundance compared to protein coding genes. To tackle this issue, we describe a unique short-read custom lncRNA capture sequencing approach that relies on a comprehensive set of 565,878 capture probes for 49,372 human lncRNA genes. This custom lncRNA capture approach was evaluated on various sample types ranging from artificial high-quality RNA mixtures to more challenging formalin-fixed paraffin-embedded tissue and biofluid material. The custom enrichment approach allows the detection of a more diverse repertoire of lncRNAs, with better reproducibility and higher coverage compared to classic total RNA-sequencing.Entities:
Keywords: FFPE; RNA abundance; RNA expression; RNA sequencing; biofluid; lncRNA; lncRNome; probes
Mesh:
Substances:
Year: 2021 PMID: 34470578 PMCID: PMC8682977 DOI: 10.1080/15476286.2021.1971438
Source DB: PubMed Journal: RNA Biol ISSN: 1547-6286 Impact factor: 4.652
Figure 1.Custom capture sequencing (Custom) is able to detect more lncRNAs than total RNA-sequencing (Total). Quantification based on combined reference of Ensembl and LNCipedia. a: RNA biotype distribution plot of mapped reads where 1 and 2 indicate the two different donors and A and B refer to MAQCA and MAQCB, respectively (lncRNAs: high-confidence lncRNAs based on LNCipedia 5.2; miscRNA: miscellaneous RNA, non-coding RNA that cannot be classified; MT RNR gene: mitochondrially encoded ribosomal RNAs; protein coding: protein coding RNA transcripts; pseudogene; rRNA (45S): (45S) ribosomal RNA; s(no)RNA: small nuclear/nucleolar RNA; ucgenes: unannotated cancer genes; other: T cell receptor genes, Immunoglobulin genes, TEC (To be Experimentally Confirmed) – regions with EST clusters that have polyA features that could indicate the presence of protein coding genes, vaultRNA – short non coding RNA genes that form part of the vault ribonucleoprotein complex; microRNAs; ribozymes); b: number of unique lncRNAs with at least 10 counts (filter threshold), data points from same donor or MAQC type are linked (grey lines); c: overlap between lncRNAs that are detected above threshold in all replicates of a certain library prep method, plots made with eulerr package (v6.1.0) in R; d: correlation and density plots of overlapping (grey) and specific lncRNAs for custom capture (orange) and total RNA-sequencing (yellow); lncRNAs below count threshold in both methods were left out
Figure 2.Custom capture seq (Custom) has a higher lncRNA count reproducibility and coverage than total RNA-seq (Total). Cumulative distributions of absolute log2 fold changes (log2 FC) between lncRNA counts in the two technical replicates are shown for (A) FFPE from donor 1, (C) MAQCA, and (E) seminal plasma from donor 2. Kolmogorov–Smirnov tests each time showed significant difference in distributions between Total and Custom (p-value < 0.001). Boxplot of corresponding transcripts per million (TPM) values of these lncRNAs are shown in (B) for FFPE, (D) for MAQCA, and (F) for seminal plasma
Figure 3.Higher coverage for prostate-cancer related lncRNAs with custom capture (Custom) than total RNA-sequencing (Total). Heatmaps based on z-score transformed lncRNA counts of seminal plasma samples from donor 1 (A) and donor 2 (B), respectively. Per donor, only lncRNAs detected above count threshold (10 counts) in at least one replicate were considered. A higher z-score (orange/red) indicates relatively more coverage. Complete clustering of samples based on Euclidean distance. R1: technical replicate 1; R2: technical replicate 2