| Literature DB >> 29423032 |
Giulio Ferrero1,2, Francesca Cordero1,3, Sonia Tarallo3, Maddalena Arigoni4, Federica Riccardo4, Gaetano Gallo5,6, Guglielmo Ronco7, Marco Allasia8, Neha Kulkarni4, Giuseppe Matullo3,9, Paolo Vineis3,10, Raffaele A Calogero4, Barbara Pardini3,9, Alessio Naccarati3,11.
Abstract
The role of non-coding RNAs in different biological processes and diseases is continuously expanding. Next-generation sequencing together with the parallel improvement of bioinformatics analyses allows the accurate detection and quantification of an increasing number of RNA species. With the aim of exploring new potential biomarkers for disease classification, a clear overview of the expression levels of common/unique small RNA species among different biospecimens is necessary. However, except for miRNAs in plasma, there are no substantial indications about the pattern of expression of various small RNAs in multiple specimens among healthy humans. By analysing small RNA-sequencing data from 243 samples, we have identified and compared the most abundantly and uniformly expressed miRNAs and non-miRNA species of comparable size with the library preparation in four different specimens (plasma exosomes, stool, urine, and cervical scrapes). Eleven miRNAs were commonly detected among all different specimens while 231 miRNAs were globally unique across them. Classification analysis using these miRNAs provided an accuracy of 99.6% to recognize the sample types. piRNAs and tRNAs were the most represented non-miRNA small RNAs detected in all specimen types that were analysed, particularly in urine samples. With the present data, the most uniformly expressed small RNAs in each sample type were also identified. A signature of small RNAs for each specimen could represent a reference gene set in validation studies by RT-qPCR. Overall, the data reported hereby provide an insight of the constitution of the human miRNome and of other small non-coding RNAs in various specimens of healthy individuals.Entities:
Keywords: microRNAs; next-generation sequencing; non-invasive biomarkers; small non-coding RNA profiling; surrogate tissues
Year: 2017 PMID: 29423032 PMCID: PMC5790449 DOI: 10.18632/oncotarget.23203
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1(A) Schematic representation of the computational pipeline applied in the analysis of small RNA-Seq dataset from healthy individuals. The modules of the pipeline designed for miRNAs and other sncRNAs are depicted in orange and green, respectively. (B) Bar plot showing for each specimen, the average number of sequencing reads aligned to miRNA annotations (green), unmapped on miRNA annotations but mapped on human genome (red), and unmapped on both miRNA annotations and the human genome (blue). (C) Table reporting the average, minimum, and maximum number of reads (in million) composing the starting datasets, aligned in the different analysis phases, or assigned to specific RNA annotations. HS= Homo sapiens.
Figure 2(A) Venn diagram reporting the number of miRNAs detected in different specimens from healthy individuals and their overlap. (B) Heat map showing the log10 number of normalized reads supporting the miRNAs specifically detected in one specimen or commonly detected among them. (C) PCA plot showing the small RNA-Seq datasets separation obtained using miRNAs detected in samples analysed.
Figure 3(A) Venn diagram reporting the number of non-miRNA sncRNA species detected in different specimens from healthy individuals and their overlap. (B) PCA plot showing the small RNA-Seq datasets separation obtained using the non-miRNA RNA species detected in the samples analysed. (C) Heat map showing the log10 number of normalized reads supporting the non-miRNA RNA species detected in one specimen only or commonly detected among them.
Figure 4(A) Bar plot showing the top 10 miRNAs characterized by the lower ratio between the MAD and the median expression levels in plasma exosome, stool, or urine samples. (B) Box plot showing the log10 number of normalized reads supporting miRNAs characterized by the lower ratio between the MAD and the median expression level in plasma exosomes stool, or urine samples. (C) Bar plot showing the top 10 non-miRNA sncRNA species characterized by the lower ratio between the MAD and the median expression levels in plasma exosome, stool, or urine samples. (D) Box plot showing the log10 number of normalized reads supporting non-miRNA sncRNA species characterized by the lower ratio between the MAD and the median expression level in plasma exosomes stool, or urine samples.