| Literature DB >> 31530647 |
Sonia Tarallo1, Giulio Ferrero2, Barbara Pardini3,4, Alessio Naccarati1,5, Francesca Cordero1,2, Gaetano Gallo6,7, Antonio Francavilla1, Giuseppe Clerico7, Alberto Realis Luc7, Paolo Manghi8, Andrew Maltez Thomas8, Paolo Vineis1,9, Nicola Segata8.
Abstract
Dysbiotic configurations of the human gut microbiota have been linked to colorectal cancer (CRC). Human small noncoding RNAs are also implicated in CRC, and recent findings suggest that their release in the gut lumen contributes to shape the gut microbiota. Bacterial small RNAs (bsRNAs) may also play a role in carcinogenesis, but their role has been less extensively explored. Here, we performed small RNA and shotgun sequencing on 80 stool specimens from patients with CRC or with adenomas and from healthy subjects collected in a cross-sectional study to evaluate their combined use as a predictive tool for disease detection. We observed considerable overlap and a correlation between metagenomic and bsRNA quantitative taxonomic profiles obtained from the two approaches. We identified a combined predictive signature composed of 32 features from human and microbial small RNAs and DNA-based microbiome able to accurately classify CRC samples separately from healthy and adenoma samples (area under the curve [AUC] = 0.87). In the present study, we report evidence that host-microbiome dysbiosis in CRC can also be observed by examination of altered small RNA stool profiles. Integrated analyses of the microbiome and small RNAs in the human stool may provide insights for designing more-accurate tools for diagnostic purposes.IMPORTANCE The characteristics of microbial small RNA transcription are largely unknown, while it is of primary importance for a better identification of molecules with functional activities in the gut niche under both healthy and disease conditions. By performing combined analyses of metagenomic and small RNA sequencing (sRNA-Seq) data, we characterized both the human and microbial small RNA contents of stool samples from healthy individuals and from patients with colorectal carcinoma or adenoma. With the integrative analyses of metagenomic and sRNA-Seq data, we identified a human and microbial small RNA signature which can be used to improve diagnosis of the disease. Our analysis of human and gut microbiome small RNA expression is relevant to generation of the first hypotheses about the potential molecular interactions occurring in the gut of CRC patients, and it can be the basis for further mechanistic studies and clinical tests.Entities:
Keywords: gut microbiome; human stool samples; microRNAs; small RNAs
Year: 2019 PMID: 31530647 PMCID: PMC6749105 DOI: 10.1128/mSystems.00289-19
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
FIG 1(A) Violin plots reporting the relative abundances of differentially abundant bacterial phyla among healthy, adenoma, and carcinoma groups from metagenomic data analysis. P values were computed using the Wilcoxon rank sum test and adjusted using the Benjamini-Hochberg method. **, adjusted P value < 0.01; *, adjusted P value < 0.05. (B) Violin plots reporting the relative abundances of E. coli from metagenomic data analysis. P values were computed using the Wilcoxon rank sum test and adjusted using the Benjamini-Hochberg method. **, adjusted P value < 0.01. (C) Stacked bar plot reporting the fraction of sRNA-Seq reads assigned to hsa-miRNAs or hsa-sncRNAs (blue), human genome (green), or bsRNAs (orange) or that were not mapped (red). (D) Violin plots reporting the expression levels for the most significant differentially expressed hsa-miRNA (left) and hsa-sncRNA (right) between the healthy and carcinoma groups (DESeq2 adjusted P value < 0.05).
FIG 2(A) Flow chart summarizing the analyses performed to identify and analyze the sRNA-Seq reads assigned to microbial genomes and bsRNAs. WMS, whole-metagenome sequencing. (B) Stacked bar plots reporting the relative abundances of bacterial phyla detected using whole-metagenome sequencing (top) and small RNA sequencing (sRNA-Seq; bottom) data, respectively. (C) Bar plot reporting the Pearson correlation coefficient (r) computed from comparisons between metagenomic and sRNA-Seq data for each phylum. (D) Heat map reporting the log2 ratios of relative abundances between bsRNAs and bacterial DNA profiles. Only ratios of bacterial species with median greater than 1 are reported. P.aeu, P. aeruginosa. (E) Heat map representing the log10 numbers of reads assigned to bsRNAs from the Bacteria Small RNA Database (BSRD). Only annotations that were significantly different (adjusted P value < 0.05) between healthy, adenoma, and CRC groups are shown.
FIG 3(A) Bar plot reporting the list of hsa-miRNAs and hsa-sncRNAs correlated with the abundance of E. coli. (B) Heat map showing the KEGG pathways significantly enriched in the targets of hsa-miRNAs correlated with E. coli abundance. Highlighted in bold are the hsa-miRNAs annotated as corresponding to the KEGG term “Pathogenic Escherichia coli Infection.” (C) STRING network representation of the interactions between genes belonging to the KEGG pathway “Pathogenic Escherichia coli infection” and targeted by hsa-miRNAs correlated with E. coli abundance. Nodes are colored based on their association with specific gene ontology biological processes. Edge thickness is proportional to the interaction confidence computed by STRING.
FIG 4(A) Heat map reporting the area under the curve (AUC) computed by the Random Forest classifier using bacterial relative abundances provided by metagenomic data (bDNA), sRNA-Seq data (bsRNAs), and the combination of both bDNA and bsRNAs and combined with the expression levels of hsa-miRNAs (hsa-miRNAs + bDNA + bsRNAs). (B) Line plot reporting the AUC computed by the Random Forest classifier. For the classification of carcinoma and healthy samples, a specific number of features from different input information is reported. Rankings are obtained excluding testing set to avoid overfitting issues. (C) Bar plot reporting the average classification contribution of each of the 32 features providing the best classification accuracy of cancer and healthy samples. All the reported bacterial features were obtained using sRNA-Seq.