| Literature DB >> 35927305 |
Namjoo Kim1, Jeong-An Gim2, Beom Jae Lee3, Byung Il Choi1, Hee Sook Yoon1, Seung Han Kim1, Moon Kyung Joo1, Jong-Jae Park1, Chungyeul Kim4.
Abstract
Various omics-based biomarkers related to the occurrence, progression, and prognosis of colorectal cancer (CRC) have been identified. In this study, we attempted to identify gut microbiome-based biomarkers and detect their association with host gene expression in the initiation and progression of CRC by integrating analysis of the gut mucosal metagenome, RNA sequencing, and sociomedical factors. We performed metagenome and RNA sequencing on colonic mucosa samples from 13 patients with advanced CRC (ACRC), 10 patients with high-risk adenoma (HRA), and 7 normal control (NC) individuals. All participants completed a questionnaire on sociomedical factors. The interaction and correlation between changes in the microbiome and gene expression were assessed using bioinformatic analysis. When comparing HRA and NC samples, which can be considered to represent the process of tumor initiation, 28 genes and five microbiome species were analyzed with correlation plots. When comparing ACRC and HRA samples, which can be considered to represent the progression of CRC, seven bacterial species and 21 genes were analyzed. When comparing ACRC and NC samples, 16 genes and five bacterial species were analyzed, and four correlation plots were generated. A network visualizing the relationship between bacterial and host gene expression in the initiation and progression of CRC indicated that Clostridium spiroforme and Tyzzerella nexilis were hub bacteria in the development and progression of CRC. Our study revealed the interactions of and correlation between the colonic mucosal microbiome and host gene expression to identify potential roles of the microbiome in the initiation and progression of CRC. Our results provide gut microbiome-based biomarkers that may be potential diagnostic markers and therapeutic targets in patients with CRC.Entities:
Mesh:
Year: 2022 PMID: 35927305 PMCID: PMC9352898 DOI: 10.1038/s41598-022-17823-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Integrative analysis and visualization of the 30 samples using three datasets. In the 30 samples (13 advanced colorectal cancers [ACRCs], 10 high risk adenomas [HRAs], and 7 normal controls [NCs]), there were 529 species identified by metagenome analysis and 763 genes identified by RNA sequencing. The survey results identified 93 associated variables. Each dataset was merged and normalized to a value between 0 and 1. Principal component analysis (PCA) was applied to reduce the dimension of features from each dataset, and 10 principal components (PCs) were retrieved. (A) Each two-dimensional PCA plot represents two PCs, reduced from 1385 features. (B) The scree plot represents 10 PCs. (C) The biplot represents the direction of each feature. (D) In the ternary plot, 1385 features corresponding to ACRCs, HRAs, and NCs are presented as points and are distinguished by three colors.
Figure 2Alpha diversity, distribution, and different patterns of the metagenome analysis. (A) Five diversity indices were visualized as boxplots. Points and horizon bars indicate the means and the medians of each group, respectively. The operational taxonomic unit (OTU), the Chao1, the Shannon index, the inverse Simpson index, and the Good’s coverage are provided. The three groups are advanced colorectal cancer (ACRC; n = 13), high-risk adenoma (HRA; n = 10), and normal controls (NC; n = 7). (B) Relative abundance of each classification level. The relative abundance of each sample is listed by phylum, class, order, family, genus, and species. (C) Heatmap showing the relative abundance of significantly different bacterial species between the three groups. Of 528 species, 8 were significantly different between the three groups (ANOVA, p < 0.05). Each row indicates 8 species and is classified by group. Statistical significance is indicated as a row annotation bar, and darker green indicates greater significance. Each column represents an individual patient, and their labels are indicated as a column annotation bar. Each cell of the heatmap indicates the relative abundance of species, with colors gradually changing from blue to red, corresponding to low and high relative abundance, respectively. (D) Boxplots showing the ratio of the abundance in each group of the top three taxa in six classification levels. In each boxplot analysis, the statistical significance of the three pairings (ACRC vs. NC, ACRC vs. HRA, and HRA vs. NC) was analyzed by t-test, the overall significance was analyzed by Kruskal–Wallis test, and the p-value is presented. ANOVA analysis of variance.
Figure 3Correlation between RNA sequencing and metagenome results. (A) Correlation plots and (B) scatter plots comparing ACRC and HRA samples. (C) Correlation plots and (D) scatter plots comparing ACRC and NC samples. (E) Correlation plots and (F) scatter plots comparing HRA and NC samples. In correlation plots, color indicates correlation coefficients, and circle size indicates statistical significances. Only correlation coefficients with p-values < 0.05 are shown. The two analysis results with the highest positive and negative correlation coefficients are presented as scatter plots and regression lines. p-values of each correlation analysis are indicated in the correlation plots (*p < 0.05, **p < 0.01, ***p < 0.001). ACRC advanced colorectal cancer, HRA high-risk adenoma, NC normal control.
Figure 4Interaction networks between differentially expressed genes (DEGs) and relative abundance of microbial strains. (A) Network analysis of 13 advanced colorectal cancer (ACRC) samples and 10 high-risk adenoma (HRA) samples versus 7 normal control (NC) samples. (B) Network analysis of 13 ACRC samples versus 10 HRA samples. (C) Network analysis of 13 ACRC samples versus 7 NC samples. (D) Network analysis of 10 HRA samples and 7 NC samples. Genes that are enriched in the word group are indicated in red; all other genes are in blue. Species with high relative abundance in the word group are indicated in red; all other species are in blue.