| Literature DB >> 34548904 |
Matthew Devall1,2, Christopher H Dampier1,2, Stephen Eaton1,2, Mourad W Ali1,2, Virginia Díez-Obrero3,4,5,6, Ferran Moratalla-Navarro3,4,5,6, Jennifer Bryant1,2, Lucas T Jennelle1,2, Victor Moreno3,4,5,6, Steven M Powell7, Ulrike Peters8, Graham Casey1,2.
Abstract
Tobacco smoke and red/processed meats are well-known risk factors for colorectal cancer (CRC). Most research has focused on studies of normal colon biopsies in epidemiologic studies or treatment of CRC cell lines in vitro. These studies are often constrained by challenges with accuracy of self-report data or, in the case of CRC cell lines, small sample sizes and lack of relationship to normal tissue at risk. In an attempt to address some of these limitations, we performed a 24-hour treatment of a representative carcinogens cocktail in 37 independent organoid lines derived from normal colon biopsies. Machine learning algorithms were applied to bulk RNA-sequencing and revealed cellular composition changes in colon organoids. We identified 738 differentially expressed genes in response to carcinogens exposure. Network analysis identified significantly different modules of co-expression, that included genes related to MSI-H tumor biology, and genes previously implicated in CRC through genome-wide association studies. Our study helps to better define the molecular effects of representative carcinogens from smoking and red/processed meat in normal colon epithelial cells and in the etiology of the MSI-H subtype of CRC, and suggests an overlap between molecular mechanisms involved in inherited and environmental CRC risk. Copyright:Entities:
Keywords: colon organoids; microsatellite instability; single-cell deconvolution; smoking; weighted gene co-expression network analysis
Year: 2021 PMID: 34548904 PMCID: PMC8448508 DOI: 10.18632/oncotarget.28058
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Regression analysis of cell composition differences in response to carcinogen exposure.
(A) Carcinogens exposed organoids were associated with a reduced stemness index. (B) Heatmap of gene expression for signature matrix genes shows that selected genes are able to stratify cell types in single-cell data. (C) Enrichment analysis to determine overlap between DEGs identified by regression on cell score in colon organoids and known markers of cell types. Larger circles indicate a greater overlap between total number of marker genes and those identified as being significant in each regression. Increasing odds ratios generated from enrichment Fisher’s Exact tests is represented as a transition from light to dark blue. (D) Cell score regression analysis between treatment conditions: * P < 0.05; ** P < 0.01; *** P < 0.001.
Figure 2Summary of analysis of carcinogen exposure of organoids following adjustment for cell composition.
(A) Boxplot to show the proportion of gene-wise variance explained by each covariate within the mixed-effects regression model. (B) Volcano plot of carcinogen DEGs. ‘Prior’ and ‘Novel’ denote genes that were and were not previously identified in original analysis respectively (C) Volcano plot of BarcUVa-Seq analysis. DEGs only identified in BarcUVa-Seq are denoted light blue, while genes that were also present nominally (dark blue) and following Bonferroni correction (red) in carcinogen analysis are also shown. N.S denotes genes that were not significant.
Summary of significant modules identified in WGCNA that passed quality control tests and were enriched for protein-protein interactions
| Module |
| Gene Significance and Module Membership | No. CRC GWAS Genes | PPI | Hub Genes | |
|---|---|---|---|---|---|---|
| lightsteelblue | 31.949 | 9.62E-26 | 0.360 ( | 15 (233)* | 1.00E-16 |
|
| bisque4 | –24.523 | 6.84E-22 | 0.430 ( | 11 (114) | 0.046 |
|
| coral1 | –16.215 | 4.14E-16 | 0.150 ( | 37 (432) | 1.00E-16 |
|
| skyblue4 | –15.882 | 7.86E-16 | 0.420 ( | 7 (56) | 1.00E-16 |
|
| coral | –14.934 | 5.18E-15 | 0.110 ( | 39 (449) | 1.00E-16 |
|
| darkolivegreen | –8.313 | 4.64E-08 | 0.220 ( | 14 (139) | 6.29E-10 |
|
| plum4 | 5.249 | 4.15E-04 | 0.120 ( | 21 (330) | 1.00E-16 |
|
+Significance of correlation between gene significance and module membership for genes within a module. *Total number of genes within a module.