| Literature DB >> 35155435 |
Fei Yuan1, Xiaoyu Cao2, Yu-Hang Zhang3, Lei Chen4, Tao Huang5,6, ZhanDong Li7, Yu-Dong Cai8.
Abstract
Cancer driver gene is a type of gene with abnormal alterations that initiate or promote tumorigenesis. Driver genes can be used to reveal the fundamental pathological mechanisms of tumorigenesis. These genes may have pathological changes at different omics levels. Thus, identifying cancer driver genes involving two or more omics levels is essential. In this study, a computational investigation was conducted on lung cancer driver genes. Four omics levels, namely, epigenomics, genomics, transcriptomics, and post-transcriptomics, were involved. From the driver genes at each level, the Laplacian heat diffusion algorithm was executed on a protein-protein interaction network for discovering latent driver genes at this level. A following screen procedure was performed to extract essential driver genes, which contained three tests: permutation, association, and function tests, which can exclude false-positive genes and screen essential ones. Finally, the intersection operation was performed to obtain novel driver genes involving two omic levels. The analyses on obtained genes indicated that they were associated with fundamental pathological mechanisms of lung cancer at two corresponding omics levels.Entities:
Keywords: driver gene; epigenomics; genomics; heat diffusion algorithm; lung cancer; posttranscriptomics; protein-protein interaction network; transcriptomics
Year: 2022 PMID: 35155435 PMCID: PMC8826452 DOI: 10.3389/fcell.2022.825272
Source DB: PubMed Journal: Front Cell Dev Biol ISSN: 2296-634X
FIGURE 1Entire procedures of LHD-based method for identification of multi-omics lung cancer driver genes. Based on driver genes at one omics level, the Laplacian heat diffusion (LHD) algorithm is executed on a protein–protein interaction network reported in STRING to identify raw driver genes. These genes are filtered by a screen procedure. The intersection operation is conducted to identify driver genes involving any two omics levels. Six groups are accessed, each of which contains driver genes involving two omics levels.
Remaining latent driver genes at each stage of LHD-based method.
| Level | LHD-algorithm | Permutation test | Association test | Function test |
|---|---|---|---|---|
| Epigenomics | 13,101 | 311 | 228 | 199 |
| Genomics | 14,321 | 226 | 114 | 84 |
| Transcriptomics | 16,256 | 224 | 184 | 174 |
| Post-transcriptomics | 17,007 | 59 | 44 | 39 |
FIGURE 2Boxplot to show the associations between latent driver genes and validated ones at each of four omic levels. (A) Epigenomics; (B) Genomics; (C) Transcriptomics; (D) Post-transcriptomics. The Y-axis represents the number of validated genes that can interact with latent genes with different strength. The red (green, blue, respectively) box denotes number of validated genes that can interact with latent genes with medium (high, highest, respectively) confidence.
Driver genes involving two omics levels .
| Level | Epigenomics | Genomics | Transcriptomics | Post-transcriptomics |
|---|---|---|---|---|
| Epigenomics | - | 6 | 9 | 2 |
| Genomics |
| - | 6 | 1 |
| Transcriptomics |
|
| - | 1 |
| Post-transcriptomics |
|
|
| - |
Numbers in the upper triangle part of this table represent the numbers of driver genes involving two omics levels.