| Literature DB >> 34620211 |
Hussein Mohsen1, Vignesh Gunasekharan2, Tao Qing2, Montrell Seay3, Yulia Surovtseva3, Sahand Negahban4, Zoltan Szallasi5, Lajos Pusztai6, Mark B Gerstein7,8,9,10.
Abstract
BACKGROUND: The diversity of genomic alterations in cancer poses challenges to fully understanding the etiologies of the disease. Recent interest in infrequent mutations, in genes that reside in the "long tail" of the mutational distribution, uncovered new genes with significant implications in cancer development. The study of cancer-relevant genes often requires integrative approaches pooling together multiple types of biological data. Network propagation methods demonstrate high efficacy in achieving this integration. Yet, the majority of these methods focus their assessment on detecting known cancer genes or identifying altered subnetworks. In this paper, we introduce a network propagation approach that entirely focuses on prioritizing long tail genes with potential functional impact on cancer development.Entities:
Mesh:
Year: 2021 PMID: 34620211 PMCID: PMC8496153 DOI: 10.1186/s13059-021-02504-x
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Schematic overview of the UMG identification strategy
Fig. 2Distribution of UMGs across 17 cancer types. Right: genes in 2 or more cancer types. The dendrogram is based on the hierarchical clustering of heatmap rows. Each heatmap value corresponds to a percentage-based score of a cancer type’s cell lines whose survival is negatively impacted by a gene’s knockout. For each value, the maximum percentage across RNAi and CRISPR experiments is selected. Left: cancer type-specific genes. The histogram throughout the plot corresponds to the normalized rank of each UMG in the lists it belongs to
Fig. 3Biological enrichment results for UMGs at cancer type and pan-cancer levels. a UMGs uncover known and novel associations between cancer types and biological pathways. Enrichment analyses are performed for each cancer type’s combined list of UMGs and drivers. Shown results correspond to significant pathway and molecular function associations exclusively uncovered by UMGs. b Pan-cancer analysis of all 230 UMGs allows for the identification of biological pathways, processes, and functions strongly associated with UMGs (in red) that suggests potential therapeutic targets. c A similar analysis to b on clusters of KEGG mega-pathways uncover disease-disease and disease-infection associations pertaining to UMGs
Frequent UMGs driving high connectivity within EnrichmentMap functional clusters
| Functional cluster | Frequent UMGs |
|---|---|
| Known cancer-related | |
| Proliferation | |
| Adhesion | |
| Transcription and translation | |
| Binding | |
| Immune system | |
| Cancer mega-pathways | |
| Other diseases and infection mega-pathways |
Fig. 4Comparisons with other methods. a UMGs demonstrate a considerably stronger (CRISPR- and RNAi-measured) impact on survival of cancer cell lines than other non-driver genes suggested by HHotNet (in 3 settings), FDRNet, Zhou et al. (in its original and edge-normalized settings), nCOP, and MutSig. Higher negative values indicate a greater negative effect on cell survival after gene knockdown. b UMGs’ strong impact on the survival of cancer cell lines is significantly broader than that of genes selected by other methods. The median percentage-based score of cancer cell lines negatively impacted by UMGs’ knockout is consistently higher with cancer type specificity
Fig. 5PPI network analysis of the relationships between UMGs (white nodes) and known driver genes (red) in breast invasive carcinoma (BRCA) suggests roles of UMGs. Driver genes are split into categories based on initial mutation score and node degree: (i) high score, high degree (bottom left); (ii) high score, low degree (top left); (iii) low score, low degree (top right); and (iv) low score, high degree (bottom right). UMGs connected to driver subsets (i) and (ii) (olive and orange edges) and ones with no mutation score (e.g., POLR2E) are likely to be drug targets. UMGs connected to (iii) and (iv) and ones without connections to drivers (top right corner, e.g., DSN1) are likely to be “weak drivers.” Other cancer types’ results are available in Additional file 5: Figs. S4-19