| Literature DB >> 35030162 |
Alexey D Vyatkin1, Danila V Otnyukov1, Sergey V Leonov1, Aleksey V Belikov1.
Abstract
There is a growing need to develop novel therapeutics for targeted treatment of cancer. The prerequisite to success is the knowledge about which types of molecular alterations are predominantly driving tumorigenesis. To shed light onto this subject, we have utilized the largest database of human cancer mutations-TCGA PanCanAtlas, multiple established algorithms for cancer driver prediction (2020plus, CHASMplus, CompositeDriver, dNdScv, DriverNet, HotMAPS, OncodriveCLUSTL, OncodriveFML) and developed four novel computational pipelines: SNADRIF (Single Nucleotide Alteration DRIver Finder), GECNAV (Gene Expression-based Copy Number Alteration Validator), ANDRIF (ANeuploidy DRIver Finder) and PALDRIC (PAtient-Level DRIver Classifier). A unified workflow integrating all these pipelines, algorithms and datasets at cohort and patient levels was created. We have found that there are on average 12 driver events per tumour, of which 0.6 are single nucleotide alterations (SNAs) in oncogenes, 1.5 are amplifications of oncogenes, 1.2 are SNAs in tumour suppressors, 2.1 are deletions of tumour suppressors, 1.5 are driver chromosome losses, 1 is a driver chromosome gain, 2 are driver chromosome arm losses, and 1.5 are driver chromosome arm gains. The average number of driver events per tumour increases with age (from 7 to 15) and cancer stage (from 10 to 15) and varies strongly between cancer types (from 1 to 24). Patients with 1 and 7 driver events per tumour are the most frequent, and there are very few patients with more than 40 events. In tumours having only one driver event, this event is most often an SNA in an oncogene. However, with increasing number of driver events per tumour, the contribution of SNAs decreases, whereas the contribution of copy-number alterations and aneuploidy events increases.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35030162 PMCID: PMC8759692 DOI: 10.1371/journal.pgen.1009996
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 4Driver event distribution by patient’s gender.
Two-tailed heteroscedastic t-test was performed. p-values indicate a significant increase in the average number of driver events of a given type compared to the opposite gender. n.s.—difference not significant.
Driver genes uniquely predicted by SNADRIF and one other source.
| Gene | Entrez Gene ID | Also predicted by |
|---|---|---|
| AMD1 | 262 | HotMAPS |
| BMPR1A | 657 | 2020plus |
| CMTR2 | 55783 | OncodriveFML |
| ELL2 | 22936 | HotMAPS |
| HNRNPDL | 9987 | 2020plus |
| LYRM4 | 57128 | OncodriveFML |
| METTL14 | 57721 | HotMAPS |
| MYCL | 4610 | CGC |
| PPP3R1 | 5534 | HotMAPS |
| PTMA | 5757 | Bailey et al, 2018 |
| RPL22 | 6146 | Bailey et al, 2018 |
| SAMHD1 | 25939 | HotMAPS |
| TBL1XR1 | 79718 | CGC |
| TDG | 6996 | HotMAPS |
| TNNI3K | 51086 | HotMAPS |
| TTK | 7272 | HotMAPS |
Driver prediction algorithms.
| Name | Ref. | Repository | Level | Principles |
|---|---|---|---|---|
| 20/20plus | [ |
| gene | Machine learning, trained on Cancer Genome Landscapes (20/20 rule); Nonsynonymous/Synonymous, clustering, conservation (uses UCSC’s 46-way vertebrate alignment and SNVBox), impact (uses VEST), network (uses BioGrid), expression, chromatin, replication (uses MutSigCV) |
| ANDRIF | This paper |
| Chromosomal arm, chromosome | Recurrence |
| CHASMplus | [ |
| mutation | Machine learning, trained on TCGA; clustering (uses HotMAPS 1D), conservation (uses UCSC Multiz-100-way and SNV box), network (uses Interactome Insider) |
| CompositeDriver | [ |
| gene | Recurrence, impact (uses FunSeq2) |
| dNdScv | [ |
| gene | Nonsynonymous/Synonymous |
| DriverNet | [ |
| gene | Network (uses MGSA and a human functional protein interaction network), impact (uses gene expression outliers) |
| HotMAPS | [ |
| mutation | 3D clustering (uses Protein Data Bank and ModPipe) |
| OncodriveCLUSTL | [ |
| gene | Clustering |
| OncodriveFML | [ |
| gene | Recurrence, Impact (uses CADD and RNAsnp) |
| SNADRIF | This paper |
| gene | Nonsynonymous/Synonymous |
| Bailey et al, 2018 | [ |
| gene | Consensus driver gene list from 26 algorithms applied to PanCanAtlas data |
| COSMIC Cancer Gene Census | [ |
| gene | Manually curated list of cancer driver genes, current “gold standard” |
Driver event classification rules.
| Driver type | Number of nonsynonymous SNAs | Number of inactivating SNAs | HISR | CNA status | Count as … driver event(s) |
|---|---|---|---|---|---|
| SNA-based oncogene | ≥1 | 0 | >5 | 0 | 1 |
| CNA-based oncogene | 0 | 0 | >5 | 1 or 2 | 1 |
| Mixed oncogene | ≥1 | 0 | >5 | 1 or 2 | 1 |
| SNA-based tumour suppressor | ≥1 | ≥0 | ≤5 | 0 | 1 |
| CNA-based tumour suppressor | 0 | 0 | ≤5 | -1 or -2 | 1 |
| Mixed tumour suppressor | ≥1 | ≥0 | ≤5 | -1 or -2 | 1 |
| Passenger | 0 | 0 | 0 | 0 | |
| Low-probability driver | All the rest | 0 | |||