| Literature DB >> 32079540 |
Hu Chen1,2, Jun Li2, Yumeng Wang2, Patrick Kwok-Shing Ng3, Yiu Huen Tsang4, Kenna R Shaw3, Gordon B Mills4, Han Liang5,6.
Abstract
BACKGROUND: The initiation and subsequent evolution of cancer are largely driven by a relatively small number of somatic mutations with critical functional impacts, so-called driver mutations. Identifying driver mutations in a patient's tumor cells is a central task in the era of precision cancer medicine. Over the decade, many computational algorithms have been developed to predict the effects of missense single-nucleotide variants, and they are frequently employed to prioritize mutation candidates. These algorithms employ diverse molecular features to build predictive models, and while some algorithms are cancer-specific, others are not. However, the relative performance of these algorithms has not been rigorously assessed.Entities:
Keywords: 3D clustering; Cell viability assay; Driver mutations; Passenger mutations; TP53 mutations; The Cancer Genome Atlas; Tumor transformation
Mesh:
Year: 2020 PMID: 32079540 PMCID: PMC7033911 DOI: 10.1186/s13059-020-01954-z
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Summary of 33 computational algorithms included in this study
| Classifier | Features | Method | Reference |
|---|---|---|---|
| CADD | Conservation, epigenetic signals, functional predictions, genetic context, and published predictors | Linear kernel support vector machine | Rentzsch et al. [ |
| CanDrA | Structural, evolutionary, and genomic features, published predictors | Support vector machine | Mao et al. [ |
| CHASM | Structural, evolutionary, and genomic features | Random forest | Carter et al. [ |
| CTAT-cancer | TransFIC, fathmm, chasm, candra | Principal component analysis (PCA) | Bailey et al. [ |
| CTAT-population | SIFT, PolyPhen2, mutationAssessor, VEST | PCA | Bailey et al. [ |
| DANN | Conservation, epigenetic signals, functional predictions, and genetic context | Deep neural network | Quang et al. [ |
| DEOGEN2 | Evolutionary, protein, gene, pathway, PROVEAN | Random forest | Raimondi et al. [ |
| Eigen | Prediction scores of other tools, allele frequencies, epigenomic signals | Unsupervised spectral approach | Ionita-Laza et al. [ |
| Eigen-PC | Prediction scores of other tools, allele frequencies, epigenomic signals | Unsupervised spectral approach | Ionita-Laza et al. [ |
| FATHMM-disease | Sequence homology | Hidden Markov models | Shihab et al. [ |
| FATHMM-cancer | Sequence homology | Hidden Markov models | Shihab et al. [ |
| FATHMM-MKL | Conservation, epigenomic signals | Multiple kernel learning | Shihab et al. [ |
| FATHMM-XF | Conservation, genomic features, epigenomic signals | Multiple kernel learning | Rogers [ |
| GenoCanyon | Conservation, biochemical annotation | Posterior probability by unsupervised statistical learning | Lu et al. [ |
| Integrated_fitCons | Integrated epigenomic signals | INSIGHT | Gulko et al. [ |
| LRT | Sequence homology | Likelihood ratio test of codon neutrality | Chun et al. [ |
| M-CAP | Published predictors, conservation | Gradient boosting tree classifier | Jagadeesh et al. [ |
| MetaLR | Nine prediction scores and allele frequencies in 1000G | Logistic regression | Dong et al. [ |
| MetaSVM | Nine prediction scores and allele frequencies in 1000G | Radial kernel support vector machine | Dong et al. [ |
| MPC | Regional missense constraint, missense badness, polyphen2 | Logistic regression | Samocha et al. [ |
| MutationAssessor | Sequence homology | Combinatorial entropy formalism | Reva et al. [ |
| MutationTaster2 | Conservation, genetic context, regulatory features | Naïve Bayes classifier | Schwarz et al. [ |
| MutPred | Protein structural and functional properties, conservation, SIFT | Random forest | Li et al. [ |
| MVP | Sequence and structural features, published predictors, conservation | Deep neural network | Qian et al. [ |
| Polyphen2_HDIV | Eight sequence-based and three structure-based predictive features | Naïve Bayes classifier | Adzhubei et al. [ |
| Polyphen2_HVAR | Eight sequence-based and three structure-based predictive features | Naïve Bayes classifier | Adzhubei et al. [ |
| PrimateAI | Sequence homology | Deep residual neural network | Sundaram et al. [ |
| PROVEAN | Sequence homology | Delta alignment score | Choi et al. [ |
| REVEL | Published predictors | Random forest | Ioannidis et al. [ |
| SIFT | Sequence homology based on PSI-BLAST | Position-specific scoring matrix | Ng et al. [ |
| SIFT4G | Sequence homology based on Smith-Watermann | Position-specific scoring matrix | Vaser et al. [ |
| TransFIC | SIFT, Polyphen2, mutationAssessor | Transformed functional impact scores | Gonzalez-Perez [ |
| VEST4 | Amino acid-related features, DNA context, conservation, protein structure | Random forest | Carter et al. [ |
Fig. 1Feature summary and inter-correlations between algorithms. a Based on features included, each algorithm was labeled as using ensemble score, sequence context, protein feature, conservation, or epigenomic information. The algorithms trained on cancer diver data or proposed to identify cancer drivers are labeled as cancer-specific. b Left: hierarchical clustering pattern of 33 algorithms based on ~ 710,000 TCGA somatic mutations; right, a triangle heatmap displays the Spearman rank correlation coefficient between any two algorithms
Fig. 2Assessment using a benchmark dataset based on mutation 3D clustering pattern. a Overview of the assessment process. We used four computational algorithms to detect whether mutations are located within the protein 3D structural hotspots, each algorithm with one vote. The number of votes was defined as the consensus cluster score. A mutation with a score of ≥ 2 and in a cancer gene (i.e., cancer gene consensus) was considered as a positive case, and a mutation with a score of 0 and in a non-cancer gene was considered as a negative case. b ROC curves and corresponding AUC scores for the top 10 algorithms. c Boxplots showing the differences of AUC between two groups of algorithms with or without certain features. p value is based on the Wilcoxon rank sum test. d Sensitivity and specificity of each algorithm calculated by using the median score value as the threshold to make binary predictions. Error bars, mean ± 2SD
Fig. 3Assessment using a benchmark dataset based on OncoKB annotation. a Overview of the assessment process. The OncoKB database classifies mutations into four categories: oncogenic, likely oncogenic, likely neutral, and inconclusive. We considered “likely neutral” as negative cases, and we considered “oncogenic” mutations only or both “oncogenic” and “likely oncogenic” mutations as positive cases. b Bar plots showing the AUC scores of the 33 algorithms in the two comparisons. The red color is for oncogenic plus likely oncogenic vs. likely neutral, and green is for oncogenic vs. likely neutral. c Sensitivity and specificity of 33 algorithms. Error bars, mean ± 2SD
Fig. 4Assessment using a benchmark dataset based on the transactivation effects of TP53 mutations. a Overview of the assessment process. Promoter-specific transcriptional activity was measured for 8 targets of p53 protein. Mutations with the median transcription activity ≤ 50% were used as positive cases, and others were used as negative cases. b ROC plot and AUC scores for the top 10 algorithms. c Sensitivity and specificity of 33 algorithms. Error bars, mean ± 2SD
Fig. 5Assessment using a benchmark dataset based on in vivo tumor formation. a Overview of the assessment process. Cell lines stabling expressing mutant alleles were injected into mice. Mutations that could form any tumors greater than 500 mm3 by 130 days were considered as functional mutations and used as positives, and other mutations were used as negatives. b ROC plot and AUC scores for the top 10 algorithms. c Sensitivity and specificity of 33 algorithms. Error bars, mean ± 2SD
Fig. 6Assessment using a benchmark dataset based on in vitro cell viability. a Overview of the assessment process. For each mutation, we performed cell viability assays in two “informer” cell lines, Ba/F3 and MCF10A. Consensus calls were inferred by integrating the functional effects observed in Ba/F3 and MCF10A. We considered activating, inactivating, inhibitory, and non-inhibitory mutations as positive cases, while neutral mutations were considered negative. b The ROC curves of the 33 algorithms based on a combined set of published mutations (Ng et al. [42]) and newly generated mutations in this study. c Bar plots showing the AUC scores of the 33 algorithms in the three datasets: new functional data (red), published functional data (green), and the combined set (blue). d Boxplots showing the differences of AUC between two groups of algorithms with or without certain features. p values are based on the Wilcoxon rank sum test. d Sensitivity and specificity of 33 algorithms. Error bars, mean ± 2SD
Fig. 7Overall evaluation. a, b The overlapping summary of positive (a) and negative cases (b) in the five benchmark datasets. c Correlations of the performance ranks of the 33 algorithms based on the five benchmark datasets. d A heatmap showing the rank of the 33 algorithms based on each benchmark dataset. Ranks are labeled for the top five algorithms only. Red, higher ranks, and white, lower ranks. The features of the 33 algorithms are shown on the top, indicated by color (gray, no; and black, yes)