| Literature DB >> 30662873 |
Marianna A Zolotovskaia1,2, Maxim I Sorokin3,4, Sergey A Roumiantsev1, Nikolay M Borisov2,3, Anton A Buzdin3,4,5.
Abstract
DNA mutations play a crucial role in cancer development and progression. Mutation profiles vary dramatically in different cancer types and between individual tumors. Mutations of several individual genes are known as reliable cancer biomarkers, although the number of such genes is tiny and does not enable differential diagnostics for most of the cancers. We report here a technique enabling dramatically increased efficiency of cancer biomarkers development using DNA mutations data. It includes a quantitative metric termed Pathway instability (PI) based on mutations enrichment of intracellular molecular pathways. This method was tested on 5,956 tumor mutation profiles of 15 cancer types from The Cancer Genome Atlas (TCGA) project. Totally, we screened 2,316,670 mutations in 19,872 genes and 1,748 molecular pathways. Our results demonstrated considerable advantage of pathway-based mutation biomarkers over individual gene mutation profiles, as reflected by more than two orders of magnitude greater numbers by high-quality [ROC area-under-curve (AUC)>0.75] biomarkers. For example, the number of such high-quality mutational biomarkers distinguishing between different cancer types was only six for the individual gene mutations, and already 660 for the pathway-based biomarkers. These results evidence that PI value can be used as a new generation of complex cancer biomarkers significantly outperforming the existing gene mutation biomarkers.Entities:
Keywords: DNA mutation; biomarker; cancer; molecular pathways; pathway instability
Year: 2019 PMID: 30662873 PMCID: PMC6328788 DOI: 10.3389/fonc.2018.00658
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1(A) PCA of Normalized mutation rate (nMR) patterns based on all mutations for 5,956 samples representing 15 primary human tumor localizations, reflected by the color key. Each point on the plot represents one tumor sample. Abbreviations for the cancer types: BRCA, breast invasive carcinoma; LGG, brain lower grade glioma; GBM, glioblastoma multiforme; CESC, cervical squamous cell carcinoma and endocervical adenocarcinoma; UCEC, uterine corpus endometrial carcinoma; LAML, acute myeloid leukemia; KIRP, kidney renal papillary cell carcinoma; KIRC, kidney renal clear cell carcinoma; COADREAD, colorectal cancer; LICA, liver cancer; LIHC, liver hepatocellular carcinoma; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; OV, ovarian serous cystadenocarcinoma; PRAD, prostate adenocarcinoma; SKCM, skin cutaneous melanoma; STAD, stomach adenocarcinoma; THCA, thyroid carcinoma; BLCA, bladder urothelial carcinoma. (B) PCA of Pathway instability (PI) patterns based on all mutations for the same samples. (C) PCA of Normalized mutation rate (nMR) patterns based on the truncating mutations for 5,297 tumor samples. (D) PCA of Pathway instability (PI) patterns based on the truncating mutations for 5,297 tumor samples.
Figure 2Bioinformatic comparison of quality for pathway- and gene-based mutation biomarkers: (A) for all mutations; (B) for truncating mutations.
Figure 3(A) Numbers of mutation marker genes and molecular pathways in “one vs. all” cancer type comparisons for all mutations. The cancer types are abbreviated as follows: BRCA, breast invasive carcinoma; LGG, brain lower grade glioma; GBM, glioblastoma multiforme; CESC, cervical squamous cell carcinoma and endocervical adenocarcinoma; UCEC, uterine corpus endometrial carcinoma; LAML, acute myeloid leukemia; KIRP, kidney renal papillary cell carcinoma; KIRC, kidney renal clear cell carcinoma; COADREAD, colorectal cancer; LICA, liver cancer; LIHC, liver hepatocellular carcinoma; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; OV, ovarian serous cystadenocarcinoma; PRAD, prostate adenocarcinoma; SKCM, skin cutaneous melanoma; STAD, stomach adenocarcinoma; THCA, thyroid carcinoma; BLCA, bladder urothelial carcinoma. (B) Numbers of mutation marker genes and molecular pathways in “one vs. other cancer types” comparisons for truncating mutations only. (C) AUC distributions of pathway- and genes-based mutation biomarkers for all mutations. Cut-off AUC level of high-quality biomarkers is set 0.75. AUC were obtained as the result of “one vs. others” comparisons. (D) AUC distributions of pathway—and genes-based mutation biomarkers for truncating mutations only. Cut-off level of high-quality biomarkers is set 0.75. AUC were obtained in “one vs. other cancer types” comparisons.
Intersection of top 10% molecular pathways by average PI and molecular pathways with AUC < 0.7 for all cancer types.
| 1 | NCI Aurora A signaling Pathway (protein catabolic process) | 0.19 | |
| 2 | biocarta double stranded RNA induced gene expression Main Pathway | 0.19 | |
| 3 | biocarta role of | 0.18 | |
| 4 | biocarta | 0.18 | |
| 5 | 0.17 | ||
| 6 | 0.16 | ||
| 7 | 0.16 | ||
| 8 | 0.16 | ||
| 9 | biocarta tumor suppressor | 0.16 | |
| 10 | biocarta | 0.14 | |
| 11 | NCI Hypoxic and oxygen homeostasis regulation of | 0.12 | |
| 12 | 0.11 | ||
| 13 | 0.09 | ||
| 14 | 0.09 | ||
| 15 | Lipoxins Influence on Cell Growth and Proliferation | 0.05 | |
| 16 | NCI Validated transcriptional targets of TAp63 isoforms Pathway (Pathway degradation of TP63) | 0.03 | |
| 17 | NCI Validated transcriptional targets of TAp63 isoforms Pathway (Metastasis) | 0.02 | |
| 18 | D-imyoi-inositol 145-trisphosphate biosynthesis | 0.02 |
Top 25 molecular pathways sorted by the number of cancer types where PI score serves as a good biomarker distinguishing from the other fourteen localizations (AUC>0.75).
| 1 | KEGG Pathways in cancer Main Pathway | 6 | |
| 2 | 5 | ||
| 3 | 5 | ||
| 4 | 5 | ||
| 5 | 5 | ||
| 6 | KEGG Neuroactive ligand receptor interaction Main Pathway | 5 | |
| 7 | 5 | ||
| 8 | 5 | ||
| 9 | 5 | ||
| 10 | 4 | ||
| 11 | 4 | ||
| 12 | 4 | ||
| 13 | KEGG | 4 | |
| 14 | KEGG HTLV I infection Main Pathway | 4 | |
| 15 | KEGG | 4 | |
| 16 | KEGG Olfactory transduction Main Pathway | 4 | |
| 17 | KEGG Protein digestion and absorption Main Pathway | 4 | |
| 18 | KEGG Sphingolipid signaling Main Pathway | 4 | |
| 19 | 4 | ||
| 20 | 4 | ||
| 21 | NCI Beta1 integrin cell surface interactions Main Pathway | 4 | |
| 22 | 4 | ||
| 23 | 4 | ||
| 24 | 4 | ||
| 25 | Ras Pathway | 4 |
Figure 4(A) Data matrix of high quality (AUC>0.75) biomarkers for pairwise comparisons between the different cancer localizations calculated based on all mutations. The cancer types are abbreviated as follows: BRCA, breast invasive carcinoma; LGG, brain lower grade glioma; GBM, glioblastoma multiforme; CESC, cervical squamous cell carcinoma and endocervical adenocarcinoma; UCEC, uterine corpus endometrial carcinoma; LAML, acute myeloid leukemia; KIRP, kidney renal papillary cell carcinoma; KIRC, kidney renal clear cell carcinoma; COADREAD, colorectal cancer; LICA, liver cancer; LIHC, liver hepatocellular carcinoma; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; OV, ovarian serous cystadenocarcinoma; PRAD, prostate adenocarcinoma; SKCM, skin cutaneous melanoma; STAD, stomach adenocarcinoma; THCA, thyroid carcinoma; BLCA, bladder urothelial carcinoma. The lower triangle shows numbers of high-quality biomarkers for pathway-based data (PI); the upper triangle—for individual gene-based mutation data (nMR). The intersection of cancer localization terms indicates number of the effective biomarkers for the respective comparison. (B) Cluster dendrogram built for the fifteen cancer types based on mutation biomarker (PI) data for all mutations. Number of biomarkers was used as the distance metric. (C) Cluster dendrogram built for the fifteen cancer types based on mutation biomarker (nMR) data for all mutations. Number of biomarkers was used as the distance metric.