| Literature DB >> 28477017 |
Barani Kumar Rajendran1, Chu-Xia Deng1.
Abstract
Breast cancer is the second most frequently occurring form of cancer and is also the second most lethal cancer in women worldwide. A genetic mutation is one of the key factors that alter multiple cellular regulatory pathways and drive breast cancer initiation and progression yet nature of these cancer drivers remains elusive. In this article, we have reviewed various computational perspectives and algorithms for exploring breast cancer driver mutation genes. Using both frequency based and mutational exclusivity based approaches, we identified 195 driver genes and shortlisted 63 of them as candidate drivers for breast cancer using various computational approaches. Finally, we conducted network and pathway analysis to explore their functions in breast tumorigenesis including tumor initiation, progression, and metastasis.Entities:
Keywords: breast cancer; breast cancer driver genes; cancer drivers; driver mutations; genetic mutations
Mesh:
Year: 2017 PMID: 28477017 PMCID: PMC5564847 DOI: 10.18632/oncotarget.17225
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
List of driver identification methods used to incorporates the prediction of breast cancer driver genes, their working principle and supporting references
| Driver Identification Method | Driver Gene Identification Principle | Citations |
|---|---|---|
| IntOGen | Identifies alterations at transcriptomics level, CN gain and losses in tumor sample. It also integrates OncodriveFM for the identification of accumulation mutations, background mutation rate and OncodriveCLUST for mutation cluster identifications. Further, SIFT, Polyphen and Mutation Assessor are used to predict the impact of mutations. | [ |
| SIFT | Amino acids substitutions and their deleterious impacts prediction. It find the homologous sequences using PSI-BLAST followed by picking sequences with specific diversity and calculating the SIFT scores. | [ |
| PolyPhen-2 | Analyzes non-synonymous SNP using multiple sequence alignment and structure information followed by predicting the probabilistic damaging variants with confidence prediction and at last interpret the results with mutational impact. | [ |
| Mutation Assessor | Predicts mutational impact by calculating functional impact score derived from addition of conservation score and specificity score. | [ |
| Driver DBv2 | Uses large exome and RNAseq datasets to predict the driver genes using several incorporated tools. | [ |
| Active Driver | It identifies significant mutations of cancer genes in active sites of proteins such as mutations in signaling proteins or domains or regulatory elements. It uses gene-centric logistic regression model including multiple factors to estimate mutation significance. | [ |
| Dendrix | This algorithm discovers driver genes with high coverage and high specificity using mutation data. | [ |
| MDPFinder | It combines mutation and expression data to validate the driver genes and their mutated pathways. | [ |
| Simon | It identifies functional mutation impact on proteins, variations in background mutation frequency and genetic code redundancy among tumors. | [ |
| NetBox | It identifies the driver genes by comparing genes and performing network analysis on human interaction Network (HIN) data. | [ |
| MutSigCV | It uses overall mutation rates and distribution patterns and analyzes background mutation rates with patient specific as well as gene specific mutation rates. Finally it includes expression levels and replication periods. | [ |
| MEMo | It identifies the driver genes based on recurrently mutated genes among tumor data with consistent mutational specificity. | [ |
| e-Driver | It manipulates internal distribution of somatic functional missense mutations amongst functional domains by relating mutation rates with other regions of same protein. | [ |
| DawnRank | Uses gene expression data to construct gene network and rank them based on impact and it analyzes somatic alteration data to identify personalized driver alterations. | [ |
| DriverNet | Driver genes are identified based on genomic aberration states of various patients, genes, gene expression data and it further takes biological pathway data into account and builds the network driver genes. | [ |
| MSEA | It predicts cancer driver genes based on patterns of mutation hotspot. | [ |
| iPAC | Identifies non-random somatic mutations in protein using tertiary protein structure information. | [ |
| CoMDP | It uses mutation data to identify driver genes and their pathways. It also predicts genes with other multiple co-occurring biologically significant pathways. | [ |
Figure 1Total number of breast cancer driver genes identified using various computational methods
Figure 2(A) Identified driver genes classified based on their Mutation percentage; (B) High percentage of mutations (>10%) are observed in the identified 63 breast cancer driver genes through the analysis of 9 breast cancer patients data analysis using cBioPortal.
Figure 3Average breast cancer gene mutations identified using cBioPortal projects (4162 breast cancer samples) along with identified top candidate driver genes and their respective chromosomes locations
Identified top candidate breast cancer driver genes (other than known driver genes) and their functional backgrounds
| Identified Driver Genes | Cancer Type | Related pathway | Known Functions | References |
|---|---|---|---|---|
| ADCY3 | Gastric cancer | cAMP/PKA/CREB pathway | Increased cell migration, invasion, and proliferation, which are characteristic of cancer. | [ |
| ARHGAP35 | Osteosarcoma, Breast cancer, Pancreatic carcinoma | Regulation of RhoA activity and focal adhesion and migration | Human glucocorticoid receptor DNA binding factor | [ |
| ARID2 | Hepatocellular carcinoma/melanoma | Chromatin Remodeling | Activating ligand dependent transcription by nuclear receptor | [ |
| ASB10 | Glioblastoma multiform, Ovarian Cancer | Cytokine signaling | Ubiquitination and Ubiquitin protein ligase binding | [ |
| ASH1L | Liver cancer;Leukemia; breast cancer | Tight junction and lysine degradation | Chromatin regulator; Site specific lysine methylation on histone and other proteins | [ |
| BCL6B | Breast cancer;Gastric cancer | P53, MAPK and cancer related pathways | Nucleic acid bindingTumor suppressor gene in gastric cancer | [ |
| BIRC6 | Breast cancer; | Apoptosis and Autophagy | miRNA dependent apoptosis induction | [ |
| CACNA1C | Breast cancer, Gastric, colorectal, pancreatic, leukemia, brain, skin, prostate cancer | Circadian entrainment and NFAT and Cardiac Hypertrophy | High alteration in Ca2+ ion it accelerates cell proliferation, migration and up-regulation in breast cancer | [ |
| COL4A2 | Cardiovascular disease and intracerebral hemorrhage, glaucoma, etc. | Interleukin-3, 5 and GM-CSF signaling and Pathways in cancer. | Regulation of angiogenesis and tumor growth | [ |
| DDX11 | Breast cancer, Fanconi Anemia | Golgi and subsequent modification and unfolded protein response | Genome stability | [ |
| DNAH12 | Prostate cancer | Respiratory electron transport, ATP synthesis chemiosmotic coupling and uncoupling protein for heat production | ATP binding andRegulatory function | [ |
| DNAH14 | Ovarian cancer | Respiratory electron transport, ATP synthesis chemiosmotic coupling and uncoupling protein for heat production | ATP binding andRegulatory function | [ |
| DSPP | Oral squamous cell carcinomas;Prostate and breast cancer | ECM proteoglycan and degradation of the extracellular matrix organization | Vital factor in dentinogenesis; | [ |
| FLG | Nonmelanoma cancer, head and neck, colorectal, breast, ovarian, prostate cancer | AhR pathways | Calcium ion binding | [ |
| FLNB | Breast Cancer; Ovarian cancer; Colorectal cancer | MMP-9 and ERK pathway | RAS induced tumor growth | [ |
| FRMD4A | Gastric cancer;Rectal cancer | - | Protein Binding | [ |
| GOLGA6L2 | Breast cancer;Hapatocellular carcinoma | - | Protein coding | [ |
| GPRIN2 | Rett Syndrome;Breast Cancer | - | Neurite outgrowth | [ |
| GRIA3 | Pancreatic Cancer; Breast cancer | glutamate receptor signaling pathway | excitatory synaptic transmission | [ |
| HECTD4 | Esophageal, non-small-cell lung and head and neck cancer | Protein modification and Ubiquitination | Ubiquitin-protein transferase activity | [ |
| LAMA1 | Breast cancer; Colon cancer | Cancer and Integrin pathway | Receptor binding | [ |
| MAST1 | Breast Cancer | - | Ion/ATP/protein binding | [ |
| MCF2L | Breast cancer | Rho/Rac signaling and p75 NTR-receptor-mediated signaling pathways | Rho-guanyl-nucleotide exchange factor activity | [ |
| MEF2A | Breast cancer | P38 MAPK signaling | Neuronal differentiation and survival | [ |
| NBPF12 | Neuroblastoma; small cell lung cancer neurogenetic diseases | - | CHEA Transcription factor binding site | [ |
| NID1 | Gastrointestinal cancer | Non-integrin membrane-ECM interactions and Degradation of the extracellular matrix | Act as cross-linker with other extracellular matrix | [ |
| NRK | Breast cancer | TNF-alpha-induced signaling pathway | Receptor signaling protein serine/threonine kinase activity and ATP binding | [ |
| OBSCN | Highly mutated in various cancers including breast cancer | RhoA signaling | Structural and regulatory functions | [ |
| PCBP2 | Hepatocellular cancer; Familial breast cancer; lymphocytic leukemia, colorectal cancer | RIG-I/MDA5 mediated induction of IFN-alpha/beta pathway and mRNA splicing pathways | Transcriptional role | [ |
| PCDH11X | Esophageal carcinoma, breast cancer, Prostate cancer | - | Cell adhesion | [ |
| PGR | Breast and Ovarian cancer | oestrogen-mediated pathways | Tumor repressing mechanism | [ |
| PIK3CB | Oral-squamous cell carcinoma, breast cancer and other wide range of cancer | Involved in AKT, PTEN and PIK3CA pathways | Cell cycle growth regulation | [ |
| PIK3CD | Breast, Ovarian and colon cancers | PIK signaling | Transcription binding factor | [ |
| ROCK2 | Breast, lung, ovarian, intestinal cancer | RhoA signaling | Actin cytoskeleton organization, Adhesion, migration, Proliferation and apoptosis. | [ |
| RYR2 | Breast Cancer, Lung Cancer, Bladder cancer | cAMP-dependent PKA activation | Calcium ion binding, Calcium/calmodulin binding | [ |
| SCAF11 | Lung adenocarcinoma, various cancers | Apoptosis | Protein/zinc ion/poly(A)RNA binding | [ |
| SDK2 | Non-small cell lung cancer | - | Adhesion, Promotes synaptic connectivity | [ |
| STAT6 | Breast cancer, Lung cancer | Integrin, Interleukin-3,5 and GM-CSF signaling pathway | IL-4 mediate cell growth regulator, inhibitIL-4 induced cell death | [ |
| TTN | Colorectal, testis, gastric, breast, ovarian, renal cancers | Platelet activation, Signaling and aggregation pathway | Chromosome condensation and segregation | [ |
Mutation profiles of identified top candidate BRCA driver genes
| BRCA Drivers | Substitution % | InDel % | Amplification % | Copy Gain % | Copy Loss % | Deletion (%) | Expression Outliers High % |
|---|---|---|---|---|---|---|---|
| 3.2 | 0.1 | - | 1.4 | 16.5 | 2.9 | - | |
| 1.7 | - | - | 1 | 16.4 | 1.3 | - | |
| CTCF | 1.9 | - | 0.1 | 1.1 | 16 | 2.9 | |
| 2.4 | 0.4 | 0.2 | 0.4 | 12.2 | 2.5 | - | |
| BCL6B | 0.1 | - | 0.1 | 0.4 | 12.2 | 1.4 | - |
| 6.8 | 0.4 | - | 0.6 | 12 | 1.5 | - | |
| 3 | 0.3 | 0.2 | 0.8 | 11.3 | 1.5 | - | |
| PGR | 0.5 | - | 0.2 | 2 | 10.8 | 2.8 | 51.4 |
| 1.3 | - | - | 0.8 | 9.3 | 1.8 | - | |
| 1.3 | - | 0.2 | 1.8 | 7.4 | 1.2 | 66.5 | |
| MCF2L | 0.5 | - | 0.4 | 4 | 6.7 | 2.5 | - |
| COL4A2 | 0.5 | - | 0.3 | 4.1 | 6.5 | 2 | - |
| GOLGA6L2 | 0.1 | - | 0.3 | 2.2 | 6.2 | 1.7 | - |
| 1.4 | 0.1 | 0.1 | 0.3 | 5.6 | 0.8 | - | |
| 1 | - | - | 0.8 | 5.1 | 0.5 | - | |
| PIK3CD | 0.4 | - | 0.2 | 0.8 | 5.1 | 1.2 | 1.2 |
| DNAH12 | 1 | 0.1 | 0.1 | 0.6 | 4.9 | 0.5 | 15.8 |
| FLNB | 1.2 | 0.3 | 0.1 | 0.8 | 4.8 | 0.6 | - |
| ZFP36L1 | 0.5 | 0.1 | 0.1 | 1.4 | 4.7 | 1 | - |
| LAMA1 | 1 | - | 0.1 | 2.7 | 4.3 | 1 | - |
| 1 | - | 0.2 | 4.7 | 4.2 | 0.1 | 52.2 | |
| MAP3K1 | 3.5 | 0.1 | - | 2.2 | 4.1 | 0.8 | - |
| 1.7 | 0.4 | - | 0.7 | 3.9 | 0.9 | - | |
| ASB10 | 0.3 | - | 0.2 | 3.9 | 2.7 | 0.8 | - |
| 0.9 | - | 0.1 | 1 | 2.5 | 0.3 | - | |
| DSPP | 1.2 | 0.3 | 0.2 | 1.1 | 2.3 | 0.4 | - |
| MEF2A | 0.1 | - | 1.2 | 3.8 | 2.3 | 0.3 | - |
| PCDH11X | 0.8 | - | 0.9 | 2.2 | 0.3 | - | |
| CACNA1C | 1.3 | - | 0.9 | 5.1 | 2.1 | 0.3 | - |
| GRIA3 | 1.2 | 0.1 | 0.1 | 1.1 | 2.1 | 0.2 | - |
| TTN | 13.7 | - | 0.1 | 2 | 2 | - | - |
| FOXA1 | 1.3 | - | 0.8 | 4.7 | 2 | 0.4 | 84.2 |
| TBX3 | 0.9 | - | - | 1.6 | 1.9 | 0.2 | - |
| 0.8 | - | 0.8 | 4 | 1.8 | 0.2 | - | |
| 0.8 | - | 0.1 | 1.9 | 1.8 | 0.1 | - | |
| HECTD4 | 1.2 | - | 0.1 | 1.6 | 1.5 | 0.2 | - |
| MAST1 | 1 | - | 0.2 | 3.4 | 1.4 | 0.1 | 22 |
| 1.2 | 0.1 | 0.2 | 3.5 | 1.4 | 0.1 | - | |
| 0.3 | - | 0.1 | 1.9 | 1.4 | 0.1 | - | |
| ROCK2 | 0.5 | - | 0.3 | 1.9 | 1.2 | 0.3 | - |
| 0.3 | - | 0.1 | 2.2 | 1.1 | 0.1 | - | |
| 0.5 | - | 0.2 | 2.4 | 1 | 0.3 | - | |
| SDK2 | 1 | - | 1.4 | 11.6 | 1 | 0.4 | - |
| 1.7 | - | 1.4 | 32.9 | 1 | 0.1 | - | |
| SCAF11 | 0.4 | - | 0.2 | 2.4 | 1 | 0.2 | - |
| STAT6 | 0.1 | - | 0.2 | 2.3 | 0.9 | - | - |
| 0.9 | 2.4 | 0.9 | 7.3 | 0.8 | 0.2 | 81.1 | |
| NID1 | 1 | 0.1 | 1.6 | 33.3 | 0.8 | 0.2 | - |
| 1.3 | - | 0.1 | 2.4 | 0.8 | 0.1 | - | |
| 0.4 | - | 0.8 | 4.4 | 0.8 | 0.3 | - | |
| 0.5 | - | 0.8 | 6.5 | 0.8 | 0.1 | - | |
| PIK3CB | 0.6 | - | 0.3 | 4.8 | 0.8 | - | - |
| BIRC6 | 1.3 | - | 0.1 | 2.4 | 0.7 | 0.1 | - |
| 0.6 | - | 0.8 | 4.5 | 0.7 | 0.3 | - | |
| RYR2 | 3.9 | - | 3.5 | 33.1 | 0.6 | 0.3 | 5.9 |
| DNAH14 | 0.5 | 0.1 | 1.3 | 33.7 | 0.4 | 0.1 | 79.8 |
| TBL1XR1 | 0.5 | - | 0.7 | 8.2 | 0.4 | - | - |
| 32.1 | 0.6 | 0.9 | 8 | 0.3 | - | 0.1 | |
| GPRIN2 | 0.1 | 0.3 | 6.4 | 2.9 | 0.3 | - | - |
| NBPF12 | 0.3 | - | 4.5 | 45 | 0.2 | 0.1 | - |
| ASH1L | 1 | 0.1 | 1.4 | 31.2 | 0.1 | - | - |
| 0.8 | - | 0.8 | 4 | 1.8 | 0.2 | - | |
| FLG | 4.4 | - | 2 | 30.9 | - | 0.1 | - |
Identified driver genes are categorized with Tumor suppressor (* with bold caption); Oncogene (^ with bold caption); Gatekeeper (+ with bold caption).
Figure 4Genetic interaction network of identified top candidate breast cancer driver genes
Figure 5Overall comparisons between published and identified BRCA driver genes