| Literature DB >> 27112211 |
Gabriela Jurca1, Omar Addam1, Alper Aksac1, Shang Gao2, Tansel Özyer3, Douglas Demetrick4, Reda Alhajj5,6.
Abstract
BACKGROUND: Breast cancer is a serious disease which affects many women and may lead to death. It has received considerable attention from the research community. Thus, biomedical researchers aim to find genetic biomarkers indicative of the disease. Novel biomarkers can be elucidated from the existing literature. However, the vast amount of scientific publications on breast cancer make this a daunting task. This paper presents a framework which investigates existing literature data for informative discoveries. It integrates text mining and social network analysis in order to identify new potential biomarkers for breast cancer.Entities:
Keywords: Breast cancer; Data mining; Network analysis; Text mining
Mesh:
Substances:
Year: 2016 PMID: 27112211 PMCID: PMC4845430 DOI: 10.1186/s13104-016-2023-5
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Fig. 1Growth in number of abstracts about breast cancer in PubMed
Fig. 2Text annotation using BeCAS
Fig. 3Outline of the workflow and resources used in the proposed solution
Fig. 4The extraction of the country
Gene-disease associations from gene-year and gene-country analysis
| Gene | Disease associations for gene | Genes that share more diseases with this gene | Country associations for gene | |||
|---|---|---|---|---|---|---|
| Disease name | Score | Gene name | # of shared diseases | Country name | # of abstracts | |
| ERBB2 | Breast neoplasms | 0.414 | EGFR | 15 | United States | 4271 |
| Mammary neoplasms, experimental | 0.4 | PTGS2 | 13 | Italy | 808 | |
| Neoplasm metastasis | 0.396 | SOD2 | 12 | Japan | 806 | |
| Adenocarcinoma | 0.363 | TP53 | 11 | China | 799 | |
| Ovarian neoplasms | 0.331 | STAT3 | 10 | United Kingdom | 674 | |
| Prostatic neoplasms | 0.329 | CCND1 | 10 | Germany | 620 | |
| Lung neoplasms | 0.329 | ESR1 | 10 | France | 486 | |
| Stomach neoplasms | 0.321 | KRAS | 9 | Canada | 433 | |
| Cholangiocarcinoma | 0.308 | TNF | 9 | South Korea | 347 | |
| Glioma | 0.306 | TNFSF10 | 9 | Spain | 329 | |
| ESR1 | Breast neoplasms | 0.423 | SOD2 | 14 | United States | 5429 |
| Alzheimer disease | 0.358 | EGFR | 13 | United Kingdom | 1249 | |
| Neoplasm metastasis | 0.345 | PTGS2 | 12 | Japan | 918 | |
| Carcinoma | 0.344 | TNF | 11 | China | 764 | |
| Coronary artery disease | 0.342 | CDH1 | 10 | Italy | 727 | |
| Migraine disorders | 0.333 | ACE | 10 | France | 569 | |
| Obesity | 0.327 | ERBB2 | 10 | Germany | 517 | |
| Leiomyoma | 0.327 | PTEN | 9 | Canada | 515 | |
| Myocardial infarction | 0.323 | STAT3 | 9 | South Korea | 338 | |
| Infertility, male | 0.321 | TP53 | 9 | Sweden | 299 | |
| PGR | Breast neoplasms | 0.38 | EGFR | 7 | United States | 1887 |
| Endometriosis | 0.346 | ESR1 | 6 | Japan | 456 | |
| Carcinoma | 0.32 | ESR2 | 6 | Italy | 404 | |
| Meningioma | 0.307 | STAT3 | 5 | China | 385 | |
| Adenocarcinoma | 0.304 | EFEMP1 | 5 | United Kingdom | 311 | |
| Mammary neoplasms, animal | 0.3 | CDH1 | 5 | France | 294 | |
| Mammary neoplasms, experimental | 0.3 | PHB | 5 | Germany | 245 | |
| Mesothelioma | 0.3 | PDGFA | 5 | Canada | 188 | |
| Recurrence | 0.3 | STAT5A | 5 | South Korea | 169 | |
| Malignant neoplasm breast | 0.126 | ENO1 | 5 | Sweden | 131 | |
| EGF | Hypomagnesemia 4, renal | 0.6 | SOD2 | 11 | United States | 2199 |
| Wounds and injuries | 0.4 | IL6 | 9 | United Kingdom | 408 | |
| Breast neoplasms | 0.325 | MMP9 | 9 | Japan | 394 | |
| Prostatic neoplasms | 0.322 | PTGS2 | 9 | China | 378 | |
| Carcinoma, hepatocellular | 0.317 | TNF | 9 | Italy | 298 | |
| Neoplasm metastasis | 0.315 | PTEN | 8 | Germany | 239 | |
| Glioblastoma | 0.311 | EGFR | 8 | South Korea | 211 | |
| Adenocarcinoma | 0.307 | IGF1 | 8 | Canada | 205 | |
| Kidney diseases | 0.301 | IL8 | 8 | France | 173 | |
| Stomach ulcer | 0.3 | TGFB1 | 7 | Spain | 112 | |
| BRCA1 | Breast-ovarian cancer, familial, Susceptibility To, 1 | 0.7 | CDH1 | 7 | United States | 1845 |
| Malignant neoplasm breast | 0.54 | CCND1 | 7 | United Kingdom | 395 | |
| Malignant neoplasm of ovary | 0.44 | SOD2 | 7 | Canada | 304 | |
| Breast neoplasms | 0.419 | BRCA2 | 6 | France | 222 | |
| Mammary neoplasms, experimental | 0.4 | HRAS | 6 | The Netherlands | 218 | |
| Ovarian neoplasms | 0.381 | STAT3 | 6 | Italy | 197 | |
| Neoplasms | 0.375 | EGFR | 6 | China | 182 | |
| Carcinoma | 0.366 | ERBB2 | 6 | Spain | 143 | |
| Hereditary breast and ovarian cancer Syndrome | 0.359 | ESR1 | 6 | Germany | 140 | |
| Prostatic neoplasms | 0.318 | AKT1 | 5 | Japan | 124 | |
| BRCA2 | Fanconi anemia, complementation Group D1 | 0.7 | BRCA1 | 6 | United States | 885 |
| Malignant neoplasm breast | 0.54 | CTNNB1 | 6 | United kingdom | 256 | |
| Ovarian neoplasms | 0.464 | ERBB2 | 6 | Canada | 203 | |
| Prostatic neoplasms | 0.409 | PTEN | 5 | Italy | 119 | |
| Medulloblastoma | 0.401 | SOD2 | 5 | The Netherlands | 115 | |
| Breast neoplasms | 0.392 | TNF | 5 | Germany | 102 | |
| Hereditary breast and ovarian cancer Syndrome | 0.334 | TNFSF10 | 5 | France | 100 | |
| Fanconi ANEMIA | 0.326 | AKT1 | 4 | Spain | 93 | |
| Pancreatic neoplasms | 0.309 | BRIP1 | 4 | Australia | 73 | |
| Wilms tumor | 0.3 | CDH1 | 4 | Israel | 70 | |
| CDKN2A | Melanoma-pancreatic cancer syndrome | 0.6 | TP53 | 15 | United States | 1809 |
| Melanoma, cutaneous malignant, susceptibility To, 2 | 0.6 | SOD2 | 12 | China | 431 | |
| Lung neoplasms | 0.442 | KRAS | 9 | Japan | 340 | |
| Stomach neoplasms | 0.411 | PTGS2 | 9 | United Kingdom | 325 | |
| Esophageal neoplasms | 0.41 | ABCB1 | 7 | Italy | 297 | |
| Neoplasms | 0.391 | CSF3 | 7 | France | 224 | |
| Adenocarcinoma | 0.358 | EGFR | 7 | Germany | 218 | |
| Glioma | 0.341 | ESR1 | 6 | South Korea | 202 | |
| Precursor cell lymphoblastic leukemia-Lymphoma | 0.338 | MET | 6 | Canada | 186 | |
| Carcinoma, non-small-cell lung | 0.332 | ERBB2 | 6 | Spain | 136 | |
| ALPPL2 | Abortion, spontaneous | 0.3 | CEACAM1 | 1 | United States | 104 |
| Parkinson disease | 0.003 | HSD17B1 | 1 | Germany | 32 | |
| Stroke | 0.003 | IFI35 | 1 | United Kingdom | 30 | |
| Carcinoma in situ | 0.001 | IFI44 | 1 | Italy | 28 | |
| Seminoma | 0.001 | IFI6 | 1 | France | 18 | |
| Retinal diseases | <0.001 | IFNA10 | 1 | Japan | 16 | |
| Embryonal neoplasm | <0.001 | IGFBP1 | 1 | Canada | 13 | |
| Carcinoma, embryonal | <0.001 | IGFBP6 | 1 | Greece | 11 | |
| – | – | IL11 | 1 | The Netherlands | 9 | |
| – | – | IL12B | 1 | China | 9 | |
| CD99 | Chondrosarcoma, mesenchymal | 0.3 | PDGFRA | 1 | United States | 342 |
| Neuroectodermal tumors, primitive, Peripheral | 0.012 | BCL2 | 1 | Japan | 77 | |
| Sarcoma, ewing | 0.01 | IL1A | 1 | Germany | 74 | |
| Breast neoplasms | 0.005 | MKI67 | 1 | Italy | 63 | |
| Carcinoma | 0.005 | – | – | China | 56 | |
| Neuroectodermal tumors, primitive | 0.004 | – | – | United Kingdom | 52 | |
| Osteosarcoma | 0.003 | – | – | Canada | 37 | |
| Neoplasms | 0.003 | – | – | France | 26 | |
| Lymphoma | 0.003 | – | – | Australia | 24 | |
| Adenocarcinoma | 0.003 | – | – | The Netherlands | 22 | |
| CHI3L1 | Schizophrenia | 0.319 | TNF | 3 | United States | 90 |
| Glioblastoma | 0.311 | MET | 3 | Japan | 17 | |
| Glioma | 0.31 | MGMT | 2 | United Kingdom | 17 | |
| Neoplasm invasiveness | 0.303 | TGM2 | 2 | Italy | 15 | |
| Osteoarthritis | 0.301 | ACO1 | 2 | France | 12 | |
| Asthma-related traits, susceptibility To, 7 | 0.3 | MMP9 | 2 | Denmark | 10 | |
| Hypertension | 0.103 | GDNF | 2 | Germany | 6 | |
| Asthma | 0.017 | FTL | 2 | Australia | 6 | |
| Arthritis, rheumatoid | 0.009 | ENO1 | 2 | Finland | 5 | |
| Neoplasm malignant | 0.005 | EGF | 2 | India | 5 | |
| SOD1 | Amyotrophic lateral sclerosis 1 | 0.66 | TNF | 19 | United States | 265 |
| Amyotrophic lateral sclerosis | 0.551 | SOD2 | 17 | Italy | 42 | |
| Hypertension | 0.402 | IL6 | 15 | Japan | 40 | |
| Deficiency diseases | 0.4 | PTGS2 | 14 | India | 39 | |
| Motor neuron disease | 0.341 | NOS2 | 13 | China | 31 | |
| Down syndrome | 0.323 | CAT | 13 | United Kingdom | 28 | |
| Atherosclerosis | 0.31 | AGT | 11 | Germany | 26 | |
| Diabetes mellitus, type 2 | 0.31 | IL1B | 10 | The Netherlands | 25 | |
| Ischemia | 0.309 | IFNG | 10 | Turkey | 20 | |
| Parkinson disease | 0.309 | ALB | 10 | Canada | 18 | |
| AMN | Imerslund-grasbeck syndrome | 0.601 | TNF | 3 | United States | 138 |
| Acute kidney injury | 0.3 | KNG1 | 3 | United Kingdom | 32 | |
| Neurogenic inflammation | 0.3 | TAC1 | 3 | Germany | 22 | |
| Edema | 0.3 | IL6 | 2 | Japan | 18 | |
| Extravasation of diagnostic and Therapeutic Materials | 0.3 | POMC | 2 | China | 16 | |
| anemia, megaloblastic | 0.003 | CALCA | 2 | Canada | 12 | |
| adrenoleukodystrophy | 0.003 | PTGS2 | 2 | France | 10 | |
| Nervous system malformations | 0.003 | INS | 2 | Italy | 9 | |
| Malabsorption syndromes | 0.001 | KLK1 | 1 | The Netherlands | 9 | |
| Adrenomyeloneuropathy | <0.001 | LCN2 | 1 | Australia | 7 | |
| CD40LG | Hyper-igm immunodeficiency syndrome, Type 1 | 0.629 | CCL2 | 4 | United States | 66 |
| Coronary artery disease | 0.306 | IL1B | 3 | Germany | 15 | |
| Pneumonia | 0.3 | IL6 | 3 | United Kingdom | 12 | |
| Amyotrophic lateral sclerosis | 0.3 | TNF | 3 | Japan | 11 | |
| Hypersensitivity | 0.3 | IL8 | 3 | Italy | 10 | |
| Necrosis | 0.3 | IFNG | 3 | Argentina | 9 | |
| Hypertension, pulmonary | 0.3 | IL5RA | 2 | China | 9 | |
| Diabetes mellitus, type 1 | 0.101 | HMOX1 | 2 | Australia | 7 | |
| Enterocolitis, necrotizing | 0.1 | IL13 | 2 | Denmark | 6 | |
| Periodontal diseases | 0.1 | IL17A | 2 | The Netherlands | 6 | |
| CD79A | Agammaglobulinemia | 0.3 | BTK | 1 | United States | 22 |
| Leukemia, lymphocytic, chronic, B Cell | 0.003 | CD19 | 1 | France | 8 | |
| Lymphoma, non-hodgkin | 0.003 | IGLL1 | 1 | China | 7 | |
| Lymphoma, B-cell | 0.003 | LRRC8A | 1 | Japan | 6 | |
| Leukemia, myeloid, acute | 0.003 | – | – | India | 4 | |
| Leukemia | 0.003 | – | – | Spain | 4 | |
| Multiple myeloma | 0.003 | – | – | Sweden | 3 | |
| Lymphoma | <0.001 | – | – | Belgium | 3 | |
| Takayasu arteritis | <0.001 | – | – | Finland | 3 | |
| Lymphoma, large B-Cell, diffuse | <0.001 | – | – | United Kingdom | 3 | |
| PRL | Prolactinoma | 0.415 | DRD2 | 9 | United States | 294 |
| Hyperprolactinemia | 0.412 | POMC | 8 | United Kingdom | 59 | |
| Adenoma | 0.33 | IL6 | 6 | Italy | 55 | |
| Lupus erythematosus, systemic | 0.325 | CYP19A1 | 6 | Canada | 40 | |
| Pituitary neoplasms | 0.311 | TNF | 6 | France | 31 | |
| Autistic disorder | 0.304 | ESR2 | 5 | Australia | 29 | |
| Growth hormone-secreting pituitary Adenoma | 0.302 | AGT | 5 | Japan | 28 | |
| Endometriosis | 0.301 | CNR1 | 5 | China | 23 | |
| Hypopituitarism | 0.301 | CRH | 5 | Spain | 20 | |
| Amenorrhea | 0.301 | CYP17A1 | 5 | India | 19 | |
| AFP | Carcinoma, hepatocellular | 0.398 | MMP9 | 5 | United States | 46 |
| Liver diseases | 0.303 | HMOX1 | 4 | Japan | 12 | |
| Liver cirrhosis, experimental | 0.3 | ENO1 | 3 | China | 11 | |
| Breast neoplasms | 0.3 | MMP2 | 3 | Germany | 8 | |
| Mammary neoplasms, experimental | 0.3 | ESR1 | 3 | Italy | 7 | |
| Liver neoplasms | 0.019 | HRAS | 3 | France | 6 | |
| Recurrent malignant neoplasm | 0.015 | NOS2 | 3 | Canada | 5 | |
| Hepatitis B | 0.014 | IGF1 | 3 | Ireland | 4 | |
| Neoplasm malignant | 0.012 | PTGS2 | 3 | Turkey | 3 | |
| Down syndrome | 0.011 | TNFSF10 | 3 | Singapore | 3 | |
| POMC | Obesity | 0.454 | TNF | 22 | United States | 27 |
| Proopiomelanocortin deficiency | 0.4 | IL6 | 17 | Italy | 10 | |
| Cushing syndrome | 0.331 | AGT | 15 | Japan | 9 | |
| Pituitary acth hypersecretion | 0.315 | IL1B | 15 | France | 7 | |
| Adrenal cortex diseases | 0.309 | PTGS2 | 15 | United kingdom | 6 | |
| Acth syndrome, ectopic | 0.306 | SOD2 | 14 | Spain | 5 | |
| Heart failure | 0.304 | ALB | 12 | The netherlands | 5 | |
| Spasms, infantile | 0.303 | INS | 12 | Germany | 4 | |
| Hypertension | 0.303 | BDNF | 11 | Austria | 4 | |
| Osteoporosis | 0.303 | CRH | 11 | Poland | 4 | |
Genes are related to diseases depends on gene-year and gene-country analysis
| ERBB2 | PRL | CD79A | CD40LG | AMN | SOD1 | CHI3L1 | AFP | CD99 | CDKN2A | BRCA2 | BRCA1 | EGF | PGR | ESR1 | POMC | ALPPL2 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Breast neoplasms | X | X | X | X | X | X | X | X | |||||||||
| Adenocarcinoma | X | X | X | X | X | ||||||||||||
| Mammary neoplasms, experimental | X | X | X | X | |||||||||||||
| Carcinoma | X | X | X | X | |||||||||||||
| Prostatic neoplasms | X | X | X | X | |||||||||||||
| Malignant neoplasm breast | X | X | X | ||||||||||||||
| Glioma | X | X | X | ||||||||||||||
| Hypertension | X | X | X | ||||||||||||||
| Neoplasms | X | X | X | ||||||||||||||
| Ovarian neoplasms | X | X | X | ||||||||||||||
| Neoplasm metastasis | X | X | X | ||||||||||||||
| Down syndrome | X | X | |||||||||||||||
| Lymphoma | X | X | |||||||||||||||
| Glioblastoma | X | X | |||||||||||||||
| Carcinoma, hepatocellular | X | X | |||||||||||||||
| Hereditary breast and ovarian cancer syndrome | X | X | |||||||||||||||
| Endometriosis | X | X | |||||||||||||||
| Parkinson disease | X | X | |||||||||||||||
| Neoplasm malignant | X | X | |||||||||||||||
| Obesity | XX | ||||||||||||||||
| Amyotrophic lateral sclerosis | X | X | |||||||||||||||
| Coronary artery disease | X | X | |||||||||||||||
| Lung neoplasms | X | X | |||||||||||||||
| Stomach neoplasms | X | X | |||||||||||||||
| Agammaglobulinemia | X | ||||||||||||||||
| Periodontal diseases | X | ||||||||||||||||
| Enterocolitis, necrotizing | X | ||||||||||||||||
| Diabetes mellitus, type1 | X | ||||||||||||||||
| Hypertension, pulmonary | X | ||||||||||||||||
| Necrosis | X | ||||||||||||||||
| Hypersensitivity | X | ||||||||||||||||
| Imerslund-Grasbeck syndrome | X | ||||||||||||||||
| Hyper-Igm immunodeficiency syndrome, type 1 | X | ||||||||||||||||
| Edema | X | ||||||||||||||||
| Adrenomyeloneuropathy | X | ||||||||||||||||
| Acute kidney injury | X | ||||||||||||||||
| Malabsorption syndromes | X | ||||||||||||||||
| Nervous system malformations | X | ||||||||||||||||
| Adrenoleukodystrophy | X | ||||||||||||||||
| Leukemia, lymphocytic, chronic, B-cell | X | ||||||||||||||||
| Neurogenic inflammation | X | ||||||||||||||||
| Extravasation of diagnostic and therapeutic materials | X | ||||||||||||||||
| Pneumonia | X | ||||||||||||||||
| Anemia, megaloblastic | X | ||||||||||||||||
| Adenoma | X | ||||||||||||||||
| Lymphoma, B-cell | X | ||||||||||||||||
| Heart failure | X | ||||||||||||||||
| Acth syndrome, ectopic | X | ||||||||||||||||
| Adrenal cortex diseases | X | ||||||||||||||||
| Pituitary acth hypersecretion | X | ||||||||||||||||
| Cushing syndrome | X | ||||||||||||||||
| proopiomelanocortin deficiency | X | ||||||||||||||||
| Hepatitis B | X | ||||||||||||||||
| Recurrent malignant neoplasm | X | ||||||||||||||||
| Liver neoplasms | X | ||||||||||||||||
| Liver cirrhosis, experimental | XX | ||||||||||||||||
| Liver diseases | X | ||||||||||||||||
| Lymphoma, non-hodgkin | X | ||||||||||||||||
| Amenorrhea | X | ||||||||||||||||
| Growth hormone-secreting pituitary adenoma | X | ||||||||||||||||
| Autistic disorder | X | ||||||||||||||||
| Pituitary neoplasms | X | ||||||||||||||||
| Lupus erythematosus, systemic | X | ||||||||||||||||
| Hyper prolactinemia | X | ||||||||||||||||
| Prolactinoma | X | ||||||||||||||||
| Lymphoma, large B-cell, diffuse | X | ||||||||||||||||
| Takayasu Arteritis | X | ||||||||||||||||
| Multiple myeloma | X | ||||||||||||||||
| Leukemia | X | ||||||||||||||||
| Leukemia, myeloid, acute | X | ||||||||||||||||
| Hypopituitarism | X | ||||||||||||||||
| Ischemia | X | ||||||||||||||||
| Asthma-related traits, susceptibility to, 7 | X | ||||||||||||||||
| Atherosclerosis | X | ||||||||||||||||
| Wilms tumor | X | ||||||||||||||||
| Pancreatic neoplasms | X | ||||||||||||||||
| Fanconi anemia | X | ||||||||||||||||
| Medulloblastoma | X | ||||||||||||||||
| Fanconi anemia, complementation group D1 | X | ||||||||||||||||
| Malignant neoplasm of ovary | X | ||||||||||||||||
| Breast-ovarian cancer, familial, susceptibility to, 1 | X | ||||||||||||||||
| Stomach ulcer | X | ||||||||||||||||
| Kidney diseases | X | ||||||||||||||||
| Wounds and injuries | X | ||||||||||||||||
| Hypomagnesemia 4, renal | X | ||||||||||||||||
| Recurrence | X | ||||||||||||||||
| Mesothelioma | X | ||||||||||||||||
| Mammary neoplasms, animal | X | ||||||||||||||||
| Meningioma | X | ||||||||||||||||
| Infertility, male | X | ||||||||||||||||
| Myocardial infarction | X | ||||||||||||||||
| Leiomyoma | X | ||||||||||||||||
| Migraine disorders | X | ||||||||||||||||
| Alzheimer disease | X | ||||||||||||||||
| Cholangio carcinoma | X | ||||||||||||||||
| Melanoma-pancreatic cancer syndrome | X | ||||||||||||||||
| Melanoma,CutaneousMalignant,SusceptibilityTo,2 | X | ||||||||||||||||
| Esophageal neoplasms | X | ||||||||||||||||
| Precursor cell lymphoblastic leukemia-lymphoma | X | ||||||||||||||||
| Motor neuron disease | X | ||||||||||||||||
| Deficiency diseases | X | ||||||||||||||||
| Amyotrophic lateral sclerosis1 | X | ||||||||||||||||
| Arthritis, rheumatoid | X | ||||||||||||||||
| Asthma | X | ||||||||||||||||
| Spasms, infantile | X | ||||||||||||||||
| Osteoarthritis | X | ||||||||||||||||
| Neoplasm invasiveness | X | ||||||||||||||||
| Schizophrenia | X | ||||||||||||||||
| Osteosarcoma | X | ||||||||||||||||
| Diabetes mellitus, type 2 | X | ||||||||||||||||
| Neuroectodermal tumors, primitive | X | ||||||||||||||||
| Neuroectodermal tumors, primitive, peripheral | X | ||||||||||||||||
| Chondrosarcoma, mesenchymal | X | ||||||||||||||||
| Carcinoma, embryonal | X | ||||||||||||||||
| Embryonal neoplasm | X | ||||||||||||||||
| Retinal diseases | X | ||||||||||||||||
| Seminoma | X | ||||||||||||||||
| Carcinoma in situ | X | ||||||||||||||||
| Stroke | X | ||||||||||||||||
| Abortion, spontaneous | X | ||||||||||||||||
| Carcinoma, non-small-cell lung | X | ||||||||||||||||
| Sarcoma, ewing | X | ||||||||||||||||
| Osteoporosis | X |
Gene-disease associations from gene–gene analysis
| Community ID | Disease names | # of total genes in community | # of genes sharing disease | Gene names |
|---|---|---|---|---|
| 0 | Cancer | 2105 | 133 | EPHB4, MYCN, SOX9, RPL22, SPARC, ABL1, EAF2, PDGFA, PDGFB, SLC39A1, SPP1, RPS3, UNC5B, PIWIL1, GALR2, ETS1, DAG1, ETV4, EWSR1, CHD4, ITGA3, F2R, MMP20, ITGAV, ADAM10, ITGB3, ITGB4, TUBA4A, ZEB2, PTHLH, PTH1R, NMU, TWIST1, STRAP, JAG2, S100A4, HOXA9, BMI1, GJA1, BMP2, BMP4, BMP7, JUP, BMPR1A, JTB, CD82, HOXC8, GPC3, RHOU, NUAK1, CTNNBIP1, ITIH1, BSG, YAP1, GLI1, CTAGE1, PVRL1, KIF14, PLAU, ALAS1, MMP1, MMP2, MMP7, MMP9, MMP11, MMP14, SDC1, NANOS1, ARHGEF6, KIF11, VGF, KLK11, NID2, SFRP1, SFRP2, SFRP4, CD248, ADAMTS1, PODXL, ANXA1, USP28, WNT1, WNT2, WISP1, WNT5A, WNT7A, ARL6IP5, SLIT2, WNT2B, RIN1, SHH, GEMIN5, LAMC2, MMP26, HIF3A, RUNX2, RUNX3, KLK3, CLDN2, CLDN1, SLC2A4, ARPC2, POSTN, USP6, ORM2, HHIP, SMURF2, EFNB2, SPINT2, CD9, FAM107A, CYR61, TIMP2, TIMP3, YKT6, SNAI2, SP5, ROBO1, IRAK3, NDC80, SNAI1, CTNNB1, LUM, CTSB, KLK13, PCDH8, BCR, DKK3, RPL10, SMAD2, SMAD4, RGL4, SMAD7 |
| Breast cancer | 65 | HOXA5, WISP3, WISP2, MEST, PTPN1, HOXB13, BMP5, BMP6, UBE2B, TLK1, ETV1, KLK4, NMI, NEUROD1, ADAM28, CSF1R, PER2, RHOU, LIMD1, PTPRJ, TIMP1, ARNT2, ARID4A, TIMP4, INHBA, LATS2, TNC, USP28, SLC2A3, IGHMBP2, IBSP, VCAN, VTN, AFF3, WASF2, SERPINE1, CST3, POLI, ETS2, CSTA, LAMA3, CTGF, ADAMTS8, FURIN, MMP3, MMP8, LCN2, SIX1, MMP13, WNT9A, PCBP1, F2RL1, F3, CTSK, F7, TUBA4A, F10, SERPINA5, SDC4, RNF11, BMPR2, ANXA8L2, KLK2, PINK1, HOXA1 | ||
| Prostate cancer | 63 | AMBP, RNF14, KLK4, KIAA0196, PTPN1, BMP5, BMP6, BMPR1B, BMPR2, PTPN12, HOXC8, CSF1R, PDX1, EAF2, SERPINA5, PAGE4, SPINT1, SLC39A1, ACAT2, PLG, DSPP, GLI2, COPE, IBSP, VCAN, CLPTM1, EHF, SERPINE1, DVL1, ETV1, PDGFD, LATS2, CDCP1, PLAU, CRISP3, DAZL, TREX2, ELK4, TIMP1, TSPY1, RLN2, ACVR2A, CYSLTR1, ITGA7, MMP12, KLK3, MMP15, MMP17, F2RL1, ATP2A1, F3, CTSK, INHA, GFI1, HOXB13, TIMP4, RPL10, KLK2, ADAMTS9, CST3, RLN1, ZNFX1, ADAMTS13 | ||
| Diabetes mellitus | 60 | RLN2, XYLT2, SERPINB2, PKLR, GJA1, BMP4, BMP6, BMP7, GREM1, NEUROD1, FBP1, UTS2, CALD1, TIMP1, HLA-DMB, TIMP3, PTPRN2, SPP1, TJP1, TNC, PTX3, KCNJ10, PLA2G4A, CLPS, SERPINE1, CST3, CD9, MTTP, SHH, LRP5, ANKRD1, PTPN22, KIF11, CTGF, MMP14, GCK, ISL1, MMP1, MMP2, FTO, MMP8, TIMP2, DCN, F2, CTSB, AKR1B1, F3, ITGB3, CLOCK, AQP7, SDC2, PTGES2, SLC2A4, GGT1, FABP1, FABP2, PINK1, CYBA, SMAD7, FOXC2 | ||
| Colon cancer | 51 | PMP22, MMP25, RNF14, HSPE1, PTPN1, C1GALT1C1, BMPR1A, DKK4, HTR2A, CYSLTR1, STRAP, TIMP1, TIMP4, LLGL1, TJP1, TNC, ASCL2, KLF9, FDPS, TOMM34, CNOT7, ZKSCAN3, SER- PINE1, CEACAM7, SOX17, OLFM4, LYPD3, PLA2G4A, HRH2, DLL1, NTN1, ADAMTS13, MMP3, ACVR2A, LCN2, MMP10, CDCP1, MMP13, ADAMTSL3, SRPRB, F2RL1, AKR1B1, CTSH, CLDN12, ITGB6, SDC2, KLK1, GGT1, B3GNT8, CD226, ACTR2 | ||
| 1 | Diabetes mellitus | 229 | 29 | GH1, GHR, SOCS2, NAMPT, LIPE, RETN, IGF2R, IGFBP1, IGFBP3, NUDT1, LNPEP, ADIPOR2, INS, FGF21, LPL, RBP4, POMC, APOA1, APOA2, IRS1, IDE, APOC3, HSD11B1, CFI, PLTP, LEPR, SLC2A2, ADD1, FABP4 |
| Obesity | 23 | GH1, GHSR, LIPE, IGF1, IGF2, IGFBP3, IGFBP6, ADIPOR2, INSR, SOCS3, SHBG, POMC, APOA2, IRS1, HSD11B1, RETN, SERPINA6, LEP, LEPR, SLC2A2, RBP4, FABP4, ADRB1 | ||
| Breast cancer | 22 | GH1, GH2, GHR, SOCS2, SLC12A6, VIP, IGF1, ADIPOQ, IGFALS, IGFBP1, TIAL1, FOXL2, INS, FBXO31, INSR, SOCS3, SHBG, SLC12A7, LEP, LEPR, ADIPOR2, IGFBPL1 | ||
| Cancer | 21 | IGFBP5, GHRH, GHRHR, GHSR, TPP2, TMPO, CLIC4, ELAVL1, PLAG1, PAPPA, LRP1, IRS1, PTP4A3, ADCYAP1R1, IGF1R, IGF2, IGF2R, OXT, IGFBP2, IGFBP3, IGFBP4 | ||
| Prostate cancer | 17 | GNA13, GHR, VIPR1, MYO6, PMEPA1, LAMB1, NUDT1, INS, SOCS3, LEP, VIP, ADIPOR2, AD- CYAP1, IGF1, ADIPOQ, IGFBP1, IGFBP2 | ||
| 2 | Cancer | 876 | 47 | S100A2, NAT1, S100A11, NES, CDR2, ERBB2, ERBB3, ERBB4, EREG, BCL11B, MET, SCGB2A2, RARB, EGF, LRIG1, RET, NKX2-1, TK1, PA2G4, AXL, ESR1, RARRES1, CYP24A1, RBP1, PROM1, CD24, MKI67, AGR2, GATA4, ETV6, MCM2, ALCAM, MME, GPNMB, KRT7, SCEL, KIR3DL2, CTSD, TACSTD2, KIT, AKR1C1, ALK, AR, RXRA, CXADR, CCKBR, CDX2 |
| Breast cancer | 40 | NEDD8, RNF5, LHCGR, GABRP, PLAC1, CARM1, NCOR1, KRT5, CD1A, NCOA2, SCGB2A1, SCGB1D2, LATS1, RARA, SRA1, STS, MAOA, KRT18, EML4, BTC, HBEGF, TNN, CYP19A1, ESRRA, GABARAP, PIP, HTATIP2, CYP27B1, GATA3, NRG2, NCOR2, GNA12, WWTR1, NRG3, KIAA0100, F8, AKR1C2, AKR1B10, GFRA1, AREG | ||
| Prostate cancer | 28 | MSR1, CARM1, NCOA2, LHB, NCOR2, NR2C1, ERBB3, RARA, STS, PELP1, HPN, CADM4, CYP19A1, ESRRA, ESRRB, HTATIP2, CYP27B1, LPXN, CYP7B1, GC, GNA12, HSD3B2, CHGA, WDR77, AR, RXRB, RXRG, AREG | ||
| Leukemia | 24 | CISH, NCOA2, CKMT2, RHOH, MYH11, BCL11B, RARA, CEBPE, HOXD3, HOXD13, TLE1, GRAP2, PVRL2, GATA1, IRF8, NSD1, ETV6, GNA12, CBL, CTSG, IL2RG, ENO2, F8, SLC4A1 | ||
| Diabetes mellitus | 20 | IRF8, MAP4K5, HTR1A, ACTG2, ADRA2B, GAD2, BTC, GC, AR, ESR1, TRPC6, DBI, CYP24A1, AKR1B10, CHGA, CYP27B1, MME, CD24, EGF, PLXDC1 | ||
| 3 | Cancer | 1210 | 101 | SOX2, VPRBP, ALOX12, HSP90B1, TRAF2, HIF1A, HK2, MAT2A, HSPH1, BAD, GATA6, BCL2, BCL2L1, TNFRSF10A, SPAG1, PTGS2, HNRNPK, NRP2, NRP1, TYMS, SEMA3A, TUBB3, S100P, RHBDF1, KCNA1, NDRG1, CEBPB, SPIN1, RBMX, BID, PKNOX1, FGF5, PLAGL1, KDR, BCL10, TOMM40, FGFR3, FGFR4, FH, FXYD3, EIF2AK3, HSPA5, VEGFC, MEN1, ALOX5, SDHA, SDHB, SDHC, SDHD, ALOX15B, MPG, VEGFB, CA9, IL24, VHL, SEMA4D, SFRP5, ANG, ANGPT1, ANGPT2, BCL2L10, DIABLO, ANXA2, PRDX4, L1CAM, BAG3, CASP3, ID2, BIRC2, BIRC3, XIAP, SIVA1, RELA, LYVE1, POT1, IDH1, CAV3, FIGF, TYMP, SLC2A1, CDC37, PDPN, INTS6, BNIP3, MTAP, LOX, PYCARD, NEK8, ASPH, RBM6, ALOX15, CA1, TMSB10, HSP90AA1, CTTN, ENDOG, ENG, OLIG2, BBC3, EPAS1, BIRC7 |
| Breast cancer | 57 | CRYAB, SLC6A3, PTN, ADM, TUBA1B, PARP1, KLF8, SEMA3C, TXNIP, VWF, PTPRB, CUEDC2, APLN, KLF10, IGFBP7, FCGR2A, SLC16A1, CLU, WFS1, SQSTM1, RBM3, CSNK2A1, RBMX, JAG1, MIF, TRPS1, BAK1, CASP2, MAZ, XBP1, FADS2, FGF4, CASP9, IRF1, KLF4, REL, GNA11, HES1, SMPD1, NFATC1, BCL2A1, CCL16, SLC25A5, TNFRSF10B, APOE, RRAS, IKBKB, HSF1, IL1R1, FASLG, OSGIN1, RSPO1, PLXNA1, PRDX6, LSM1, CACNA1H, BIK | ||
| Colon cancer | 41 | HSPD1, AIFM1, ACSL4, TRAF1, HIP1, NOD1, CLCA1, EFNB3, SAT1, NFATC1, HTRA2, CLU, FGF20, CALR, MYOD1, HPRT1, ANXA5, MIF, PRDX1, FES, CASP6, FGF7, AATF, TMEM97, ATF3, FGF18, GLRX3, TNKS2, LCP1, HSF1, FASLG, HSPA1A, SLC16A7, FGF19, HSPA8, DDIT3, MAF, PMAIP1, KIF2C, MPG, TXN | ||
| Diabetes mellitus | 40 | TXNIP, ADM, PCSK2, PARP1, HIF1A, VWF, CEBPB, ANGPT1, ANGPT2, APLN, IMPDH2, ANG, KCNJ11, SLC16A1, WFS1, CAPN1, PRDX6, KLF10, XBP1, FADS2, CASP10, SI, PLAGL1, RELA, LTBR, TNFRSF10B, APOE, PBX1, PCBD1, TNMD, ENG, HSPA1A, EIF2AK3, HSPA5, ALOX5, SLC2A1, HSPB2, CA1, KLF2, TXN | ||
| Prostate cancer | 38 | HSPD1, RND3, PTN, ADM, AMD1, MAPK8IP1, TPT1, CDC37, IGFBP7, XBP1, GAPDH, SPINK1, CLU, AIFM1, SQSTM1, JAG1, CAPN1, MIF, PBX1, FGF1, LAMA5, LAMC1, FGF8, FGF9, CACNA1H, ATF3, BCL2A1, APOE, IKBKB, CHUK, PCBP2, HSPA1A, HNRNPA1, RELB, LSM1, FABP5, TXN, RPL19 | ||
| 4 | Diabetes mellitus | 601 | 31 | GSTM1, SLC6A2, GSTP1, CYP1A1, GSTT1, MT1A, TSC22D1, ARNT, LIPC, IAPP, CETP, SLC22A4, SLC22A5, AGTR1, AGTR2, PON1, AHSG, UCP2, PYGL, CAT, REN, KEAP1, IL1RAP, ATP2A2, F5, GFPT1, EDN1, EDNRA, SOD1, SOD3, KL |
| Cancer | 26 | GSTM1, EPHX2, GSTP1, CYP1A1, CYP1A2, CYP1B1, SLC22A18, PDCD2, TSC22D1, IAPP, PHF19, RB1, CETP, GSTT1, GLRX, FECH, AOX1, TSPO, APOBEC1, SIM2, AGPAT2, COPS2, MAP4K4, MVP, | ||
| Breast cancer | 24 | SLC22A18, PPARGC1A, CYP2B6, ARNT, CYP4Z1, PIN1, CYP21A2, INSL4, AGTR2, SLC19A3, AHR, SLCO1B3, ZFHX3, AGTR1, CAT, HSD17B1, HSD17B2, ACE, GSTO1, ATP2B2, SLC26A1, EDN1, EDNRA, SOD1 | ||
| Hypertension | 21 | TSPO, PPARGC1A, ACE, EPHX2, ATP2A2, GCLC, UCP2, ENPEP, SLC6A2, CAT, GSTT1, EDN1, HSD3B1, REN, CYP21A2, SLC22A2, SOD3, CFTR, AGTR1, IAPP, DBH | ||
| Atherosclerosis | 21 | GSTM1, VKORC1, SOD3, PON1, AHSG, GSTO1, CYP1A1, GSTT1, GCLM, SOD1, LDLR, NR1H3, ABCC6, EDN1, KL, EDNRA, ABCD1, APOC2, AGTR1, UCP2, EPHX2 | ||
| 5 | Cancer | 988 | 78 | EPHB2, SLC5A5, RABEP2, RHOA, RHOB, RHOC, MST1R, HRAS, RALA, RALB, WNK2, ARHGDIB, RPS6KB1, PEBP1, NEK3, PTPN6, VRK1, KRAS, JUN, MAPK14, KCNA2, PTPRA, ILK, KCNA5, AKT3, JAK2, PXN, BRAF, P2RX5, AKAP12, PTPRK, MAP2, EGR1, SDCBP, KCNH2, PIK3CG, PIK3R1, EZR, RPS6KA2, CXCL17, EIF4A2, EIF4E, DLGAP5, DAB2, IQSEC1, MAP3K1, KHDRBS1, TSC1, GPR56, TNK2, TIAM1, AKT1, AKT2, PLCB2, VAV3, PRMT3, CAV1, HBP1, GPRC5A, ARHGEF2, SNCG, MAP2K4, MELK, KISS1, GDF15, KIAA1524, SPHK1, TRIB3, RAF1, PTK2, DLC1, PKN3, CRK, RAC1, BCAR1, RAC3, LGALS7, ARF6 |
| Breast cancer | 50 | MST1, IL11RA, ADORA2B, LIMK1, EEF2, BMX, SLC9A3R1, DNAJA3, CSK, PHLDA1, IKBKE, SLC9A1, PTPRZ1, CSNK1A1, MAPKAPK2, KCNJ3, DUSP1, PDCD4, DUSP6, MBL2, EIF4EBP1, SH2D3C, EIF4G1, PAK1, ETV5, ATAD2, MLLT4, ROCK1, ACTN4, NR3C2, PLCD1, RHEB, PLD1, RB1CC1, NFATC2, EEF1D, FHL2, CHN2, RACGAP1, TSC2, TUBB, LPAR2, SH2D3A, RAB27A, RPL7A, DIRAS3, GAB2, PTK6, NEK3, WASL | ||
| Prostate cancer | 36 | TYK2, FOXO1, IL11RA, PTK2B, HSPG2, SPRY2, JUND, LIMK1, SET, BMX, MAK, RAP2A, JAK1, NOX1, CSK, EGR1, F2RL3, MAPKAPK2, FDFT1, TLE3, RPS6KA3, EIF4EBP1, CPNE3, LRP2, ETV5, WFDC1, TRPM8, ELK1, PLCG1, UBIAD1, PAK6, REPS2, FHL2, LPAR1, RHEB, ITPR1 | ||
| Diabetes mellitus | 31 | DUSP12, EZR, GIP, ADORA2B, JUN, MAPK14, NOX1, SLC12A3, PTPRN, PIK3CG, LPA, INPPL1, EIF4A2, MBL2, LRP2, PLA2G2A, RDX, AKT1, AKT2, RORC, LRRC7, EIF4E, ARHGEF11, CHRM3, ELMO1, ITPR3, MAP3K1, CRTC2, EXOC4, MSN, TRPC1 | ||
| Rheumatoid arthritis | 24 | SLC5A5, RHOA, JAK2, JUN, MAP2K4, MRAS, NEDD9, BMX, PIK3CG, MAPK14, CENPJ, CSK, EGR1, IKBKE, GDF15, TRPC1, MBL2, EIF4G1, LRP2, C5, LPAR1, GAB2, RAC1, MAP3K2 | ||
| 6 | Cancer | 1397 | 122 | MYC, CDKN1A, CDKN1B, CDKN1C, CDKN2A, CDKN2B, CDKN2C, SP1, SP4, PTTG1, ERCC1, ERCC2, CEBPA, ERH, ATR, STAG1, XRCC1, TRIO, HDAC8, PPP1R13L, BARD1, DCK, NBN, MCM3, EZH2, MCM7, CCND1, CAGE1, CHEK1, ALDH1L1, DCC, RRM1, RRM2, MDM4, ID4, ECT2, GADD45A, MOAP1, TUBG1, RYR1, DDX5, MAP3K4, NIT2, ADH1B, ADH1C, AQP1, HDAC3, CKS1B, FAP, RPRM, MGMT, BRCA1, BRCA2, KCNH1, TMPRSS2, SUPT7L, BUB1, MLH1, CDC73, FHIT, MBD4, PLK1, COPS5, SMYD3, BRMS1, RAD51, FOXM1, PMS2, BCL2L15, HDAC5, RBBP4, NEIL1, RBL2, UBE2C, APC, SHMT1, APEX1, RECQL, E2F2, E2F3, MSH2, LASP1, RNF139, NEK2, XRCC3, SKP2, IGF2BP1, ASH2L, PDLIM5, CCNA2, CCNB1, CCND2, CCND3, CCNE1, CCNG1, MSH6, TRAF4, IGF2BP3, MTA1, RNF2, RFWD2, MTHFR, EPHA2, WIF1, FBXO4, CST6, EXO1, SMARCA4, SMARCB1, DAPK1, RPL11, E2F1, ATM, FSCN1, PUM1, SH2D1A, MUTYH, MAD2L1, PCNA, XPC, AURKB, MYBL2 |
| Breast cancer | 64 | CDK9, RAD52, TOPBP1, FANCD2, MRE11A, HSPB8, MYBBP1A, RPS6KA6, BCAS2, ERCC4 RNASEL, CEBPD, HDAC6, HDAC4, XRCC2, DERL1, NOL3, MUS81, CENPF, CTCF, INHBB, RBBP7, RBBP8, RBL1, CCNE2, POLB, KLF5, C1QA, WWP1, XRCC4, CAPN2, PRC1, PEMT, MED14, PAK2, BCCIP, MTRR, XDH, SMARCE1, FOXP1, SH3GL2, E2F4, NCL, PBOV1, ANXA8, RRAD, SIPA1, CHKA, ATP1B2, MBD2, NOD2, PRDM14, DDB2, DUSP22, RGS2, PALB2, EP300, CLSPN, HIST2H3A, MLLT11, RAD50, RAD17, KPNA2, RAD23B | ||
| Prostate cancer | 41 | GLIPR1, SUMO1, ERCC1, MT2A, IRX5, RNASEL, RCHY1, CEBPD, ERG, PALB2, BRCA2, PI16, BTRC, LZTS1, RBL1, KLF5, SGTA, TSGA10, SMARCA2, CCNA1, PAK2, SMARCC1, MTRR, FOXP1, TSG101, MSH3, PBOV1, RPS27A, TOPORS, SENP1, NUPR1, AQP3, CREBBP, MECP2, MSMB, ELAC2, EP300, CDK5R1, RAD9A, PCNT, RAD21 | ||
| Colon cancer | 39 | BLM, CITED2, SLC6A4, MRE11A, SND1, CDX1, MLH3, DDX17, HTR3A, LMNA, POLD1, CENPA, NOL3, UCHL1, NEIL2, KLF5, MATK, BRD7, TSGA10, RPS6KA6, HLTF, BCAT1, BCHE, CTBP1, E2F4, XPA, MSH3, LTC4S, CDC16, CHKA, CD3EAP, AIM2, METAP2, EP300, PPM1H, DDX5, RAD18, NOD2, CA8 | ||
| Embryoma | 34 | BLM, PCSK1, AVEN, HDAC11, HOXC9, RNASEL, RPRM, RNF2, HTR3A, FBXO4, STAG1, RCHY1, BCAS2, KLF5, UCHL1, WWP1, GAS1, ASS1, MTR, UHRF1, RECQL, NCL, XPO1, CCL23, CBS, MECP2, HDAC8, PALB2, RPL11, MAP3K4, HMG20B, DNMT3L, PCNT, KPNA2 | ||
| 7 | Leukemia | 967 | 77 | MPO, IFNG, CXCR5, IL11, SELE, RAG1, RAG2, SELL, ITGAX, ORM1, IL10, KLRC1, CD2, CD52, IL18, CD5, CSF1, CD7, CD8A, CSF3, CSF3R, CD19, P2RX7, CIITA, CALCA, ASAH1, CD86, TN- FRSF8, CD33, IGHM, CCL21, B2M, PVR, IL21, TNF, LAIR1, CCL2, CD160, HLA-A, ULBP2, CCL3, ICAM1, LAMP1, HLA-B, CCL4, CCL5, CCR4, GNLY, KIR3DL1, CCL11, CTLA4, CCL18, GCNT1, CCL19, ITGA4, CHIT1, CCL22, IL1A, IL1B, ITGAL, ITGAM, LYZ, IL2, IL2RA, IL2RB, IL3RA, TTR, IL4, IFNA1, CD83, IL6, IL7, SPANXB1, BGN, PML, PDCD1LG2, FAIM3 |
| Rheumatoid arthritis | 70 | SELE, OSM, DEFA1, TPSAB1, ADORA3, IL15, IL16, TNFRSF9, IL17A, MAL, MGAT5, CD5, CSF1, KIR2DL1, TIA1, CD14, HPSE, HLA-C, FCGR3A, SELP, ITGA4, HRH4, CD80, CD86, TNFRSF8, ACP5, ICAM1, ICAM3, IL21, MITF, CCL18, IL1A, LAMP3, CXCL13, CD274, MDK, CCRL2, ICOS, CCL3, IRF3, CCR2, CCL5, LTA, CCL3L1, P2RX7, CCL11, CCL13, TLR2, C5AR1, HAMP, GCNT1, CXCL16, CHI3L1, LTB4R2, TNFRSF17, IL1B, IL2, XCL1, CX3CL1, ITGB2, CCL20, IL4, CD276, CD83, CXCL12, IL7, TNF, PML, PDCD1LG2, LGALS9 | ||
| Prostate cancer | 59 | ARG2, IFNB1, A2M, IL10RA, MAGEA1, MAL, MAGEA4, S100A9, CXCL10, IFNG, IL15, IL16, IL18, TES, MGAT5, CSF1, CALCA, CALCR, HLA-A, ASAH1, MPO, SEC62, AGER, IL10, AZGP1, ITGA5, ICAM1, TLR1, TLR3, MCAM, CCL2, CD55, STEAP2, B2M, CCR1, CCR2, CCL5, CCR9, TNF, CTLA4, CRP, ITGA2, GCNT1, CXCL16, CHI3L1, TLR6, CHIT1, S100A8, OSM, LCT, IL1RN, IL2, CSMD1, IL4, IL6, ACPP, PML, PTMA, RING1 | ||
| Diabetes mellitus | 57 | MPO, IFNG, DEFA1, SELE, EPO, IL13, SELL, IL15, SELP, CD4, ITGA2B, CSF3, HLA-A, LCAT, HP, CD86, AGER, GLP1R, ICAM1, TLR3, IL21, P2RX7, CMA1, MDK, MCAM, CD55, HRH4, CCL2, CASQ1, CCR2, CCL5, LTA, ALAD, GNAI2, TNF, CTLA4, GGT2, ITGA2, GCNT1, KIR2DL2, IL1A, ITGAM, HPSE, ITGB2, TTR, IL4, IFNA1, MEF2C, PCK1, CXCL12, CD163, LGALS3, BGLAP, CRP, MC3R, TNFRSF4, APOC1 | ||
| Cancer | 55 | MAGEA3, CCNT1, EPOR, IL13RA2, AMPH, SERPINB4, CEACAM5, KITLG, GALNT3, FCER2, ANPEP, MS4A1, SPN, PDZK1IP1, NCR2, CD99, AFP, EPO, CD34, THY1, CAPG, CYP27A1, VTCN1, TIA1, C1QBP, CEACAM6, CXCL14, ST3GAL6, EBAG9, HPSE2, CCR3, ST3GAL4, PAX5, ATOH1, STIL, BCL6, CASC5, MDK, PBX2, CTSE, MUC2, SLAMF1, ST18, IL3, HPSE, MUC6, HNRNPF, CXCL12, LGALS1, LGALS3, SLC3A2, CD200, CEACAM1, TPD52, FGFBP1 |
The number of gene mentions
| All Abstracts | Abstracts with gene mentions | ||
|---|---|---|---|
| United States | 62,013 | United States | 33,373 |
| United Kingdom | 11652 | China | 6553 |
| China | 8858 | United Kingdom | 6041 |
| Japan | 8807 | Japan | 5299 |
| Italy | 8667 | Italy | 4621 |
| Germany | 7394 | Germany | 4148 |
| France | 6757 | France | 3642 |
| Canada | 6476 | Canada | 3573 |
| The Netherlands | 4071 | South Korea | 2144 |
| Australia | 3601 | The Netherlands | 1844 |
Fig. 5The top 500 most frequently mentioned genes are shown, where radius represents the number of abstracts which mentioned the gene, and the colour represents the country which mentioned the gene the most
Fig. 6Collaboration between the top 10 countries in regard to breast cancer abstracts
Fig. 7Collaboration between the top 10 countries in regard to breast cancer abstracts that contain genes
Fig. 8For the top 21 most frequently mentioned genes, the distribution of gene mentions by country is colored
Top 10 genes are mentioned by each country
| Country name | # of abstracts | Gene name | Country name | # of abstracts | Gene name | ||
|---|---|---|---|---|---|---|---|
| United States | 33,373 | ESR1 | 5429 [16.27 %] | Germany | 4148 | ERBB2 | 620 [14.95 %] |
| ERBB2 | 4271 [12.8 %] | ESR1 | 517 [12.46 %] | ||||
| EGF | 2199 [6.59 %] | PGR | 245 [5.91 %] | ||||
| PGR | 1887 [5.65 %] | EGF | 239 [5.76 %] | ||||
| BRCA1 | 1845 [5.53 %] | CDKN2A | 218 [5.26 %] | ||||
| CDKN2A | 1809 [5.42 %] | SLC20A2 | 191 [4.6 %] | ||||
| SLC20A2 | 1418 [4.25 %] | BRCA1 | 140 [3.38 %] | ||||
| TKT | 1297 [3.89 %] | CYP19A1 | 120 [2.89 %] | ||||
| ACAD9 | 1143 [3.42 %] | KRT75 | 120 [2.89 %] | ||||
| CYP19A1 | 1073 [3.22 %] | TKT | 116 [2.8 %] | ||||
| United Kingdom | 6041 | ESR1 | 1249 [20.68 %] | France | 3642 | ESR1 | 569 [15.62 %] |
| ERBB2 | 674 [11.16 %] | ERBB2 | 486 [13.34 %] | ||||
| CYP19A1 | 425 [7.04 %] | PGR | 294 [8.07 %] | ||||
| EGF | 408 [6.75 %] | CDKN2A | 224 [6.15 %] | ||||
| BRCA1 | 395 [6.54 %] | BRCA1 | 222 [6.1 %] | ||||
| CDKN2A | 325 [5.38 %] | EGF | 173 [4.75 %] | ||||
| PGR | 311 [5.15 %] | SLC20A2 | 165 [4.53 %] | ||||
| BRCA2 | 256 [4.24 %] | TKT | 131 [3.6 %] | ||||
| SLC20A2 | 227 [3.76 %] | CYP19A1 | 120 [3.29 %] | ||||
| INS | 188 [3.11 %] | CTSD | 114 [3.13 %] | ||||
| China | 6553 | ERBB2 | 799 [12.19 %] | Canada | 3573 | ESR1 | 515 [14.41 %] |
| ESR1 | 764 [11.66 %] | ERBB2 | 433 [12.12 %] | ||||
| CDKN2A | 431 [6.58 %] | BRCA1 | 304 [8.51 %] | ||||
| PGR | 385 [5.88 %] | EGF | 205 [5.74 %] | ||||
| EGF | 378 [5.77 %] | BRCA2 | 203 [5.68 %] | ||||
| ACAD9 | 336 [5.13 %] | PGR | 188 [5.26 %] | ||||
| MYLIP | 327 [4.99 %] | CDKN2A | 186 [5.21 %] | ||||
| BCL2 | 312 [4.76 %] | INS | 146 [4.09 %] | ||||
| ABCB1 | 209 [3.19 %] | TKT | 137 [3.83 %] | ||||
| CASP3 | 203 [3.1 %] | SLC20A2 | 136 [3.81 %] | ||||
| Japan | 5299 | ESR1 | 918 [17.32 %] | The Netherlands | 1844 | ESR1 | 267 [14.48 %] |
| ERBB2 | 806 [15.21 %] | BRCA1 | 218 [11.82 %] | ||||
| PGR | 456 [8.61 %] | ERBB2 | 181 [9.82 %] | ||||
| EGF | 394 [7.44 %] | BRCA2 | 115 [6.24 %] | ||||
| CDKN2A | 340 [6.42 %] | PGR | 115 [6.24 %] | ||||
| CYP19A1 | 210 [3.96 %] | EGF | 97 [5.26 %] | ||||
| SLC20A2 | 159 [3 %] | CDKN2A | 90 [4.88 %] | ||||
| CEACAM3 | 151 [2.85 %] | SLC20A2 | 82 [4.45 %] | ||||
| BCL2L14 | 129 [2.43 %] | ABCB1 | 81 [4.39 %] | ||||
| ABCB1 | 129 [2.43 %] | BCL2L14 | 69 [3.74 %] | ||||
| Italy | 4621 | ERBB2 | 808 [17.49 %] | Australia | 1715 | ESR1 | 260 [15.16 %] |
| ESR1 | 727 [15.73 %] | ERBB2 | 166 [9.68 %] | ||||
| PGR | 404 [8.74 %] | PGR | 123 [7.17 %] | ||||
| EGF | 298 [6.45 %] | BRCA1 | 120 [7 %] | ||||
| CDKN2A | 297 [6.43 %] | EGF | 94 [5.48 %] | ||||
| SLC20A2 | 238 [5.15 %] | SLC20A2 | 85 [4.96 %] | ||||
| BRCA1 | 197 [4.26 %] | BRCA2 | 73 [4.26 %] | ||||
| INS | 171 [3.7 %] | INS | 72 [4.2 %] | ||||
| TKT | 159 [3.44 %] | ARL11 | 71 [4.14 %] | ||||
| CYP19A1 | 156 [3.38 %] | CDKN2A | 68 [3.97 %] |
Fig. 9The division of effort by the top 10 countries, for the top 100 genes for those countries
Fig. 10The proportion of gene mentions by each country
Fig. 11In consideration of other genes that these countries have studied, this figure shows how much of that effort was placed on these genes
Fig. 12Hierarchical clustering of the countries, based on the genes that each country studied. This figure shows how similar the research interests are across the countries
Represented 5 highest maximal closed frequent item sets for Gene-Country
| Gene maximal closed frequent item set | Support |
|---|---|
| ERBB2, ESR1, PGR | 48.43 |
| EGF, ERBB2, ESR1 | 46.54 |
| BRCA1, ERBB2, ESR1 | 45.91 |
| BRCA1, BRCA2 | 45.28 |
| CDKN2A, ESR1 | 45.28 |
Fig. 13Black nodes are genes listed in the third gene-country item set in Table 2. As described by GeneMania, the purple connections represent co-expression, whereas the red connections represent physical interaction between the gene products
Represented 5 highest maximal closed frequent item sets for Gene-Year
| Gene maximal closed frequent item set | Support |
|---|---|
| CEACAM3, ESR1 | 82.69 |
| ALPPL2, CD99, CEACAM3, CHI3L1, ESR1, MUC21, SOD1 | 78.85 |
| AMN, CD40LG, CD79A, CEACAM3, ESR1, PRL | 78.85 |
| AFP, CEACAM3, ESR1 | 76.92 |
| CD99, DHPS, POMC | 76.92 |
Fig. 14Black nodes are genes listed in the third Gene-Year item set in Table 3. The connections between the genes are described by GeneMania as blue for co-localization of the gene products and purple for co-expression of the genes
Top 10 diseases associated with genes derived from the union of the top 5 gene-year and gene-country itemsets
| Disease name | Genes |
|---|---|
| Breast neoplasms | ERBB2, ESR1, PGR, EGF, BRCA1, BRCA2, CD99, AFP |
| Adenocarcinoma | ERBB2, PGR, EGF, CDKN2A, CD99 |
| Mammary neoplasms, experimental | ERBB2, PGR, BRCA1, AFP |
| Carcinoma | ESR1, PGR, BRCA1, CD99 |
| Prostatic neoplasms | ERBB2, EGF, BRCA1, BRCA2 |
| malignant neoplasm breast | PGR, BRCA1, BRCA2 |
| Glioma | ERBB2, CDKN2A, CHI3L1 |
| Hypertension | CHI3L1, SOD1, POMC |
| Neoplasm | BRCA1, CDKN2A, CD99 |
| Ovarian neoplasms | ERBB2, BRCA1, BRCA2 |
Fig. 16Two dimensional representations of the fuzzy clusters for the gene-year (a) and for gene-country (b) relationships. For the gene-year clusters, the number of clusters was set to 3, whereas for the gene-country clusters the number of clusters was set to 4. Each color represents a different cluster. Points marked by a blue’x’ are the maximal closed items from the frequent mining analysis in Tables 2 and 3
Fig. 15Validation of the number of fuzzy clusters using various measures
Fig. 17The full gene–gene network derived from the co-occurrence of genes within the abstracts. The ring of noise (disconnected genes) surrounds the network. The network is difficult to understand in this form, prior to pruning
Statistical information for gene–gene network
| Nodes | % | Edges | % | |
|---|---|---|---|---|
| Full network | 8400 | 100 | 213,894 | 100 |
| Giant component | 7620 | 90.71 | 213,877 | 99.99 |
| Pruned giant component | 1089 | 12.96 | 6815 | 3.19 |
Network Analysis measurements for the gene–gene network
| Betweenness centrality | Modularity class | Closeness centrality | Modularity class | ||
|---|---|---|---|---|---|
| ESR1 | 0.09 | 2 | ESR1 | 0.62 | 2 |
| ERBB2 | 0.06 | 2 | ERBB2 | 0.6 | 2 |
| CDKN2A | 0.04 | 6 | CDKN2A | 0.58 | 6 |
| SLC20A2 | 0.03 | 2 | SLC20A2 | 0.57 | 2 |
| EGF | 0.02 | 2 | EGF | 0.57 | 2 |
| PGR | 0.02 | 2 | PGR | 0.56 | 2 |
| BRCA1 | 0.02 | 6 | ACAD9 | 0.55 | 5 |
| CDH1 | 0.02 | 0 | CDH1 | 0.55 | 0 |
| ACAD9 | 0.02 | 5 | MAPK10 | 0.55 | 5 |
| HLA-H | 0.02 | 6 | TKT | 0.55 | 2 |
The top 10 genes with the highest betweenness are shown, as well as the top 10 genes with the highest closeness. The modularity class is also shown, where it denotes the community that the gene belongs to
Fig. 18The gene–gene network. Each community is represented as a different color
Fig. 19The co-expression network retrieved from GeneMania, which was used to validate the relationships between the genes within the smallest community (community 1) from our gene–gene network. Each circle represents a gene, and each purple line represents co-expression between the connected genes
Common diseases in each community
| Cancer | Breast cancer | Prostate cancer | Diabetes mellitus | Colon cancer | Obesity | Leukemia | Hypertension | Atherosclerosis | Rheumatoid arthritis | Embryoma | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Community 0 | X | X | X | X | X | ||||||
| Community 1 | X | X | X | X | X | ||||||
| Community 2 | X | X | X | X | X | ||||||
| Community 3 | X | X | X | X | X | ||||||
| Community 4 | X | X | X | X | X | ||||||
| Community 5 | X | X | X | X | X | ||||||
| Community 6 | X | X | X | X | X | ||||||
| Community 7 | X | X | X | X | X |