| Literature DB >> 32508008 |
Zeeshan Ahmed1,2, Saman Zeeshan3, Dinesh Mendhe1, XinQi Dong1,2.
Abstract
We are entering the era of personalized medicine in which an individual's genetic makeup will eventually determine how a doctor can tailor his or her therapy. Therefore, it is becoming critical to understand the genetic basis of common diseases, for example, which genes predispose and rare genetic variants contribute to diseases, and so on. Our study focuses on helping researchers, medical practitioners, and pharmacists in having a broad view of genetic variants that may be implicated in the likelihood of developing certain diseases. Our focus here is to create a comprehensive database with mobile access to all available, authentic and actionable genes, SNPs, and classified diseases and drugs collected from different clinical and genomics databases worldwide, including Ensembl, GenCode, ClinVar, GeneCards, DISEASES, HGMD, OMIM, GTR, CNVD, Novoseek, Swiss-Prot, LncRNADisease, Orphanet, GWAS Catalog, SwissVar, COSMIC, WHO, and FDA. We present a new cutting-edge gene-SNP-disease-drug mobile database with a smart phone application, integrating information about classified diseases and related genes, germline and somatic mutations, and drugs. Its database includes over 59 000 protein-coding and noncoding genes; over 67 000 germline SNPs and over a million somatic mutations reported for over 19 000 protein-coding genes located in over 1000 regions, published with over 3000 articles in over 415 journals available at the PUBMED; over 80 000 ICDs; over 123 000 NDCs; and over 100 000 classified gene-SNP-disease associations. We present an application that can provide new insights into the information about genetic basis of human complex diseases and contribute to assimilating genomic with phenotypic data for the availability of gene-based designer drugs, precise targeting of molecular fingerprints for tumor, appropriate drug therapy, predicting individual susceptibility to disease, diagnosis, and treatment of rare illnesses are all a few of the many transformations expected in the decade to come.Entities:
Keywords: clinical-genomics; database; diseases; drugs; genes; germline mutations; precision medicine; somatic mutations
Year: 2020 PMID: 32508008 PMCID: PMC7240856 DOI: 10.1002/ctm2.28
Source DB: PubMed Journal: Clin Transl Med ISSN: 2001-1326
Database description and statistics of ICD codes
| Categories | Count |
|---|---|
| Total diagnosis codes | 84 186 |
| Total ICD 10 codes | 70 663 |
| Total ICD 9 codes | 13 523 |
| Distinct diseases | 82 384 |
| Distinct diseases based on ICD 10 codes | 70 629 |
| Distinct diseases based on ICD 9 codes | 13 518 |
Gene database description and statistics
| Categories | Count |
|---|---|
| Genes‐disease combinations | 98 064 |
| Gene types | 26 |
| Chromosomes | 24 |
| Genes (including aliases) | 13 216 |
| Genes (Ensembl IDs) | 10 598 |
| Unique diseases | 12 257 |
| Genes‐disease combinations based on actionable genes | 32 089 |
| Distinguished genes‐disease source combinations | 809 |
| Cancer leading genes | 8063 |
SNP database description and statistics
| Categories | Count |
|---|---|
| SNP‐disease combinations | 101 439 |
| SNPs | 67 727 |
| Strongest SNP risk allele | 73 070 |
| SNP gene IDs | 13 979 |
| Reported genes | 19 669 |
| Regions | 1045 |
| Disease traits | 3041 |
| Literature (PUBMED articles) | 3186 |
| Contexts | 119 |
Gene‐disease databases comparison based on the following features: gene to disease, disease to ICD, data types, data sources, gene capacity, latest update, search results, and user friendly interface
| Database | Weblink | Gene to disease | Disease to ICD | Data types | Data sources | Gene capacity | Latest update | Search results | User friendly | Last date accessed |
|---|---|---|---|---|---|---|---|---|---|---|
| Disease Ontology |
| Yes | No | Human disease ontology | MeSH; ICD; NCI's thesaurus; SNOMED and OMIM | Not found | 2018 | DOID; Disease name | Yes | 01‐22‐2020 |
| DiseaseEnhancer |
| Yes | No | Disease‐associated enhancers | 1866 publications | 308 genes | 2018 | ID;Chr; Position; Disease; Target Gene | Yes | 01‐22‐2020 |
| DISEASES |
| Yes | Yes | Disease‐gene associations; cancer mutations; genome‐wide association studies | GHR; UniProtKB; GWAS results from DistiLD and COSMIC | 17 606 genes | 2018 | Genes; Identifiers; Disease name; Scores; Confidencity; Publication | No | 01‐22‐2020 |
| DisGeNET |
| Yes | No | Genotype‐phenotype relationships | CTD; UniProt; ClinVar; Orphanet; RGD; MGD; GAD | 17 381 genes | 2018 | Gene; Disease; Disease Class; Semantic Type; PMIDs; SNPs | No | 01‐22‐2020 |
| eDGAR |
| Yes | No | Gene/disease relationships | OMIM; Humsavar; ClinVa | 3658 genes | 2016 | Gene; Number of associated diseases; Associated diseases identifiers; Associated diseases names | Yes | 01‐22‐2020 |
| GTR |
| Yes | No | Genetic test information by providers | ClinVar; Genetics & Medicine; GeneReviews; MedGen; OMIM; Orphanet; NHGRI Glossary… | 16 451 genes | 2018 | Associated conditions; Copy number response; Genomic context; Variation; Related articles in PubMed | Yes | 01‐22‐2020 |
| MalaCard |
| Yes | Yes | Gene; genomics; proteins; gene ontology; pathways; drugs; diseases | 76 various sources | 71 150 genes | 2018 | Symbol; Aliases; Disorder; Score; IsCancerCensus; Sources | Yes | 01‐22‐2020 |
| SwissVar |
| Yes | No | Single amino acid polymorphisms (SAPs) and diseases | Disease:UniProtKB/SwissProt and their mapping to MeSH terms; variant:ModSNP database | Not found | Not found | Accession; Disease; Variant | Yes | 01‐22‐2020 |
| miR2Disease |
| Yes | No | MicroRNA deregulation in human diseases | Researchers submission; Pubmed text‐mining | 349 miRNAs | 2008 | miRNA; Disease; Relationship type; Target Gene; Reference | No | 01‐22‐2020 |
| HGMD |
| Yes | No | Germline mutations | GDB; OMIM; 250 journals | 6662 genes | 2017 | Disease/phenotype; Number of mutations; Gene Symbol | Yes | 01‐22‐2020 |
| ClinVar |
| Yes | No | Relationships among variations and human disorders | Clinical testing; research; extraction from the literature | 26 000 genes | 2018 | Variation; Genes; Conditions; Clinical significance; Review status | Yes | 01‐22‐2020 |
| Orphanet |
| Yes | Yes | Rare diseases and orphan drugs | OMIM; ICD10; MeSH; MedDRA; GARD and UMLS | 15 470 genes | 2018 | Genes; Disease; ORPHA ID; ICD | No | 01‐22‐2020 |
| Gene2Function |
| Yes | No | Orthologs among human genes and common genetic model species | OMIM; EBI; GWAS; HGNC; MOD; DIOPT; GO; SGD; ORF; NCBI; PDB; Uniprot | 10 499 genes | 2017 | Gene ID; Gene Symbol; Human Disease; Species Name… | No | 01‐22‐2020 |
FIGURE 1PAS components design, development, and data flow. PAS is an iOS app developed with Swift programming language, XCODE integrated development environment for MacOS, MySQL database management system, PHP scripting language, and UNIX‐based web and database servers
FIGURE 2PAS graphical user interface and work flow design
FIGURE 3PAS (iPhone 11 Pro) screenshot of gene results from searches for the 10 most common infectious diseases in the United States: (A) 22 chlamydia, (B) 86 influenza, (C) 31 staph, (D) 103 herpes, (E) 16 shigellosis, (F) 102 syphilis, (G) 185 pneumonia, (H) 325 hepatitis‐C, (I) 20 common cold, and (J) 17 salmonellosis
FIGURE 4PAS (iPhone 11 Pro) screen shots present searched results for all possible SNPs related to diabetes (2174), immune (2583), and schizophrenia (2286) diseases; searched results for diabetes‐related SNPs: HNF1 (142), HNF4A (46), and PAX4 (2); searched results for auto immune disease‐related SNPs: HLA‐DRB1 (216), TNFRSF1A (35), and PTPN22 (44); and searched results for schizophrenia‐related SNPs: DRD2 (10), CACNB2 (45), and GRM3 (16)
FIGURE 5PAS (iPhone 11 Pro) screen shots present eight searched results for all possible SNPs related to the eight different genes: 216 results for “ESR1” (A), 178 results for “AKT1” (B), 600 results for “ERBB24” (C), 344 results for “BRCA1” (D), 574 results for “BRCA2” (E), 125 results for “RBM10” (F), 110 results for “PTPN13” (G), and 71 results for “PPP6C” (H). Figure 4 also presents four examples of searched results for all possible SNPs and their diseases relationships: 216 results for entered and searched keyword “ESR1” (I), 178 results for entered and searched keyword “AKT1” (J), 600 results for “ERBB24” (K), and 344 results for “BRCA1” (L)
FIGURE 6PAS (iPhone 11 Pro) screen shots present searched results obtained during use cases for four different diseases: diabetes, influenza, fever, and sterile. Diabetes includes total 577 ICD (A), 69 ICD9 (B), 508 ICD10 (C), and 6 NDC (D). Influenza includes total 44 ICD (E), 16 ICD9 (F), 28 ICD10 (G), and 14 NDC (H). Fever includes total 110 ICD (I), 48 ICD9 (J), 62 ICD10 (K), and 284 NDC (L). Sterile includes total 18 ICD (M), 9 ICD9 (N), 9 ICD10 (O), and 138 NDC (P)
FIGURE 7PAS (iPhone 11 Pro) screen shots present searched results obtained for infectious disease i.e., Coronavirus (Fig. 7A), genes (Fig. 7B and 7C), and variants (Fig. 7D and 7E) including ACE2 and TMPRSS2