| Literature DB >> 31632691 |
Kyubum Lee1, Mindy Clyne2, Wei Yu3, Zhiyong Lu1, Muin J Khoury3.
Abstract
Understanding the drivers of research on human genes is a critical component to success of translation efforts of genomics into medicine and public health. Using publicly available curated online databases we sought to identify specific genes that are featured in translational genetic research in comparison to all genomics research publications. Articles in the CDC's Public Health Genomics and Precision Health Knowledge Base were stratified into studies that have moved beyond basic research to population and clinical epidemiologic studies (T1: clinical and population human genome epidemiology research), and studies that evaluate, implement, and assess impact of genes in clinical and public health areas (T2+: beyond bench to bedside). We examined gene counts and numbers of publications within these phases of translation in comparison to all genes from Medline. We are able to highlight those genes that are moving from basic research to clinical and public health translational research, namely in cancer and a few genetic diseases with high penetrance and clinical actionability. Identifying human genes of translational value is an important step towards determining an evidence-based trajectory of the human genome in clinical and public health practice over time. © This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply 2019.Entities:
Keywords: Genetics research; Translational research
Year: 2019 PMID: 31632691 PMCID: PMC6795796 DOI: 10.1038/s41525-019-0100-0
Source DB: PubMed Journal: NPJ Genom Med ISSN: 2056-7944 Impact factor: 8.617
Overall number of genes mentioned and gene-specific publication count categorized by (1) rank of top most common and (2) overall percentage of publications in PubMed, HuGE, and GPH
| PubMed | HuGE | GPH | |
|---|---|---|---|
| Total # of genes | 24,656 | 11,081 | 1846 |
| Top 5 genes | 2.16% | 7.18% | 19.36% |
| Top 10 genes | 3.50% | 12.13% | 28.31% |
| Top 20 genes | 5.54% | 19.54% | 37.64% |
| Top 30 genes | 7.25% | 24.36% | 43.19% |
| Top 50 genes | 10.17% | 31.16% | 49.14% |
| Top 100 genes | 15.17% | 41.83% | 57.67% |
| Top 200 genes | 21.56% | 53.59% | 66.98% |
| Top 400 genes | 29.80% | 65.16% | 76.86% |
| Top 500 genes | 32.83% | 68.76% | 79.74% |
| Top 1000 genes | 43.55% | 79.54% | 88.81% |
| Top 2000 genes | 56.28% | 88.79% | 100.00% (1846 genes) |
| Top 1% genes | 23.83% | 43.54% | 36.23% |
| Top 5% genes | 47.21% | 70.50% | 56.61% |
| Top 10% genes | 60.44% | 81.05% | 65.84% |
Genes ranked by publication count in PubMed, HuGE, and GPH
| Publication count | Statistical significance of the publications compared to PubMed | ||||
|---|---|---|---|---|---|
| Rank | PubMed | HuGE | GPH | HuGE | GPH |
| 1 | TP53 | APOE | BRCA1 | MTHFR | BRCA1 |
| 2 | TNF | MTHFR | BRCA2 | APOE | BRCA2 |
| 3 | EGFR | TNF | EGFR | HLA-DRB1 | PMS2 |
| 4 | VEGFA | EGFR | ERBB2 | GSTM1 | MSH6 |
| 5 | IL6 | HLA-DRB1 | KRAS | KRAS | MSH2 |
| 6 | APOE | TP53 | TP53 | ACE | LDLR |
| 7 | TGFB1 | ACE | BRAF | GSTT1 | MLH1 |
| 8 | MTHFR | IL6 | MLH1 | COMT | ERBB2 |
| 9 | ESR1 | KRAS | MSH2 | BRAF | KRAS |
| 10 | AKT1 | GSTM1 | LDLR | IL10 | PALB2 |
| 11 | HIF1A | IL10 | MSH6 | CYP2C19 | BRAF |
| 12 | NFKB1 | GSTT1 | PMS2 | VDR | EGFR |
| 13 | IL10 | COMT | CYP2C19 | SLC6A4 | NRAS |
| 14 | BRCA1 | BRAF | PIK3CA | CYP3A5 | CYP2C19 |
| 15 | ERBB2 | SLC6A4 | CYP2D6 | ABCB1 | CYP2D6 |
| 16 | MMP9 | BRCA1 | NRAS | EGFR | PCSK9 |
| 17 | HLA-DRB1 | ABCB1 | ALK | CYP2C9 | CHEK2 |
| 18 | IL1B | VDR | CFTR | GSTP1 | ALK |
| 19 | ACE | BRCA2 | CHEK2 | CYP2D6 | PIK3CA |
| 20 | APP | BDNF | PCSK9 | TNF | VKORC1 |
The ranking of “Publication count” column is simply sorted by the number of appearances in each database, and the other column is calculated and ranked using the z-score of each gene representing the significance of the publication count difference compared with PubMed