| Literature DB >> 31114876 |
Zongliang Yue1, Christopher D Willey2, Anita B Hjelmeland3, Jake Y Chen1.
Abstract
BEERE (Biomedical Entity Expansion, Ranking and Explorations) is a new web-based data analysis tool to help biomedical researchers characterize any input list of genes/proteins, biomedical terms or their combinations, i.e. 'biomedical entities', in the context of existing literature. Specifically, BEERE first aims to help users examine the credibility of known entity-to-entity associative or semantic relationships supported by database or literature references from the user input of a gene/term list. Then, it will help users uncover the relative importance of each entity-a gene or a term-within the user input by computing the ranking scores of all entities. At last, it will help users hypothesize new gene functions or genotype-phenotype associations by an interactive visual interface of constructed global entity relationship network. The output from BEERE includes: a list of the original entities matched with known relationships in databases; any expanded entities that may be generated from the analysis; the ranks and ranking scores reported with statistical significance for each entity; and an interactive graphical display of the gene or term network within data provenance annotations that link to external data sources. The web server is free and open to all users with no login requirement and can be accessed at http://discovery.informatics.uab.edu/beere/.Entities:
Year: 2019 PMID: 31114876 PMCID: PMC6602520 DOI: 10.1093/nar/gkz428
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The features comparison between BEERE and other three web servers: ToppGene, Phenolyzer and Semantic Medline
| Feature | BEERE | ToppGene | Phenolyzer | Semantic MEDLINE | |
|---|---|---|---|---|---|
| Algorithm | Ranking algorithm | PageRank, ant-colony | k-step Markov, PageRank, HITS | Gene-disease score | Not mention |
| Iterative ranking | yes | yes | no | no | |
| Evaluation | Statistical model | yes | yes | no | no |
| Data and quality | Relationship quality control | yes | no | yes (factor control) | no |
| Network extension with IOA evaluation | yes | yes (neighbor distance) | no | no | |
| Biomedical term | yes | no | yes (disease) | yes | |
| Visualization | Network with provenance data | yes | yes (provide networks) | yes | yes |
| Grouping annotation | yes | no | no | no | |
Grouping annotation: The node color visualization using the annotation, e.g the genes annotated in different pathways can be visualized in the network using different colors.
The network quality control using different PPI cutoffs
| Expanded | Yes (0.9) | No | ||
|---|---|---|---|---|
| PPI Cutoff | 0.9 | 0.45 | 0.75 | 0.9 |
| IOA | 0.99 (1962/1984) | 0.76 (130/172) | 0.64 (110/172) | 0.52 (90/172) |
| SCN | 0.87 (150/172) | 0.78 (134/172) | 0.68 (117/172) | 0.59 (102/172) |
| Interaction | 6833 | 543 | 326 | 200 |
IOA: Index of Aggregation, SCN: Seed's Candidate Coverage in Network
The top-10 ranked seed genes in expanded network compared to the ranks in non-expanded networks
| 5-star+Exp. | 3-star | 4-star | 5-star | |||||
|---|---|---|---|---|---|---|---|---|
| Seed Gene | Rank |
| Rank |
| Rank |
| Rank |
|
| TP53 | 1 | 0.00053 | 2 | 0.015 | 2 | 0.017 | 4 | 0.0028 |
| EGFR | 2 | 0.0011 | 4 | 0.03 | 4 | 0.034 | 5 | 0.0032 |
| PIK3R1 | 3 | 0.0016 | 5 | 0.037 | 3 | 0.026 | 2 | 0.0012 |
| AKT1 | 4 | 0.0021 | 3 | 0.022 | 6 | 0.051 | 8 | 0.0085 |
| CTNNB1 | 5 | 0.0027 | 1 | 0.0075 | 1 | 0.0085 | 1 | 0.00096 |
| PIK3CA | 6 | 0.0032 | 9 | 0.067 | 5 | 0.043 | 3 | 0.0027 |
| PLK1 | 7 | 0.0037 | 31 | 0.23 | 21 | 0.18 | 12 | 0.036 |
| RAC1 | 8 | 0.0043 | 11 | 0.082 | 10 | 0.085 | 6 | 0.0053 |
| RB1 | 9 | 0.0048 | 10 | 0.075 | 9 | 0.077 | 7 | 0.0065 |
| CCND1 | 10 | 0.0053 | 7 | 0.052 | 7 | 0.06 | 11 | 0.019 |
Figure 1.The pipeline overview of the conjunction analysis in glioblastoma (GBM) genetic candidate’s discovery. In the first part, BEERE offers a ranked order list of critical seed and expanded genes with statistical significance using the expanded network analysis. (A) The input is the 200 genetic candidate genes downloaded from OMIM databases. (B) In BEERE quality control, BEERE automatically maps the queried genes to HAPPI database gene symbols. By verifying the 172 genes matched to HAPPI database, BEERE returns 6833 PPIs passing the PPI cutoff. The network quality is good that Index of Aggregation = 0.99 and Seed’s Coverage in Network = 0.87. (C) BEERE generates the gene ranks. About 87 seed genes are statistically significant and 14 expanded genes are statistically significant. In the second part, BEERE reveals the critical mechanisms using comprehensive term mapping, heterogeneous network analysis, and term ranking. (D) About 28 biomedical entities using genes, aliases and disease terms are the input of the network meta-analysis. (E) The term ranking score distribution and term rank word-cloud intuitively show the important entities such as epidermal growth factors, amyloid genes, ubiquitin genes and tyrosine genes are tightly related to glioblastoma. The provenance of the gene to glioblastoma relationship is displayed on the selected edge such as APP affects glioblastoma with one literature support. The PMID and a link to outsource are displayed by clicking the entry with the detail of the relationships.
The expanded genes validation using the PubMed article term-to-term co-citations and network semantic relationship validation
| Gene | Search term | BEERE Top rank |
| ToppGene rank | ToppGene Normalized rank | PubMed Initial count | PubMed Extended count | Network validation | Literatures | PMID |
|---|---|---|---|---|---|---|---|---|---|---|
| UBC | UBC or ubiquitin or ubiquitin C | 1 (33) | 0.017 | 116 | 32 | 0 | 137 | Augments | 1 | 27766591 |
| APP | APP or amyloid or amyloid beta | 2 (54) | 0.028 | 1 | 1 | 23 | 61 | Affects | 1 | 15302999 |
| MYC | MYC or c-myc or myc proto-oncogene | 3 (69) | 0.035 | 17 | 5 | 300 | 300 | Augments | 1 | 26993778 |
| HDAC1 | HDAC1 or Histone decacetylase | 4 (74) | 0.038 | 46 | 13 | 12 | 12 | Augments | 1 | 27766591 |
| SUMO1 | SUMO1 or ubiquitin | 5 (75) | 0.038 | 373 | 102 | 3 | 138 | Augments | 1 | 27766591 |
| SRC | SRC or Tyrosine-Protein Kinase | 6 (76) | 0.039 | 31 | 9 | 165 | 167 | Affects | 6 | 3146045|15994925|15618223|20947248|19098899|25048528 |
| ABL1 | ABL1 or Tyrosine-Protein Kinase | 7 (83) | 0.042 | 142 | 39 | 4 | 10 | Augments | 1 | 23383209 |
| FYN | FYN or Tyrosine-protein kinase | 8 (85) | 0.043 | 93 | 26 | 17 | 23 | Affects | 1 | 15994925 |
| PCNA | PCNA or Proliferating Cell Nuclear Antigen | 9 (87) | 0.044 | 166 | 45 | 93 | 108 | Indirectly affect | - | - |
| EP300 | EP300 or E1A Binding Protein P300 | 10 (88) | 0.045 | 52 | 15 | 3 | 5 | Affects | 2 | 21489305|26722247 |
| GRB2 | GRB2 or Growth Factor Receptor Bound Protein | 11 (90) | 0.046 | 22 | 6 | 14 | 14 | Indirectly affect | - | - |
| CREBBP | CREBBP or CREB Binding Protein | 12 (92) | 0.047 | 83 | 23 | 1 | 4 | Indirectly affect | - | - |
| EGF | EGF or Epidermal Growth Factor | 13 (93) | 0.047 | 5146 | 1395 | 384 | 1419 | Produces | 1 | 3011820 |
| ESR1 | ESR1 or Estrogen Receptor 1 | 14 (96) | 0.049 | 34 | 10 | 5 | 6 | Produces | 1 | 20841389 |