| Literature DB >> 24865352 |
Kathrin Poos1, Jan Smida2, Michaela Nathrath2, Doris Maugg2, Daniel Baumhoer2, Anna Neumann1, Eberhard Korsching3.
Abstract
Osteosarcoma (OS) is the most common primary bone cancer exhibiting high genomic instability. This genomic instability affects multiple genes and microRNAs to a varying extent depending on patient and tumor subtype. Massive research is ongoing to identify genes including their gene products and microRNAs that correlate with disease progression and might be used as biomarkers for OS. However, the genomic complexity hampers the identification of reliable biomarkers. Up to now, clinico-pathological factors are the key determinants to guide prognosis and therapeutic treatments. Each day, new studies about OS are published and complicate the acquisition of information to support biomarker discovery and therapeutic improvements. Thus, it is necessary to provide a structured and annotated view on the current OS knowledge that is quick and easily accessible to researchers of the field. Therefore, we developed a publicly available database and Web interface that serves as resource for OS-associated genes and microRNAs. Genes and microRNAs were collected using an automated dictionary-based gene recognition procedure followed by manual review and annotation by experts of the field. In total, 911 genes and 81 microRNAs related to 1331 PubMed abstracts were collected (last update: 29 October 2013). Users can evaluate genes and microRNAs according to their potential prognostic and therapeutic impact, the experimental procedures, the sample types, the biological contexts and microRNA target gene interactions. Additionally, a pathway enrichment analysis of the collected genes highlights different aspects of OS progression. OS requires pathways commonly deregulated in cancer but also features OS-specific alterations like deregulated osteoclast differentiation. To our knowledge, this is the first effort of an OS database containing manual reviewed and annotated up-to-date OS knowledge. It might be a useful resource especially for the bone tumor research community, as specific information about genes or microRNAs is quick and easily accessible. Hence, this platform can support the ongoing OS research and biomarker discovery. Database URL: http://osteosarcoma-db.uni-muenster.de.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24865352 PMCID: PMC4034345 DOI: 10.1093/database/bau042
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Database construction pipeline. The database construction is performed in three major steps: automated dictionary-based literature mining, data review and annotation by reviewers and external data sources and data storage in a MySQL relational database with Web interface. The whole pipeline is based on PubMed-derived abstracts related to OS research.
Figure 2.Screenshot of the CDKN1A results page. The database screenshots show the main results page of a gene search and the corresponding MTI network using the example of CDKN1A. (1) The search menu enables the user to search for a gene or microRNA query. (2) Submitting the query delivers the results page for the specific query that shows general information derived from external databases and abstracts associated with the query. (3) The table of abstracts can be browsed using pagination buttons and (4) filtered according to type of samples, potential prognostic and/or therapeutic value or text search within the titles. (5) To receive more manual annotations like experimental settings, biological context and information about the abstracts, an export button is provided. (6 + 7) The MTI network visually illustrates the possible regulatory relationships of the user’s query. A detailed description of the prediction results is given in the table below. (8) Again, users are able to export the table and receive additional information like UTR coordinates and so on.
KEGG pathway enrichment analysis
| ID | KEGG pathway | Number of genes | Number of genes in pathway | FDR | |
|---|---|---|---|---|---|
| hsa05200 | Pathways in cancer | 158 | 327 | 5.74 × 10–48 | 1.11 × 10–45 |
| hsa05215 | Prostate cancer | 59 | 89 | 8.83 × 10–28 | 8.57 × 10–26 |
| hsa05219 | Bladder cancer | 33 | 42 | 9.62 × 10–20 | 6.22 × 10–18 |
| hsa05212 | Pancreatic cancer | 44 | 70 | 1.30 × 10–19 | 6.31 × 10–18 |
| hsa04510 | Focal adhesion | 82 | 200 | 4.58 × 10–19 | 1.78 × 10–17 |
| hsa05222 | Small-cell lung cancer | 46 | 85 | 8.39 × 10–17 | 2.62 × 10–15 |
| hsa05220 | Chronic myeloid leukemia | 42 | 73 | 9.45 × 10–17 | 2.62 × 10–15 |
| hsa05210 | Colorectal cancer | 38 | 62 | 1.46 × 10–16 | 3.55 × 10–15 |
| hsa04110 | Cell cycle | 58 | 128 | 3.74 × 10–16 | 8.07 × 10–15 |
| hsa04350 | TGF-beta signaling pathway | 44 | 85 | 3.55 × 10–15 | 6.89 × 10–14 |
| hsa05223 | Non-small-cell lung cancer | 33 | 54 | 1.72 × 10–14 | 3.04 × 10–13 |
| hsa04115 | p53 signaling pathway | 38 | 69 | 2.01 × 10–14 | 3.25 × 10–13 |
| hsa04210 | Apoptosis | 44 | 89 | 3.12 × 10–14 | 4.66 × 10–13 |
| hsa05214 | Glioma | 36 | 65 | 7.85 × 10–14 | 1.09 × 10–12 |
| hsa05213 | Endometrial cancer | 31 | 52 | 2.86 × 10–13 | 3.70 × 10–12 |
| hsa05218 | Melanoma | 37 | 71 | 4.46 × 10–13 | 5.41 × 10–12 |
| hsa05142 | Chagas’ disease (American trypanosomiasis) | 46 | 104 | 1.37 × 10–11 | 1.57 × 10–11 |
| hsa05221 | Acute myeloid leukemia | 31 | 58 | 1.53 × 10–11 | 1.65 × 10–10 |
| hsa04380 | Osteoclast differentiation | 50 | 128 | 4.04 × 10–11 | 4.12 × 10–10 |
| hsa04012 | ErbB signaling pathway | 39 | 87 | 4.39 × 10–11 | 4.26 × 10–10 |
The table shows the results of the hypergeometric test of KEGG pathways.
aFDR, false discovery rate.
Most frequent genes and microRNAs with potential therapeutic/prognostic impact
| ID | Symbol/Name | Number of abstracts |
|---|---|---|
| 7157 | TP53 | 26 |
| 7422 | VEGFA | 24 |
| 5243 | ABCB1 | 20 |
| 2064 | ERBB2 | 14 |
| 4193 | MDM2 | 14 |
| 5925 | RB1 | 14 |
| 7430 | EZR | 12 |
| 249 | ALPL | 9 |
| 1029 | CDKN2A | 9 |
| 632 | BGLAP | 8 |
| 1019 | CDK4 | 8 |
| 4609 | MYC | 7 |
| 6678 | SPARC | 7 |
| 595 | CCND1 | 6 |
| 4313 | MMP2 | 6 |
| 4318 | MMP9 | 6 |
| 5743 | PTGS2 | 6 |
| 1956 | EGFR | 5 |
| 2353 | FOS | 5 |
| 3939 | LDHA | 5 |
| 4233 | MET | 5 |
| 4288 | MKI67 | 5 |
| MIMAT0000076 | hsa-miR-21-5p | 2 |
| MIMAT0000092 | hsa-miR-92a-3p | 1 |
| MIMAT0000232 | hsa-miR-199a-3p | 1 |
| MIMAT0000267 | hsa-miR-210-3p | 1 |
| MIMAT0000426 | hsa-miR-132-3p | 1 |
| MIMAT0000435 | hsa-miR-143-3p | 1 |
| MIMAT0000447 | hsa-miR-134-5p | 1 |
| MIMAT0000459 | hsa-miR-193a-3p | 1 |
| MIMAT0000686 | hsa-miR-34c-5p | 1 |
| MIMAT0000689 | hsa-miR-99b-5p | 1 |
| MIMAT0000737 | hsa-miR-382-5p | 1 |
| MIMAT0001339 | hsa-miR-422a | 1 |
| MIMAT0004676 | hsa-miR-34b-3p | 1 |
The table lists the number of OS-related abstracts of the most frequently mentioned genes and microRNAs associated with any possible prognostic or therapeutic value. The ID column lists Entrez geneids for genes and miRBase accessions for microRNAs.
amiR-34 family.
Top OS-related microRNAs
| ID | Name | MTI |
|---|---|---|
| MIMAT0000255 | hsa-miR-34a-5p | 139 |
| MIMAT0000686 | hsa-miR-34c-5p | 138 |
| MIMAT0000271 | hsa-miR-214-3p | 128 |
| MIMAT0000430 | hsa-miR-138-5p | 127 |
| MIMAT0000080 | hsa-miR-24-3p | 126 |
| MIMAT0000068 | hsa-miR-15a-5p | 122 |
| MIMAT0000417 | hsa-miR-15b-5p | 121 |
| MIMAT0000100 | hsa-miR-29b-3p | 119 |
| MIMAT0002820 | hsa-miR-497-5p | 118 |
| MIMAT0000084 | hsa-miR-27a-3p | 117 |
| MIMAT0000086 | hsa-miR-29a-3p | 117 |
| MIMAT0000461 | hsa-miR-195-5p | 117 |
| MIMAT0000069 | hsa-miR-16-5p | 116 |
| MIMAT0000763 | hsa-miR-338-3p | 116 |
| MIMAT0000231 | hsa-miR-199a-5p | 110 |
| MIMAT0000423 | hsa-miR-125b-5p | 106 |
| MIMAT0000261 | hsa-miR-183-5p | 100 |
| MIMAT0000691 | hsa-miR-130b-3p | 100 |
The table illustrates the microRNAs regulating most of the genes in the Osteosarcoma Database. All microRNAs regulating ≥100 targets are denoted. The ID column lists miRBase accessions for mature microRNAs.
aMTI, microRNA–target gene interaction.
bmiR-34 family.
cmiR-15 family.
dmiR-29 family.