| Literature DB >> 23951102 |
Salvatore Loguercio1, Benjamin M Good, Andrew I Su.
Abstract
Structured gene annotations are a foundation upon which many bioinformatics and statistical analyses are built. However the structured annotations available in public databases are a sparse representation of biological knowledge as a whole. The rate of biomedical data generation is such that centralized biocuration efforts struggle to keep up. New models for gene annotation need to be explored that expand the pace at which we are able to structure biomedical knowledge. Recently, online games have emerged as an effective way to recruit, engage and organize large numbers of volunteers to help address difficult biological challenges. For example, games have been successfully developed for protein folding (Foldit), multiple sequence alignment (Phylo) and RNA structure design (EteRNA). Here we present Dizeez, a simple online game built with the purpose of structuring knowledge of gene-disease associations. Preliminary results from game play online and at scientific conferences suggest that Dizeez is producing valid gene-disease annotations not yet present in any public database. These early results provide a basic proof of principle that online games can be successfully applied to the challenge of gene annotation. Dizeez is available at http://genegames.org.Entities:
Mesh:
Year: 2013 PMID: 23951102 PMCID: PMC3737187 DOI: 10.1371/journal.pone.0071171
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Dizeez - main game interface.
Figure 2Dizeez - game review interface.
Gene-disease associations provided seven or more times in Dizeez.
| # Votes | Gene Symbol | Gene Name | Disease | OMIM | PharmGKB | DGA | PubMed (PMID) |
| 11 | NBPF3 | neuroblastoma breakpoint family, 3 | neuroblastoma | No | No | No | 19536264, 18493581 |
| 11 | SOX8 | SRY (sex determining region Y)-box 8 | mental retardation | No | No | No | 18076105, 10684944 |
| 9 | ABL1 | c-abl oncogene 1, non-receptor tyrosine kinase | leukemia | No | Yes | Yes | 3313010, 6308652 |
| 9 | SSX1 | Synovial sarcoma, X breakpoint 1 | synovial sarcoma | No | No | Yes | 12037676, 12696068 |
| 8 | APC | Adenomatous polyposis coli | colorectal cancer | Yes | Yes | Yes | 10737795, 2188735 |
| 8 | FES | Feline sarcoma oncogene | sarcoma | No | No | Yes | — |
| 8 | RBP3 | Retinol binding protein 3, interstitial | retinoblastoma | No | No | No | — |
| 8 | GAST | Gastrin | gastrinoma | No | No | No | 7439637, 5648596 |
| 8 | DCC | Deleted in colorectal carcinoma | colorectal cancer | No | No | Yes | 22876889, 22920895 |
| 8 | MAP3K5 | mitogen-activated protein kinase kinase kinase 5 | Cancer | No | No | Yes | 22197930, 22723553 |
| 7 | RB1 | retinoblastoma 1 | retinoblastoma | Yes | No | Yes | 2877398, 3823889 |
| 7 | RET | ret proto-oncogene | Cancer | No | Yes | Yes | 23170308, 23150706 |
| 7 | MLL3 | myeloid/lymphoid or mixed-lineage leukemia 3 | leukemia | No | No | No | — |
| 7 | BACE2 | beta-site APP-cleaving enzyme 2 | Alzheimer's disease | No | No | Yes | 22074738, 22044119 |
| 7 | GTF2I | general transcription factor IIi | developmental disorder | No | No | No | 19897463, 20956978 |
| 7 | MFI2 | antigen p97 (melanoma associated) | melanoma | No | No | Yes | 20935638 |
| 7 | KRAS | v-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog | colorectal cancer | No | Yes | Yes | 23188063, 23182985 |
Figure 3Number of Gene-Disease assertions vs. number of votes, for real- and random gameplay.
The vertical axis represents the number of associations collected during game play (log scale). Red line: real gameplay. Grey bars: mean number of associations after 100 randomizations, with associated standard deviation. ‘7+’ indicates the sum of associations collected with a number of votes equal or greater than 7.
Gene-disease associations provided four or more times in Dizeez and not found in Gene Wiki.
| # Votes | Gene Symbol | Gene Name | Disease | OMIM | PharmGKB | DGA | PubMed (PMID) |
| 6 | HTT | huntingtin | Alzheimer's disease | No | No | No | – |
| 5 | BCL2 | B-cell CLL/lymphoma 2 | leukemia | No | No | Yes | 23118966, 23114648 |
| 5 | MECOM | MDS1 and EVI1 complex locus | sarcoma | No | No | No | 18206536 |
| 5 | PRDM2 | PR domain containing 2 | neuroblastoma | No | No | No | 20878080, 18819740 |
| 4 | AVPR1A | arginine vasopressin receptor 1A | Alzheimer's disease | No | No | No | 21115064 |
| 4 | ATF7 | activating transcription factor 7 | Cancer | No | No | No | 22260696, 17309674 |
Figure 4Concordance between Dizeez-mined associations and Disease and Gene Annotations database.
The ‘concordance ratio’ on the vertical axis is the ratio between the associations supported by DGA and the total number of associations for a given number of votes. ‘7+’ indicates the sum of associations collected with a number of votes between 7 and 11.