| Literature DB >> 18549479 |
Rob Jelier1, Martijn J Schuemie, Antoine Veldhoven, Lambert C J Dorssers, Guido Jenster, Jan A Kors.
Abstract
Anni 2.0 is an online tool (http://biosemantics.org/anni/) to aid the biomedical researcher with a broad range of information needs. Anni provides an ontology-based interface to MEDLINE and retrieves documents and associations for several classes of biomedical concepts, including genes, drugs and diseases, with established text-mining technology. In this article we illustrate Anni's usability by applying the tool to two use cases: interpretation of a set of differentially expressed genes, and literature-based knowledge discovery.Entities:
Mesh:
Year: 2008 PMID: 18549479 PMCID: PMC2481428 DOI: 10.1186/gb-2008-9-6-r96
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1The technology behind Anni at a glance. Yellow balls indicate ontology concepts.
Figure 2Screenshot of Anni. An MDS projection is shown of a test set of 47 genes, organized in 5 groups through a shared commonality (see legend and [17,18]). In the Explorer tab to the left, concept sets are organized in a tree. The toolbar on top provides concept set options and shows the current filter for matching concept profiles. The shown MDS view on a concept set can be used to get an overview of associations between the concepts, as used, for instance, in [22]. Groups of nodes can be selected and the similarities between their concept profiles analyzed in the annotation view. Nodes are colored based on user-defined features.
Figure 3Clustering and annotation of differentially expressed genes. (a) The clustering of genes up-regulated in prostate metastases. The clustering is based on the similarity of the concept profiles of the genes. (b) A fragment of the annotation for cluster A. The annotation view displays for a cluster a group cohesion score with a p-value, and a list of concepts with their percentage contribution to the score. In addition, the weights of the concepts in the concept profiles are shown.
A selection of identified relevant clusters in the set of differentially expressed genes between metastatic and localized prostate cancer
| Cluster | Number of genes | Descriptive concepts |
| Up-regulated | ||
| A | 24 | Kinetochores; mitosis; anaphase-promoting complex |
| B | 5 | Nuclear proteins; tumor markers, biological |
| C | 3 | Unfolded protein response |
| Down-regulated | ||
| A | 7 | Complement system proteins |
| B | 7 | Calponin; smooth muscle myosins |
| C | 4 | Myosin phosphatase; smooth muscle (tissue) |
| D | 7 | Extracellular matrix proteins |
| E | 5 | Transcription factor; proto-oncogene proteins c-fos |
| F | 4 | Cyclin-dependent kinases |
| G | 3 | Melanosomes; membrane protein traffic; exocytosis |
| H | 5 | Membrane transport proteins; symporter |
The most descriptive concepts are shown as given by the Anni annotation view. The two left-most columns depict how many genes in the cluster were either up- or down-regulated in metastasized prostate cancer.
Final ranking of diseases for use case 2
| Rank | Disease name | Score |
| 1 | Leishmaniasis | 0.002417946 |
| 2 | 5.68E-04 | |
| 3 | Extrinsic asthma | 5.44E-04 |
| 4 | Listeriosis | 4.88E-04 |
| 5 | HTLV-I infections | 3.44E-04 |
| 6 | Hepatitis C, chronic | 3.43E-04 |
| 7 | Tropical spastic paraparesis | 3.17E-04 |
| 8 | Epstein-Barr virus infections | 2.73E-04 |
| 9 | Hepatitis B, chronic | 2.38E-04 |
| 10 | Filarial elephantiases | 2.38E-04 |
Final ranking and scores for the query for "IL-12", "IL-10", "Th1 cells", "Th2 cells" and "peripheral mononuclear cells" on the concept set "Diseases or Syndromes".