| Literature DB >> 19074486 |
Raul Rodriguez-Esteban1, Phoebe M Roberts, Matthew E Crawford.
Abstract
Molecular perturbations provide a powerful toolset for biomedical researchers to scrutinize the contributions of individual molecules in biological systems. Perturbations qualify the context of experimental results and, despite their diversity, share properties in different dimensions in ways that can be formalized. We propose a formal framework to describe and classify perturbations that allows accumulation of knowledge in order to inform the process of biomedical scientific experimentation and target analysis. We apply this framework to develop a novel algorithm for automatic detection and characterization of perturbations in text and show its relevance in the study of gene-phenotype associations and protein-protein interactions in diabetes and cancer. Analyzing perturbations introduces a novel view of the multivariate landscape of biological systems.Entities:
Mesh:
Year: 2008 PMID: 19074486 PMCID: PMC2647287 DOI: 10.1093/nar/gkn986
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Examples of relevance and direction annotation
| Relevance | Direction | Sentence | Explanation | PMID |
|---|---|---|---|---|
| Irrelevant | Different members of bcl-2 family may promote or inhibit apoptosis by synthesizing anti- and proapoptotic proteins | Members of a gene family, not a gene, are causing apoptosis phenotype | 17404013 | |
| Irrelevant | The ATF6 and IRE1 pathways cooperatively caused apoptosis via induction of CHOP, activation of XBP1 and phosphorylation of JNK, and the PERK-eIF2α pathway counteracted the proapoptotic processes | Pathways, not genes, are causing apoptosis phenotype | 17464326 | |
| Irrelevant | This result suggests that toxic products such as reactive oxygen species and aldehydes liberated by the action of polyamine oxidase on the acetylated polyamines formed by SSAT may enhance tumor development | Metabolites, not proteins that generate them, are causing tumor development phenotype | 17675337 | |
| Relevant | Unknown | These results indicate that survivin is important for optimal development of bovine blastocysts and confirm that survivin expression suppresses apoptosis of pre-implantation embryos | ‘Survivin expression’ does not indicate if there is a state change in amount | 17075833 |
| Relevant | Increase | Finally, we showed that after 7–9 days of incubation, MCH also inhibits proliferation of non-stimulated PBMC | ‘Incubation’ indicates MCH was added exogenously | 17537530 |
| Relevant | Increase | Taken together, LDM induces apoptosis in a p53-dependent manner when given at low doses, but in a p53-independent manner when given at high doses | ‘Given at low doses’ indicates LDM was added exogenously | 17534142 |
| Relevant | Decrease | Soluble Flt-1 abolished hypoxia/VEGF-induced hyperpermeability | Soluble forms of receptors create inactive complexes | 17311300 |
Perturbation extraction performance
| Count | Precision (%) | Recall (%) | ||
|---|---|---|---|---|
| Perturbation | 690 | 79.2 | 79.8 | 79.4 |
| No ontology | 76.4 | 79.6 | 77.6 | |
| Only ontology | 77.2 | 59.5 | 67.0 | |
| Straightforward | 83.3 | 78.9 | 81.0 | |
| Increased | 436 | 75.3 | 71.1 | 72.9 |
| No ontology | 76.6 | 69.9 | 72.9 | |
| Only ontology | 65.7 | 50.9 | 56.8 | |
| Straightforward | 77.7 | 68.8 | 72.7 | |
| Decreased | 254 | 79.0 | 65.1 | 71.2 |
| No ontology | 78.5 | 63.6 | 70.0 | |
| Only ontology | 79.8 | 58.4 | 66.4 | |
| Straightforward | 80.4 | 67.2 | 72.9 |
Precision, recall and macro-averaged F-measure were calculated using four different combinations of feature sets: ‘full’, ‘no ontology’, ‘only ontology’ and ‘straightforward’. Baseline frequencies were 32.8% for increased perturbations and 18.1% for decreased perturbations.
Ontology occurrence count in sentences from the test corpus, separated by diabetes and cancer phenotypes
| Gene and protein perturbations | Cancer | Diabetes | Cancer | Diabetes | |
|---|---|---|---|---|---|
| Total increasing modifications | 266 | 586 | Total decreasing modifications | 267 | 110 |
| General increasing modifications | General decreasing modifications | ||||
| Activation | 66 | 12 | Down-regulated, -ion | 12 | 0 |
| Administered, -ion | 7 | 120 | Inactivation | 5 | 0 |
| Dose, -age, -dependent | 11 | 86 | Inhibition | 30 | 7 |
| Ectopic | 7 | 0 | Repression | 1 | 0 |
| Enhanced … expression | 10 | 0 | Suppression | 10 | 3 |
| Exogenous | 6 | 23 | DNA decreasing modifications | ||
| i.c.v. | 0 | 11 | Deficiency, -ent | 15 | 12 |
| i.p. | 0 | 7 | Deletion | 3 | 2 |
| Increased | 26 | 18 | Dominant-negative | 3 | 0 |
| Induction | 8 | 2 | Knockout | 1 | 2 |
| Infused, -ion | 0 | 35 | Loss | 12 | 13 |
| Injected, -ion | 7 | 66 | Mutant | 0 | 0 |
| Intracerebroventricular | 0 | 51 | Mutated, -ion | 0 | 3 |
| Intraperitoneal | 1 | 16 | mRNA decreasing modifications | ||
| mg/kg,/kg | 0 | 8 | Anti(-)sense | 9 | 3 |
| Oral | 6 | 4 | Interference | 5 | 1 |
| Overexpressed, -ration | 38 | 0 | Interfering RNA | 2 | 0 |
| Peripherally | 0 | 7 | Knockdown | 12 | 0 |
| Recombinant | 7 | 10 | RNA interference | 4 | 0 |
| Restoration | 2 | 0 | RNAi | 2 | 0 |
| Subcutaneous | 0 | 2 | Short hairpin RNA | 1 | 0 |
| Systemic | 1 | 9 | Silenced | 1 | 0 |
| Treatment | 24 | 22 | siRNA | 15 | 0 |
| Up-regulation | 8 | 0 | Protein decreasing modifications | ||
| DNA increasing modifications | Antagonist | 7 | 28 | ||
| Adenoviral | 2 | 1 | Anti- | 25 | 4 |
| Adenovirus | 0 | 0 | Antibody, -ies | 11 | 3 |
| Gain-of-function | 2 | 0 | Blockade, -ing | 12 | 8 |
| Gene delivery | 1 | 3 | Deactivation | 1 | 0 |
| Transgenic | 3 | 2 | Decoy | 2 | 0 |
| mRNA increasing modifications | Fc- | 0 | 0 | ||
| Inducible | 2 | 0 | Inhibitor | 53 | 15 |
| Protein increasing modifications | Inverse agonist | 0 | 1 | ||
| Activator | 10 | 0 | mAbs | 1 | 0 |
| Agonist | 9 | 54 | Neutralization | 1 | 0 |
| Analog/analogue | 2 | 7 | Reduced … activity | 1 | 5 |
| Soluble | 4 | 0 | |||
| Targeting | 6 | 0 |
Note that total increased and decreased perturbations exceeds values in Table 2 since multiple perturbation terms may appear in a single sentence.
Figure 1.Perturbations extracted from genes involved in the Diabetes Mellitus Type II pathway in the Kyoto Encyclopedia of Genes and Genomes (21). Differences in perturbation direction are large for some members of the pathway, note that the scale bar is logarithmic.
Proposed annotation guidelines
| Relevance | Direction | Molecule | Effect |
|---|---|---|---|
| Relevant | Increase | Gene | Activity |
| Irrelevant | Decrease | RNA | Abundance |
| Unknown | Protein expression |