Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A workflow for mutation extraction and structure annotation.

Literature DB >> 18172931

A workflow for mutation extraction and structure annotation.

Rajaraman Kanagasabai¹, Khar Heng Choo, Shoba Ranganathan, Christopher J O Baker.

Abstract

Rich information on point mutation studies is scattered across heterogeneous data sources. This paper presents an automated workflow for mining mutation annotations from full-text biomedical literature using natural language processing (NLP) techniques as well as for their subsequent reuse in protein structure annotation and visualization. This system, called mSTRAP (Mutation extraction and STRucture Annotation Pipeline), is designed for both information aggregation and subsequent brokerage of the mutation annotations. It facilitates the coordination of semantically related information from a series of text mining and sequence analysis steps into a formal OWL-DL ontology. The ontology is designed to support application-specific data management of sequence, structure, and literature annotations that are populated as instances of object and data type properties. mSTRAPviz is a subsystem that facilitates the brokerage of structure information and the associated mutations for visualization. For mutated sequences without any corresponding structure available in the Protein Data Bank (PDB), an automated pipeline for homology modeling is developed to generate the theoretical model. With mSTRAP, we demonstrate a workable system that can facilitate automation of the workflow for the retrieval, extraction, processing, and visualization of mutation annotations -- tasks which are well known to be tedious, time-consuming, complex, and error-prone. The ontology and visualization tool are available at (http://datam.i2r.a-star.edu.sg/mstrap).

Entities: Gene

Mesh：

Substances：
Proteins

Year: 2007 PMID： 18172931 DOI： 10.1142/s0219720007003119

Source DB: PubMed Journal: J Bioinform Comput Biol ISSN： 0219-7200 Impact factor: 1.122

Keyword Cloud
Cited

16 in total

1. An examination of the OMIM database for associating mutation to a consensus reference sequence.

Authors: Zuofeng Li; Beili Ying; Xingnan Liu; Xiaoyan Zhang; Hong Yu
Journal: Protein Cell Date: 2012-04-04 Impact factor: 14.870

2. tmVar: a text mining approach for extracting sequence variants in biomedical literature.

Authors: Chih-Hsuan Wei; Bethany R Harris; Hung-Yu Kao; Zhiyong Lu
Journal: Bioinformatics Date: 2013-04-05 Impact factor: 6.937

3. ResidueFinder: extracting individual residue mentions from protein literature.

Authors: Ton E Becker; Eric Jakobsson
Journal: J Biomed Semantics Date: 2021-07-21

4. Improved mutation tagging with gene identifiers applied to membrane protein stability prediction.

Authors: Rainer Winnenburg; Conrad Plake; Michael Schroeder
Journal: BMC Bioinformatics Date: 2009-08-27 Impact factor: 3.169

5. Getting started in text mining: part two.

Authors: Andrey Rzhetsky; Michael Seringhaus; Mark B Gerstein
Journal: PLoS Comput Biol Date: 2009-07-31 Impact factor: 4.475

6. Extraction of human kinase mutations from literature, databases and genotyping studies.

Authors: Martin Krallinger; Jose M G Izarzugaza; Carlos Rodriguez-Penagos; Alfonso Valencia
Journal: BMC Bioinformatics Date: 2009-08-27 Impact factor: 3.169

7. Annotation of protein residues based on a literature analysis: cross-validation against UniProtKb.

Authors: Kevin Nagel; Antonio Jimeno-Yepes; Dietrich Rebholz-Schuhmann
Journal: BMC Bioinformatics Date: 2009-08-27 Impact factor: 3.169

8. Between proteins and phenotypes: annotation and interpretation of mutations.

Authors: Christopher J O Baker; Dietrich Rebholz-Schuhmann
Journal: BMC Bioinformatics Date: 2009-08-27 Impact factor: 3.169

9. EnzyMiner: automatic identification of protein level mutations and their impact on target enzymes from PubMed abstracts.

Authors: Süveyda Yeniterzi; Ugur Sezerman
Journal: BMC Bioinformatics Date: 2009-08-27 Impact factor: 3.169

10. A comprehensive assessment of N-terminal signal peptides prediction methods.

Authors: Khar Heng Choo; Tin Wee Tan; Shoba Ranganathan
Journal: BMC Bioinformatics Date: 2009-12-03 Impact factor: 3.169