Literature DB >> 18172926

Retrieving mutation-specific information for human proteins in UniProt/Swiss-Prot Knowledgebase.

Yum Lina Yip1, Nathalie Lachenal, Violaine Pillet, Anne-Lise Veuthey.   

Abstract

The UniProt/Swiss-Prot Knowledgebase records about 30,500 variants in 5,664 proteins (Release 52.2). Most of these variants are manually curated single amino acid polymorphisms (SAPs) with references to the literature. In order to keep the list of published documents related to SAPs up to date, an automatic information retrieval method is developed to recover texts mentioning SAPs. The method is based on the use of regular expressions (patterns) and rules for the detection and validation of mutations. When evaluated using a corpus of 9,820 PubMed references, the precision of the retrieval was determined to be 89.5% over all variants. It was also found that the use of nonstandard mutation nomenclature and sequence positional correction is necessary to retrieve a significant number of relevant articles. The method was applied to the 5,664 proteins with variants. This was performed by first submitting a PubMed query to retrieve articles using gene or protein names and a list of mutation-related keywords; the SAP detection procedure was then used to recover relevant documents. The method was found to be efficient in retrieving new references on known polymorphisms. New references on known SAPs will be rendered accessible to the public via the Swiss-Prot variant pages.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 18172926     DOI: 10.1142/s021972000700320x

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  18 in total

1.  Improved mutation tagging with gene identifiers applied to membrane protein stability prediction.

Authors:  Rainer Winnenburg; Conrad Plake; Michael Schroeder
Journal:  BMC Bioinformatics       Date:  2009-08-27       Impact factor: 3.169

2.  Refinement of coding SNPs in the human aryl hydrocarbon receptor gene using ISNPranker: An integrative-SNP ranking web-tool.

Authors:  Younes Aftabi; Saleh Rafei; Habib Zarredar; Amir Amiri-Sadeghan; Mohsen Akbari-Shahpar; Zahra Khoshkam; Ensiyeh Seyedrezazadeh; Majid Khalili; Faramarz Mehrnejad; Sasan Fereidouni; B Paige Lawrence
Journal:  Comput Biol Chem       Date:  2020-11-17       Impact factor: 2.877

3.  Application of text-mining for updating protein post-translational modification annotation in UniProtKB.

Authors:  Anne-Lise Veuthey; Alan Bridge; Julien Gobeill; Patrick Ruch; Johanna R McEntyre; Lydie Bougueleret; Ioannis Xenarios
Journal:  BMC Bioinformatics       Date:  2013-03-22       Impact factor: 3.169

4.  Prospects for the automated extraction of mutation data from the scientific literature.

Authors:  Peter D Stenson; David N Cooper
Journal:  Hum Genomics       Date:  2010-10       Impact factor: 4.639

5.  Interpretation of the consequences of mutations in protein kinases: combined use of bioinformatics and text mining.

Authors:  Jose M G Izarzugaza; Martin Krallinger; Alfonso Valencia
Journal:  Front Physiol       Date:  2012-08-22       Impact factor: 4.566

6.  Challenges in the association of human single nucleotide polymorphism mentions with unique database identifiers.

Authors:  Philippe E Thomas; Roman Klinger; Laura I Furlong; Martin Hofmann-Apitius; Christoph M Friedrich
Journal:  BMC Bioinformatics       Date:  2011-07-05       Impact factor: 3.169

7.  Extraction of human kinase mutations from literature, databases and genotyping studies.

Authors:  Martin Krallinger; Jose M G Izarzugaza; Carlos Rodriguez-Penagos; Alfonso Valencia
Journal:  BMC Bioinformatics       Date:  2009-08-27       Impact factor: 3.169

8.  Annotation of protein residues based on a literature analysis: cross-validation against UniProtKb.

Authors:  Kevin Nagel; Antonio Jimeno-Yepes; Dietrich Rebholz-Schuhmann
Journal:  BMC Bioinformatics       Date:  2009-08-27       Impact factor: 3.169

9.  From SNPs to pathways: integration of functional effect of sequence variations on models of cell signalling pathways.

Authors:  Anna Bauer-Mehren; Laura I Furlong; Michael Rautschka; Ferran Sanz
Journal:  BMC Bioinformatics       Date:  2009-08-27       Impact factor: 3.169

10.  Easy retrieval of single amino-acid polymorphisms and phenotype information using SwissVar.

Authors:  Anaïs Mottaz; Fabrice P A David; Anne-Lise Veuthey; Yum L Yip
Journal:  Bioinformatics       Date:  2010-01-26       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.