Literature DB >> 32399560

UniRule: a unified rule resource for automatic annotation in the UniProt Knowledgebase.

Alistair MacDougall1, Vladimir Volynkin1, Rabie Saidi1, Diego Poggioli1,2, Hermann Zellner1, Emma Hatton-Ellis1, Vishal Joshi1, Claire O'Donovan1, Sandra Orchard1, Andrea H Auchincloss3, Delphine Baratin3, Jerven Bolleman3, Elisabeth Coudert3, Edouard de Castro3, Chantal Hulo3, Patrick Masson3, Ivo Pedruzzi3, Catherine Rivoire3, Cecilia Arighi4, Qinghua Wang4, Chuming Chen4, Hongzhan Huang4, John Garavelli4, C R Vinayaka5, Lai-Su Yeh5, Darren A Natale5, Kati Laiho5, Maria-Jesus Martin1, Alexandre Renaux1, Klemens Pichler1.   

Abstract

MOTIVATION: The number of protein records in the UniProt Knowledgebase (UniProtKB: https://www.uniprot.org) continues to grow rapidly as a result of genome sequencing and the prediction of protein-coding genes. Providing functional annotation for these proteins presents a significant and continuing challenge.
RESULTS: In response to this challenge, UniProt has developed a method of annotation, known as UniRule, based on expertly curated rules, which integrates related systems (RuleBase, HAMAP, PIRSR, PIRNR) developed by the members of the UniProt consortium. UniRule uses protein family signatures from InterPro, combined with taxonomic and other constraints, to select sets of reviewed proteins which have common functional properties supported by experimental evidence. This annotation is propagated to unreviewed records in UniProtKB that meet the same selection criteria, most of which do not have (and are never likely to have) experimentally verified functional annotation. Release 2020_01 of UniProtKB contains 6496 UniRule rules which provide annotation for 53 million proteins, accounting for 30% of the 178 million records in UniProtKB. UniRule provides scalable enrichment of annotation in UniProtKB.
AVAILABILITY AND IMPLEMENTATION: UniRule rules are integrated into UniProtKB and can be viewed at https://www.uniprot.org/unirule/. UniRule rules and the code required to run the rules, are publicly available for researchers who wish to annotate their own sequences. The implementation used to run the rules is known as UniFIRE and is available at https://gitlab.ebi.ac.uk/uniprot-public/unifire.
© The Author(s) 2020. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32399560      PMCID: PMC7750954          DOI: 10.1093/bioinformatics/btaa485

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  11 in total

1.  PIRSitePredict for protein functional site prediction using position-specific rules.

Authors:  Chuming Chen; Qinghua Wang; Hongzhan Huang; Cholanayakanahalli R Vinayaka; John S Garavelli; Cecilia N Arighi; Darren A Natale; Cathy H Wu
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

Review 2.  Protein function prediction--the power of multiplicity.

Authors:  Robert Rentzsch; Christine A Orengo
Journal:  Trends Biotechnol       Date:  2009-02-27       Impact factor: 19.536

3.  InterProScan 5: genome-scale protein function classification.

Authors:  Philip Jones; David Binns; Hsin-Yu Chang; Matthew Fraser; Weizhong Li; Craig McAnulla; Hamish McWilliam; John Maslen; Alex Mitchell; Gift Nuka; Sebastien Pesseat; Antony F Quinn; Amaia Sangrador-Vegas; Maxim Scheremetjew; Siew-Yit Yong; Rodrigo Lopez; Sarah Hunter
Journal:  Bioinformatics       Date:  2014-01-21       Impact factor: 6.937

4.  HAMAP in 2015: updates to the protein family classification and annotation system.

Authors:  Ivo Pedruzzi; Catherine Rivoire; Andrea H Auchincloss; Elisabeth Coudert; Guillaume Keller; Edouard de Castro; Delphine Baratin; Béatrice A Cuche; Lydie Bougueleret; Sylvain Poux; Nicole Redaschi; Ioannis Xenarios; Alan Bridge
Journal:  Nucleic Acids Res       Date:  2014-10-27       Impact factor: 19.160

5.  New computational approaches to understanding molecular protein function.

Authors:  Jacquelyn S Fetrow; Patricia C Babbitt
Journal:  PLoS Comput Biol       Date:  2018-04-05       Impact factor: 4.475

6.  ECO, the Evidence & Conclusion Ontology: community standard for evidence information.

Authors:  Michelle Giglio; Rebecca Tauber; Suvarna Nadendla; James Munro; Dustin Olley; Shoshannah Ball; Elvira Mitraka; Lynn M Schriml; Pascale Gaudet; Elizabeth T Hobbs; Ivan Erill; Deborah A Siegele; James C Hu; Chris Mungall; Marcus C Chibucos
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

7.  UniProt: a worldwide hub of protein knowledge.

Authors: 
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

8.  InterPro in 2019: improving coverage, classification and access to protein sequence annotations.

Authors:  Alex L Mitchell; Teresa K Attwood; Patricia C Babbitt; Matthias Blum; Peer Bork; Alan Bridge; Shoshana D Brown; Hsin-Yu Chang; Sara El-Gebali; Matthew I Fraser; Julian Gough; David R Haft; Hongzhan Huang; Ivica Letunic; Rodrigo Lopez; Aurélien Luciani; Fabio Madeira; Aron Marchler-Bauer; Huaiyu Mi; Darren A Natale; Marco Necci; Gift Nuka; Christine Orengo; Arun P Pandurangan; Typhaine Paysan-Lafosse; Sebastien Pesseat; Simon C Potter; Matloob A Qureshi; Neil D Rawlings; Nicole Redaschi; Lorna J Richardson; Catherine Rivoire; Gustavo A Salazar; Amaia Sangrador-Vegas; Christian J A Sigrist; Ian Sillitoe; Granger G Sutton; Narmada Thanki; Paul D Thomas; Silvio C E Tosatto; Siew-Yit Yong; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

9.  PIRSF family classification system for protein functional and evolutionary analysis.

Authors:  Anastasia N Nikolskaya; Cecilia N Arighi; Hongzhan Huang; Winona C Barker; Cathy H Wu
Journal:  Evol Bioinform Online       Date:  2007-02-10       Impact factor: 1.625

10.  The Pfam protein families database in 2019.

Authors:  Sara El-Gebali; Jaina Mistry; Alex Bateman; Sean R Eddy; Aurélien Luciani; Simon C Potter; Matloob Qureshi; Lorna J Richardson; Gustavo A Salazar; Alfredo Smart; Erik L L Sonnhammer; Layla Hirsh; Lisanna Paladin; Damiano Piovesan; Silvio C E Tosatto; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

View more
  15 in total

Review 1.  PANTHER: Making genome-scale phylogenetics accessible to all.

Authors:  Paul D Thomas; Dustin Ebert; Anushya Muruganujan; Tremayne Mushayahama; Laurent-Philippe Albou; Huaiyu Mi
Journal:  Protein Sci       Date:  2021-11-25       Impact factor: 6.725

2.  Proteomic Tools for the Analysis of Cytoskeleton Proteins.

Authors:  Carlos Barreto; Andriele Silva; Eliza Wiech; Antonio Lopez; Avdar San; Shaneen Singh
Journal:  Methods Mol Biol       Date:  2022

3.  Machine Learning Prediction of Antimicrobial Peptides.

Authors:  Guangshun Wang; Iosif I Vaisman; Monique L van Hoek
Journal:  Methods Mol Biol       Date:  2022

4.  Proteomics in Non-model Organisms: A New Analytical Frontier.

Authors:  Michelle Heck; Benjamin A Neely
Journal:  J Proteome Res       Date:  2020-08-20       Impact factor: 4.466

5.  Molecular docking of alpha-enolase to elucidate the promising candidates against Streptococcus pneumoniae infection.

Authors:  Muhammad Hassan; Atif Amin Baig; Syed Awais Attique; Shafqat Abbas; Fizza Khan; Sara Zahid; Qurat Ul Ain; Muhammad Usman; Nordin Bin Simbak; Mohammad Amjad Kamal; Hanani Ahmad Yusof
Journal:  Daru       Date:  2021-02-03       Impact factor: 3.117

6.  SwissBioPics-an interactive library of cell images for the visualization of subcellular location data.

Authors:  Philippe Le Mercier; Jerven Bolleman; Edouard de Castro; Elisabeth Gasteiger; Parit Bansal; Andrea H Auchincloss; Emmanuel Boutet; Lionel Breuza; Cristina Casals-Casas; Anne Estreicher; Marc Feuermann; Damien Lieberherr; Catherine Rivoire; Ivo Pedruzzi; Nicole Redaschi; Alan Bridge
Journal:  Database (Oxford)       Date:  2022-04-12       Impact factor: 4.462

7.  Diverse Taxonomies for Diverse Chemistries: Enhanced Representation of Natural Product Metabolism in UniProtKB.

Authors:  Marc Feuermann; Emmanuel Boutet; Anne Morgat; Kristian B Axelsen; Parit Bansal; Jerven Bolleman; Edouard de Castro; Elisabeth Coudert; Elisabeth Gasteiger; Sébastien Géhant; Damien Lieberherr; Thierry Lombardot; Teresa B Neto; Ivo Pedruzzi; Sylvain Poux; Monica Pozzato; Nicole Redaschi; Alan Bridge
Journal:  Metabolites       Date:  2021-01-12

8.  UniProt: the universal protein knowledgebase in 2021.

Authors: 
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

9.  Phage Annotation Guide: Guidelines for Assembly and High-Quality Annotation.

Authors:  Dann Turner; Evelien M Adriaenssens; Igor Tolstoy; Andrew M Kropinski
Journal:  Phage (New Rochelle)       Date:  2021-12-16

10.  PredictProtein - Predicting Protein Structure and Function for 29 Years.

Authors:  Michael Bernhofer; Christian Dallago; Tim Karl; Venkata Satagopam; Michael Heinzinger; Maria Littmann; Tobias Olenyi; Jiajun Qiu; Konstantin Schütze; Guy Yachdav; Haim Ashkenazy; Nir Ben-Tal; Yana Bromberg; Tatyana Goldberg; Laszlo Kajan; Sean O'Donoghue; Chris Sander; Andrea Schafferhans; Avner Schlessinger; Gerrit Vriend; Milot Mirdita; Piotr Gawron; Wei Gu; Yohan Jarosz; Christophe Trefois; Martin Steinegger; Reinhard Schneider; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2021-07-02       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.