Literature DB >> 28545394

AGeNNT: annotation of enzyme families by means of refined neighborhood networks.

Florian Kandlinger1,2, Maximilian G Plach1, Rainer Merkl3.   

Abstract

BACKGROUND: Large enzyme families may contain functionally diverse members that give rise to clusters in a sequence similarity network (SSN). In prokaryotes, the genome neighborhood of a gene-product is indicative of its function and thus, a genome neighborhood network (GNN) deduced for an SSN provides strong clues to the specific function of enzymes constituting the different clusters. The Enzyme Function Initiative ( http://enzymefunction.org/ ) offers services that compute SSNs and GNNs.
RESULTS: We have implemented AGeNNT that utilizes these services, albeit with datasets purged with respect to unspecific protein functions and overrepresented species. AGeNNT generates refined GNNs (rGNNs) that consist of cluster-nodes representing the sequences under study and Pfam-nodes representing enzyme functions encoded in the respective neighborhoods. For cluster-nodes, AGeNNT summarizes the phylogenetic relationships of the contributing species and a statistic indicates how unique nodes and GNs are within this rGNN. Pfam-nodes are annotated with additional features like GO terms describing protein function. For edges, the coverage is given, which is the relative number of neighborhoods containing the considered enzyme function (Pfam-node). AGeNNT is available at https://github.com/kandlinf/agennt .
CONCLUSIONS: An rGNN is easier to interpret than a conventional GNN, which commonly contains proteins without enzymatic function and overly specific neighborhoods due to phylogenetic bias. The implemented filter routines and the statistic allow the user to identify those neighborhoods that are most indicative of a specific metabolic capacity. Thus, AGeNNT facilitates to distinguish and annotate functionally different members of enzyme families.

Entities:  

Keywords:  Enzyme function; GNN; Genome content; Genome neighborhood network; Homology-free annotation; SSN; Sequence similarity network

Mesh:

Substances:

Year:  2017        PMID: 28545394      PMCID: PMC5445326          DOI: 10.1186/s12859-017-1689-6

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  35 in total

1.  Improving the quality of protein similarity network clustering algorithms using the network edge weight distribution.

Authors:  Leonard Apeltsin; John H Morris; Patricia C Babbitt; Thomas E Ferrin
Journal:  Bioinformatics       Date:  2010-11-29       Impact factor: 6.937

2.  Evolution of substrate specificity in a recipient's enzyme following horizontal gene transfer.

Authors:  Lianet Noda-García; Aldo R Camacho-Zarco; Sofía Medina-Ruíz; Paul Gaytán; Mauricio Carrillo-Tripp; Vilmos Fülöp; Francisco Barona-Gómez
Journal:  Mol Biol Evol       Date:  2013-06-25       Impact factor: 16.240

3.  Assignment of function to a domain of unknown function: DUF1537 is a new kinase family in catabolic pathways for acid sugars.

Authors:  Xinshuai Zhang; Michael S Carter; Matthew W Vetting; Brian San Francisco; Suwen Zhao; Nawar F Al-Obaidi; Jose O Solbiati; Jennifer J Thiaville; Valérie de Crécy-Lagard; Matthew P Jacobson; Steven C Almo; John A Gerlt
Journal:  Proc Natl Acad Sci U S A       Date:  2016-07-11       Impact factor: 11.205

4.  Identification of four genes necessary for biosynthesis of the modified nucleoside queuosine.

Authors:  John S Reader; David Metzgar; Paul Schimmel; Valérie de Crécy-Lagard
Journal:  J Biol Chem       Date:  2003-12-02       Impact factor: 5.157

5.  Using sequence similarity networks for visualization of relationships across diverse protein superfamilies.

Authors:  Holly J Atkinson; John H Morris; Thomas E Ferrin; Patricia C Babbitt
Journal:  PLoS One       Date:  2009-02-03       Impact factor: 3.240

6.  A General Strategy for the Discovery of Metabolic Pathways: d-Threitol, l-Threitol, and Erythritol Utilization in Mycobacterium smegmatis.

Authors:  Hua Huang; Michael S Carter; Matthew W Vetting; Nawar Al-Obaidi; Yury Patskovsky; Steven C Almo; John A Gerlt
Journal:  J Am Chem Soc       Date:  2015-11-12       Impact factor: 15.419

Review 7.  Enterobactin: an archetype for microbial iron transport.

Authors:  Kenneth N Raymond; Emily A Dertz; Sanggoo S Kim
Journal:  Proc Natl Acad Sci U S A       Date:  2003-03-24       Impact factor: 11.205

8.  G-NEST: a gene neighborhood scoring tool to identify co-conserved, co-expressed genes.

Authors:  Danielle G Lemay; William F Martin; Angie S Hinrichs; Monique Rijnkels; J Bruce German; Ian Korf; Katherine S Pollard
Journal:  BMC Bioinformatics       Date:  2012-09-28       Impact factor: 3.169

9.  The InterPro protein families database: the classification resource after 15 years.

Authors:  Alex Mitchell; Hsin-Yu Chang; Louise Daugherty; Matthew Fraser; Sarah Hunter; Rodrigo Lopez; Craig McAnulla; Conor McMenamin; Gift Nuka; Sebastien Pesseat; Amaia Sangrador-Vegas; Maxim Scheremetjew; Claudia Rato; Siew-Yit Yong; Alex Bateman; Marco Punta; Teresa K Attwood; Christian J A Sigrist; Nicole Redaschi; Catherine Rivoire; Ioannis Xenarios; Daniel Kahn; Dominique Guyot; Peer Bork; Ivica Letunic; Julian Gough; Matt Oates; Daniel Haft; Hongzhan Huang; Darren A Natale; Cathy H Wu; Christine Orengo; Ian Sillitoe; Huaiyu Mi; Paul D Thomas; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2014-11-26       Impact factor: 16.971

10.  Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation.

Authors:  José P Faria; James J Davis; Janaka N Edirisinghe; Ronald C Taylor; Pamela Weisenhorn; Robert D Olson; Rick L Stevens; Miguel Rocha; Isabel Rocha; Aaron A Best; Matthew DeJongh; Nathan L Tintle; Bruce Parrello; Ross Overbeek; Christopher S Henry
Journal:  Front Microbiol       Date:  2016-11-24       Impact factor: 5.640

View more
  2 in total

Review 1.  Genomic Enzymology: Web Tools for Leveraging Protein Family Sequence-Function Space and Genome Context to Discover Novel Functions.

Authors:  John A Gerlt
Journal:  Biochemistry       Date:  2017-08-22       Impact factor: 3.162

2.  Conserved genomic neighborhood is a strong but no perfect indicator for a direct interaction of microbial gene products.

Authors:  Robert Esch; Rainer Merkl
Journal:  BMC Bioinformatics       Date:  2020-01-03       Impact factor: 3.169

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.