Literature DB >> 24990605

ARBitrator: a software pipeline for on-demand retrieval of auto-curated nifH sequences from GenBank.

Philip Heller1, H James Tripp1, Kendra Turk-Kubo1, Jonathan P Zehr1.   

Abstract

MOTIVATION: Studies of the biochemical functions and activities of uncultivated microorganisms in the environment require analysis of DNA sequences for phylogenetic characterization and for the development of sequence-based assays for the detection of microorganisms. The numbers of sequences for genes that are indicators of environmentally important functions such as nitrogen (N2) fixation have been rapidly growing over the past few decades. Obtaining these sequences from the National Center for Biotechnology Information's GenBank database is problematic because of annotation errors, nomenclature variation and paralogues; moreover, GenBank's structure and tools are not conducive to searching solely by function. For some genes, such as the nifH gene commonly used to assess community potential for N2 fixation, manual collection and curation are becoming intractable because of the large number of sequences in GenBank and the large number of highly similar paralogues. If analysis is to keep pace with sequence discovery, an automated retrieval and curation system is necessary.
RESULTS: ARBitrator uses a two-step process composed of a broad collection of potential homologues followed by screening with a best hit strategy to conserved domains. 34 420 nifH sequences were identified in GenBank as of November 20, 2012. The false-positive rate is ∼0.033%. ARBitrator rapidly updates a public nifH sequence database, and we show that it can be adapted for other genes.
AVAILABILITY AND IMPLEMENTATION: Java source and executable code are freely available to non-commercial users at http://pmc.ucsc.edu/∼wwwzehr/research/database/. CONTACT: zehrj@ucsc.edu SUPPLEMENTARY INFORMATION: SUPPLEMENTARY INFORMATION is available at Bioinformatics online.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24990605     DOI: 10.1093/bioinformatics/btu417

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  19 in total

1.  Contribution of Heterotrophic Diazotrophs to N2 Fixation in a Eutrophic River: Free-Living vs. Aggregate-Associated.

Authors:  Eyal Geisler; Eyal Rahav; Edo Bar-Zeev
Journal:  Front Microbiol       Date:  2022-02-14       Impact factor: 5.640

2.  High levels of heterogeneity in diazotroph diversity and activity within a putative hotspot for marine nitrogen fixation.

Authors:  Lauren F Messer; Claire Mahaffey; Charlotte M Robinson; Thomas C Jeffries; Kirralee G Baker; Jaime Bibiloni Isaksson; Martin Ostrowski; Martina A Doblin; Mark V Brown; Justin R Seymour
Journal:  ISME J       Date:  2015-11-27       Impact factor: 10.302

3.  Contribution of mono and polysaccharides to heterotrophic N2 fixation at the eastern Mediterranean coastline.

Authors:  E Rahav; M J Giannetto; E Bar-Zeev
Journal:  Sci Rep       Date:  2016-06-16       Impact factor: 4.379

4.  Diversity and Activity of Diazotrophs in Great Barrier Reef Surface Waters.

Authors:  Lauren F Messer; Mark V Brown; Miles J Furnas; Richard L Carney; A D McKinnon; Justin R Seymour
Journal:  Front Microbiol       Date:  2017-06-07       Impact factor: 5.640

5.  Sewage outburst triggers Trichodesmium bloom and enhance N2 fixation rates.

Authors:  Eyal Rahav; Edo Bar-Zeev
Journal:  Sci Rep       Date:  2017-06-29       Impact factor: 4.379

6.  Mesopelagic N2 Fixation Related to Organic Matter Composition in the Solomon and Bismarck Seas (Southwest Pacific).

Authors:  Mar Benavides; Pia H Moisander; Hugo Berthelot; Thorsten Dittmar; Olivier Grosso; Sophie Bonnet
Journal:  PLoS One       Date:  2015-12-11       Impact factor: 3.240

7.  Diazotroph Diversity in the Sea Ice, Melt Ponds, and Surface Waters of the Eurasian Basin of the Central Arctic Ocean.

Authors:  Mar Fernández-Méndez; Kendra A Turk-Kubo; Pier L Buttigieg; Josephine Z Rapp; Thomas Krumpen; Jonathan P Zehr; Antje Boetius
Journal:  Front Microbiol       Date:  2016-11-23       Impact factor: 5.640

8.  Evaluation of Primers Targeting the Diazotroph Functional Gene and Development of NifMAP - A Bioinformatics Pipeline for Analyzing nifH Amplicon Data.

Authors:  Roey Angel; Maximilian Nepel; Christopher Panhölzl; Hannes Schmidt; Craig W Herbold; Stephanie A Eichorst; Dagmar Woebken
Journal:  Front Microbiol       Date:  2018-04-30       Impact factor: 5.640

9.  nifPred: Proteome-Wide Identification and Categorization of Nitrogen-Fixation Proteins of Diaztrophs Based on Composition-Transition-Distribution Features Using Support Vector Machine.

Authors:  Prabina K Meher; Tanmaya K Sahu; Jyotilipsa Mohanty; Shachi Gahoi; Supriya Purru; Monendra Grover; Atmakuri R Rao
Journal:  Front Microbiol       Date:  2018-05-29       Impact factor: 5.640

10.  A database of metazoan cytochrome c oxidase subunit I gene sequences derived from GenBank with CO-ARBitrator.

Authors:  Philip Heller; James Casaletto; Gregory Ruiz; Jonathan Geller
Journal:  Sci Data       Date:  2018-08-07       Impact factor: 6.444

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.