Fast parsers for Entrez Gene.

Mingyi Liu, Andrei Grigoriev.

Abstract

NCBI completed the transition of its main genome annotation database from LocusLink to Entrez Gene in Spring 2005. However, to date few parsers exist for the Entrez Gene annotation file. Owing to the widespread use of LocusLink and the popularity of the Perl programming language in bioinformatics, a publicly available, high-performance Entrez Gene parser in Perl is urgently needed. We present four such parsers, developed using different parsing approaches (Parse::RecDescent, Parse::Yapp, Perl-byacc and Perl 5 regular expressions), and provide the first in-depth comparison of these sophisticated Perl tools. Our fastest parser processes the entire human Entrez Gene annotation file in under 12 min on one Intel Xeon 2.4 GHz CPU and can help the bioinformatics community during and after the transition from LocusLink to Entrez Gene.

Year: 2005   PMID: 15879451   DOI: 10.1093/bioinformatics/bti488

Source DB: PubMed   Journal: Bioinformatics   ISSN: 1367-4803   Impact factor: 6.937


  3 in total

1.  iCartiGD: the Integrated Cartilage Gene Database.

Authors:  Ming-Yiu Yeung; David K Smith; Matthew S Y Chan; Cheuk M Li; Brian C Wong; Kenneth M C Cheung; Keith D K Luk; Kathryn S E Cheah; Pak Sham; Danny Chan; You-Qiang Song
Journal:  BMC Genet       Date:  2007-02-23       Impact factor: 2.797

2.  EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries.

Authors:  Robin P Smith; William J Buchser; Marcus B Lemmon; Jose R Pardinas; John L Bixby; Vance P Lemmon
Journal:  BMC Bioinformatics       Date:  2008-04-10       Impact factor: 3.169

3.  Identification of minimal eukaryotic introns through GeneBase, a user-friendly tool for parsing the NCBI Gene databank.

Authors:  Allison Piovesan; Maria Caracausi; Marco Ricci; Pierluigi Strippoli; Lorenza Vitale; Maria Chiara Pelleri
Journal:  DNA Res       Date:  2015-11-17       Impact factor: 4.458
