Abstract
EcoGene (http://ecogene.org) is a database and website devoted to continuously improving the structural and functional annotation of Escherichia coli K-12, one of the best-understood model organisms, represented by the MG1655(Seq) genome sequence and annotations. Major improvements to EcoGene in the past decade include (i) graphic presentations of genome map features; (ii) the ability to design Boolean queries and Venn diagrams from EcoArray, EcoTopics or user-provided GeneSets; (iii) the genome-wide clone and deletion primer design tool, PrimerPairs; (iv) sequence searches using a customized EcoBLAST; (v) a Cross Reference table of synonymous gene and protein identifiers; (vi) proteome-wide indexing with GO terms; (vii) EcoTools access to >2000 complete bacterial genomes in EcoGene-RefSeq; (viii) establishment of a MySQL relational database; and (ix) use of web content management systems. The biomedical literature is surveyed daily to provide citation and gene function updates. As of September 2012, the review of 37 397 abstracts and articles led to the creation of 98 425 PubMed-Gene links and 5415 PubMed-Topic links. Annotation updates to GenBank U00096 are transmitted from EcoGene to NCBI. Experimental verifications include confirmation of a CTG start codon, pseudogene restoration and quality assurance of the Keio strain collection.
Year: 2012 PMID: 23197660 PMCID: PMC3531124 DOI: 10.1093/nar/gks1235
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1. The main GenePage tab of the EcoGene 3.0 GenePage for lacZ. The main areas of interest are circled and described in the text.
Figure 2. Interactive Venn diagrams and Boolean queries using genesets. Clicking on the number in any of the Venn diagram sectors produces a list of genes. The Venn diagrams can be saved as a PNG image for use in presentations or publications; a black-and-white version is available to save on printer and publishing costs.
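The Boolean queries over genesets described for Figure 2 amount to ordinary set operations. The following is a minimal sketch, not EcoGene's implementation: the function name `venn_sectors` and the example geneset contents are hypothetical, chosen only to illustrate how each Venn sector corresponds to the genes found in exactly one combination of sets.

```python
from itertools import combinations


def venn_sectors(genesets):
    """Return, for every combination of set names, the genes that occur
    in exactly those sets and in no other set (one Venn sector each)."""
    names = sorted(genesets)
    sectors = {}
    for size in range(1, len(names) + 1):
        for combo in combinations(names, size):
            # Genes shared by every set in this combination (Boolean AND).
            inside = set.intersection(*(genesets[n] for n in combo))
            # Genes present in any set outside the combination (Boolean OR).
            outside = set().union(*(genesets[n] for n in names if n not in combo))
            # AND NOT: keep only genes exclusive to this sector.
            sectors[combo] = inside - outside
    return sectors


# Hypothetical genesets; the memberships are illustrative, not real data.
example_sets = {
    "A": {"lacZ", "lacY", "lacA", "araB"},
    "B": {"lacZ", "araB", "araA"},
    "C": {"lacZ", "thrA"},
}
sectors = venn_sectors(example_sets)
for combo, genes in sectors.items():
    print("&".join(combo), len(genes), sorted(genes))
```

The count printed for each sector plays the role of the clickable number in the diagram, and the sorted list is the gene list that the click would produce.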
Figure 3. A GenePage Protein tab showing the GO terms with automatic expansion to a full list of less specific ancestor GO terms.
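The ancestor expansion mentioned for Figure 3 can be pictured as a traversal of the GO graph from a specific term up to its less specific parents. The sketch below assumes a simple child-to-parents mapping; the function name `expand_ancestors` and the placeholder term identifiers are hypothetical and do not come from EcoGene or from the real Gene Ontology.

```python
def expand_ancestors(term, parents):
    """Collect every ancestor of `term` in a directed acyclic graph
    given as a mapping from each term to its direct parent terms."""
    seen = set()
    stack = [term]
    while stack:
        current = stack.pop()
        for parent in parents.get(current, ()):
            if parent not in seen:  # a term may be reachable by several paths
                seen.add(parent)
                stack.append(parent)
    return seen


# Placeholder term identifiers (not real GO accessions).
toy_parents = {
    "T:leaf": ["T:mid1", "T:mid2"],
    "T:mid1": ["T:root"],
    "T:mid2": ["T:root"],
}
ancestors = expand_ancestors("T:leaf", toy_parents)
print(sorted(ancestors))
```

Tracking visited terms in a set ensures that a shared ancestor is listed once even when several paths lead to it.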
Figure 4. The E. coli K-12 ignome.
Figure 5. The y-gene code wheel depicting the systematic uncharacterized gene nomenclature rationale for E. coli K-12.
Figure 6. EcoGene 3.0 MySQL database schema. Some minor fields have been deleted for clarity. The five modules are color-coded.