| Literature DB >> 16381961 |
Liying Cui1, Narayanan Veeraraghavan, Alexander Richter, Kerr Wall, Robert K Jansen, Jim Leebens-Mack, Izabela Makalowska, Claude W dePamphilis.
Abstract
The Chloroplast Genome Database (ChloroplastDB) is an interactive, web-based database for fully sequenced plastid genomes, containing genomic, protein, DNA and RNA sequences, gene locations, RNA-editing sites, putative protein families and alignments (http://chloroplast.cbio.psu.edu/). With recent technical advances, the rate of generating new organelle genomes has increased dramatically. However, the established ontology for chloroplast genes and gene features has not been uniformly applied to all chloroplast genomes available in the sequence databases. For example, annotations for some published genome sequences have not evolved with gene naming conventions. ChloroplastDB provides unified annotations, gene name search, BLAST and download functions for chloroplast encoded genes and genomic sequences. A user can retrieve all orthologous sequences with one search regardless of gene names in GenBank. This feature alone greatly facilitates comparative research on sequence evolution including changes in gene content, codon usage, gene structure and post-transcriptional modifications such as RNA editing. Orthologous protein sets are classified by TribeMCL and each set is assigned a standard gene name. Over the next few years, as the number of sequenced chloroplast genomes increases rapidly, the tools available in ChloroplastDB will allow researchers to easily identify and compile target data for comparative analysis of chloroplast genes and genomes.Entities:
Mesh:
Year: 2006 PMID: 16381961 PMCID: PMC1347418 DOI: 10.1093/nar/gkj055
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1ChloroplastDB overview. (A) Database structure and relationship of data tables. PK: primary key. FK: foreign key. (B) Data flow and filtering steps to ensure the high quality of data stored in the database.
Figure 2Examples of analysis using the ChloroplastDB web interface. (A) Homepage of the database. (B) Search results for the gene ‘rbcL’. (C) The gene view page linked to search result for each gene, including mRNA editing information. (D) BLAST results, with options to download sequences from the BLAST search. (E) Putative orthologous gene set listed as ‘Tribes’. (F) The organism page presents a summary of genomes and extracted features in the database for batch download.