Abstract
The annotation of most genomes becomes outdated over time, owing in part to our ever-improving knowledge of genomes and in part to improvements in bioinformatics software. Unfortunately, annotation is rarely if ever updated and resources to support routine reannotation are scarce. Wiki software, which would allow many scientists to edit each genome's annotation, offers one possible solution.
Year: 2007 PMID: 17274839 PMCID: PMC1839116 DOI: 10.1186/gb-2007-8-1-102
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1. Overview of sequencing and annotation for a whole-genome shotgun project, for example, sequencing a bacterial genome. First (a), genomic DNA is purified, broken into short fragments and cloned into E. coli. The cloned fragments are then sequenced from both ends on an automated sequencing machine. The resulting sequences (shown in (b) as they appear on the sequencing machine display) are then assembled using a complex software program that identifies overlaps into (c) large, contiguous sequences representing the chromosomes from the original DNA. Gaps are filled until the genome is complete. (d) Annotation begins with the execution of several gene-finding programs, such as Glimmer, which identifies protein-coding genes, tRNAScan, which identifies tRNAs, and other programs for other genome features. (e) These initial predictions are used as the basis for BLAST searches against large protein databases, which identify related proteins based on sequence similarity. Translated (Blastx) searches are then used to scan the databases to detect any proteins that match the DNA regions in between predicted genes. Customized annotation programs are used to decide what name and function to assign to each protein, leading to (f) the final annotated genome.
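
Step (e) of the figure mentions scanning the DNA regions in between predicted genes. As a rough illustration of how those regions might be derived from a set of gene predictions, here is a minimal Python sketch. The function name `intergenic_regions`, the 1-based inclusive coordinate convention, the `min_length` parameter, and the treatment of the genome as linear are all assumptions made for this example; they are not taken from the paper or from any particular annotation pipeline.

```python
from typing import List, Tuple

# Hypothetical representation: 1-based, inclusive (start, end) coordinates.
Interval = Tuple[int, int]


def intergenic_regions(
    genes: List[Interval], genome_length: int, min_length: int = 1
) -> List[Interval]:
    """Return the stretches of a linear genome not covered by any predicted gene.

    Overlapping or nested gene predictions are handled by tracking the
    furthest position covered so far. Circular chromosomes are NOT handled:
    a gap spanning the origin is reported as two separate regions.
    """
    gaps: List[Interval] = []
    cursor = 1  # first position not yet covered by a gene
    for start, end in sorted(genes):
        if start > cursor:
            gaps.append((cursor, start - 1))
        cursor = max(cursor, end + 1)
    if cursor <= genome_length:
        gaps.append((cursor, genome_length))
    # Drop gaps too short to be worth searching.
    return [(s, e) for s, e in gaps if e - s + 1 >= min_length]


# Example with made-up coordinates: two overlapping genes and one isolated gene.
predicted = [(100, 400), (350, 600), (900, 1200)]
print(intergenic_regions(predicted, genome_length=1500, min_length=100))
# [(601, 899), (1201, 1500)]
```

In a real pipeline, the sequence of each returned region would then be extracted and submitted to a translated search; that step is omitted here because it depends on the tools and databases in use.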