| Literature DB >> 17570856 |
Jason D Gans1, Murray Wolinsky.
Abstract
BACKGROUND: The ability to visualize genomic features and design experimental assays that can target specific regions of a genome is essential for modern biology. To assist in these tasks, we present Genomorama, a software program for interactively displaying multiple genomes and identifying potential DNA hybridization sites for assay design.Entities:
Mesh:
Year: 2007 PMID: 17570856 PMCID: PMC1906841 DOI: 10.1186/1471-2105-8-204
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Comparing features of freely available, stand-alone genome viewers
| Program | Platformsa | Input formatsb | Graphic output formatsc | Source code available | Circular view | Linear view | Real time navigation | Multiple genomes | Annotation editing and creation | Annotation searching | Sequence searching |
| Apollo [2] | Java | GAME XML, GFF, GBK, EMBL, FASTA | PS | ||||||||
| Argo [3] | Java | GFF, GBK, GENSCAN, BLAST | Printer | ||||||||
| Artemis [4] | Java | EMBL, GBK, FASTA, GFF | JPG, PNG | ||||||||
| Bluejay [5] | Java | XML | Printer, SVG | ||||||||
| CGView [6] | Java | PTT, XML | PNG, JPG, SVG | ||||||||
| DNAvis [7] | Windows, Linux | GFF, FASTA | |||||||||
| GATA [8] | Java | GFF | PNG | ||||||||
| GeneViTo [9] | Java | PTT+FFN+FNA | JPG | ||||||||
| GenoMap [10] | Tcl/Tk | GRS | PS | ||||||||
| Genome2D [11] | Windows | GBK, FASTA, GLIMMER, PARADOX | Printer, WMF, BMP | ||||||||
| GenomeComp [12] | Perl/Tk | EMBL, GBK, FASTA | PS | ||||||||
| GenomePlot [13] | Tcl/Tk/Perl | tab delimited | PS, GIF, TIFF, JPG | ||||||||
| GenomeViz [14] | Tcl/Tk/Perl (no Windows) | tab delimited | PS | ||||||||
| Genome Workbench [15] | OS X, Windows, Linux | ASN.1, XML, FASTA, GFF | |||||||||
| Genomorama | OS X, Windows, Linux | EMBL, GBK, ASN.1, FASTA, PTT | PS, GIF | ||||||||
| IGB [16] | Java | GFF, FASTA, PSL, DAS | Printer | ||||||||
| Mauve [17] | Java | GBK, FASTA, SEQ | PNG, JPG | ||||||||
| SeqVISTA [18] | Java | EMBL, FASTA | JPG | ||||||||
| Sockeye [19] | Java | EMBL (via server), GFF | JPG |
aPrograms that use Java, Tcl/Tk and Perl are expected to run on any operating system. bCommon file formats include the GenBank flat file (GBK), EMBL flat file (EMBL), nucleic acid sequence file (FASTA), general feature format (GFF) and protein table file (PTT). A complete list of genome annotation file formats can be found on the Genomorama project webpage. cThe graphic output format labeled "Printer" indicates direct output to an attached printer.
Figure 1Genomorama can load and display the multiple annotated contigs stored in a whole genome shotgun GBK file. This screen shot shows five contigs from Sphingopyxis alaskensis RB2256 (extracted from the NCBI [21] file wgs.AAIP.1.gbff) and the associated sequence quality scores (from the NCBI [21] file wgs.AAIP.1.qscore). Quality scores are proportional to the negative log of the probability that a given base has been incorrectly assigned as an A, T, G or C and are shown as black plots superimposed over each contig track. The value of a quality score for each track is interactively displayed on the menu bar as a user specified score [i.e. "user(90)"] for the annotation track and base currently selected by the cursor.
Figure 2Comparing the time to load human chromosome 1. The time to load Homo sapiens chromosome 1 is used to compare the performance of Genomorama and two Java based tools: Apollo [2] and Argo [3]. The time to load the GBK file [GenBank:NC_000001.9] from the local hard drive is shown for three computing platforms: a high-end OS X 10.4.8 workstation (dual 3 Ghz Intel Xeon CPUs, 3 GB ram, Java 1.5.0), a mid-range Linux Red Hat 4.0.1 workstation (dual 2.4 GHz Intel Xeon CPUs, 1 GB ram, Java 1.4.2) and low-end OS X 10.3.9 desktop (single 1.8 GHz G5 PowerPC CPU, 512 MB ram, Java 1.4.2). The Java-based programs were run from the command line with the arguments "-Xms32m -Xmx1024m" to increase the amount of memory allowed to the Java virtual machine. Providing Java with more than 1 GB of memory did not improve performance (results not shown). Each program loaded the genome file twice (to ensure fair OS disk caching) and the second load time is reported. For all platforms, Genomorama loads the genome file more than an order of magnitude faster than either of the Java-based programs.
Figure 3Genomorama supports sequence searching with PCR primers. The genomic neighborhood of the amplicon (shown in orange) produced by the B. anthracis [GenBank:NC_003997.3] chromosomal specific PCR primers, M.Ctg032 [32]. The amplicon is contained within a glycosyl transferase (show in yellow). The amplicon annotation was added to the genome by selecting the "annotate" button on the Hybridize dialog box.