| Literature DB >> 15980582 |
Benjamin R Landsteiner1, Michael R Olson, Robert Rutherford.
Abstract
The Current Comparative Table (CCT) software program enables working biologists to automate customized bioinformatics searches, typically of remote sequence or HMM (hidden Markov model) databases. CCT currently supports BLAST, hmmpfam and other programs useful for gene and ortholog identification. The software is web based, has a BioPerl core and can be used remotely via a browser or locally on Mac OS X or Linux machines. CCT is particularly useful to scientists who study large sets of molecules in today's evolving information landscape because it color-codes all result files by age and highlights even tiny changes in sequence or annotation. By empowering non-bioinformaticians to automate custom searches and examine current results in context at a glance, CCT allows a remote database submission in the evening to influence the next morning's bench experiment. A demonstration of CCT is available at http://orb.public.stolaf.edu/CCTdemo and the open source software is freely available from http://sourceforge.net/projects/orb-cct.Entities:
Mesh:
Year: 2005 PMID: 15980582 PMCID: PMC1160193 DOI: 10.1093/nar/gki432
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1A small part of a sample current comparative table. The molecule and description columns contain information supplied by the user; the other columns contain data generated by automated searches against user-specified databases. In these database columns, the cells contain links to the result of each search. Links are color-coded to reflect the length of time since the search result last changed. To generate the table shown, CCT was given a set of 48 genes in the Mycobacterium tuberculosis (MTB) Dormancy regulon (10) and directed to monitor four remote databases (Pfam, the MTB proteome and two incomplete genomes, M.smegmatis and M.marinum). The pattern of red indicates that a new release of the M.marinum genome was downloaded in the past 24 h (generating many new red tblastn links), and that a new sequence (seq program) of an ortholog (homolog program) was found for the TB protein Rv1812c. The predicted ortholog for Rv0079, in contrast, did not change as a result of the update and the corresponding links remain blue. More detail concerning any result can be viewed by clicking on the corresponding link(s).
Figure 2Example of a result file revised to reflect a new release of the Pfam database. Red highlighting indicates new data; black highlighting shows removed data. The regions above and below the horizontal line show different parts of the same result file. This hmmpfam search result shows that the ‘Conserved Hypothetical Protein’ Rv2030c matches a new domain in Pfam, specifically the ‘erythromycin esterase’ domain. CCT's highlighting also shows a second finding: the model for the ‘phosphoribosyl’ domain has been changed subtly (note the changed amino acids in the subject line). Taken together with the change in database size for the new release, the E-value for this search has changed somewhat. To view the newest results only (without highlighting to show changes) a user can click on the ‘Unhighlighted File’ link.