| Literature DB >> 16734895 |
Galina Glazko1, Michael Coleman, Arcady Mushegian.
Abstract
We present psi-square, a program for searching the space of gene vectors. The program starts with a gene vector, i.e., the set of measurements associated with a gene, and finds similar vectors, derives a probabilistic model of these vectors, then repeats search using this model as a query, and continues to update the model and search again, until convergence. When applied to three different pathway-discovery problems, psi-square was generally more sensitive and sometimes more specific than the ad hoc methods developed for solving each of these problems before.Entities:
Year: 2006 PMID: 16734895 PMCID: PMC1489924 DOI: 10.1186/1745-6150-1-13
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Figure 1Phyletic vectors of 37 bacterial COGs related to flagella biogenesis and function. Bacteria with flagellar phenotype and flagella-related genes are clustered to highlight the 'flagella genomic signature'.
Sensitivity and specificity of psi-square, TTG and PP algorithms in prediction of flagellae components.
| Psi-square: single query | Psi-square:combined query | TTG | PP | |
| False Positives (FP) | 16 | 39 | 6 | 24 |
| True Positives (TP) | 29 | 34 | 27 | 22 |
| False Negatives (FN) | 8 | 3 | 10 | 15 |
| Number of predicted proteins: | 45 | 73 | 33 | 46 |
Figure 2Phyletic vectors and COGs associated with flagella phenotype, identified by psi-square and TTG algorithms (45 COGs and 33 COGs, respectively), with COG1298 used as a query. a) 27 COGs in benchmark (see text), also found by psi-square and TTG; b) 5 COGs found by psi-square and TTG; c) 2 COGs found by psi-square and in benchmark and one COG found by TTG only; d) 8 COGs found in benchmark only; e) 11 COGs found by psi-square only. COG numbers and functional annotations are shown in the right-hand column. Note that the species' order in this figure is different from Figure 1 and reflects the evolutionary relatedness of species.
Figure 3Expression vectors for the closest matches retrieved by psi-square with query PFA0110w in Plasmodium IDC dataset. Two best matches per iteration (nine iterations before convergence) are shown.