| Literature DB >> 24923821 |
Wen Wei1, Yuan-Nong Ye1, Sen Luo1, Yan-Yan Deng1, Dan Lin1, Feng-Biao Guo2.
Abstract
Knowledge of an organism's fitness for survival is important for a complete understanding of microbial genetics and effective drug design. Current essential gene databases provide only binary essentiality data from genome-wide experiments. We therefore developed a new database that Integrates quantitative Fitness Information for Microbial genes (IFIM). The IFIM database currently contains data from 16 experiments and 2186 theoretical predictions. The highly significant correlation between the experiment-derived fitness data and our computational simulations demonstrated that the computer-generated predictions were often as reliable as the experimental data. The data in IFIM can be accessed easily, and the interface allows users to browse through the gene fitness information that it contains. IFIM is the first resource that allows easy access to fitness data of microbial genes. We believe this database will contribute to a better understanding of microbial genetics and will be useful in designing drugs to resist microbial pathogens, especially when experimental data are unavailable. Database URL: http://cefg.uestc.edu.cn/ifim/ or http://cefg.cn/ifim/Entities:
Mesh:
Year: 2014 PMID: 24923821 PMCID: PMC4207227 DOI: 10.1093/database/bau052
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Details of data collection and processing for the IFIM database.
Figure 2.Assessment of the accuracy of prediction. (A) Linear regression R2 between experiment-derived fitness and computation-predicting fitness, (B) Correlation analysis among experimental fitness data sets.
Figure 3.Predicted fitness distribution of essential and nonessential genes in 21 bacterial strains. Essentiality information was obtained from the DEG database.
Figure 4.Geptop predicted fitness values. (A) Predicted fitness values for E. coli. (B) Predicted fitness values for S. typhimurium. Error bars represent 90% confidence intervals on the estimates of the means. UE: universal essential; CE: conditional essential; NE: nonessential.
Fitness values of multiple copy genes in the computational predictions and experiment data from four data sets
| Gene | Dataset | |||
|---|---|---|---|---|
| Geptop | STY01 | STY02 | STY03 | |
| t4161 (0.237) | t4161 (0.231) | t4161 (0.245) | t4161 (0.260) | |
| t4237 (0.733) | t4237 (0.742) | t4237 (0.837) | t4237 (0.833) | |
| t0126 (0.263) | t0126 (0.102) | t0126 (0.128) | t0126 (0.142) | |
| t1042 (0.835) | t1042 (0.616) | t1042 (0.780) | t1042 (0.772) | |
| t4024 (0.448) | t4024 (0.419) | t4024 (0.863) | t4024 (0.563) | |
| t4557 (0.834) | t4557 (0.559) | t4557 (0.819) | t4557 (0.689) | |
Figure 5.The web interface for the IFIM database. (A) The Home & Browse page; (B) Examples of Browse or Search result pages. (I) choosing different sources to browse data sets, (II) choosing different taxonomy levels to browse data sets and (III) choosing different data sets to download; (C) The Download pages; (D) An example of a fitness information page; (E) The Analysis page; (F) The link page showing a NCBI link to a gene.