| Literature DB >> 22355228 |
Dongsheng Che, Mohammad Shabbir Hasan, Han Wang, John Fazekas, Jinling Huang, Qi Liu.
Abstract
Genomic islands (GIs) are genomic regions that are originally transferred from other organisms. The detection of genomic islands in genomes can lead to many applications in industrial, medical and environmental contexts. Existing computational tools for GI detection suffer either low recall or low precision, thus leaving the room for improvement. In this paper, we report the development of our Ensemble algorithm for Genomic Island Detection (EGID). EGID utilizes the prediction results of existing computational tools, filters and generates consensus prediction results. Performance comparisons between our ensemble algorithm and existing programs have shown that our ensemble algorithm is better than any other program. EGID was implemented in Java, and was compiled and executed on Linux operating systems. EGID is freely available at http://www5.esu.edu/cpsc/bioinfo/software/EGID.Entities:
Keywords: Bacterial genomes; Ensemble algorithm; Genomic islands
Year: 2011 PMID: 22355228 PMCID: PMC3280502 DOI: 10.6026/007/97320630007311
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1The flowchart of our computational framework for GI prediction
Figure 2Illustrative examples of candidate GI regions, where two candidate GI regions are close in (A) and distant in (B). Each vertical bar represents the vote of a GI gene by multiple GI tools. The candidate GI regions meeting the threshold value are underlined.
Figure 3Circular representations of the Escherichia coli O157:H7 str. Sakai (NC_002695) showing predicted GIs, with each circle predicted by each program. The predicted GIs from the outer to the inner circle are EGID, AlienHunter, COLOMBO SIGI-HMM, INDeGenIUS, Island-Path, and PAI-IDA. The shaded parts show the predicted GIs by EGID, and evidenced GIs by other programs.