| Literature DB >> 26992782 |
Wen Wei, Feng Gao, Meng-Ze Du, Hong-Li Hua, Ju Wang, Feng-Biao Guo.
Abstract
Genomic islands are genomic fragments of alien origin in bacterial and archaeal genomes, usually involved in symbiosis or pathogenesis. In this work, we described Zisland Explorer, a novel tool to predict genomic islands based on the segmental cumulative GC profile. Zisland Explorer was designed with a novel strategy, as well as a combination of the homogeneity and heterogeneity of genomic sequences. While the sequence homogeneity reflects the composition consistence within each island, the heterogeneity measures the composition bias between an island and the core genome. The performance of Zisland Explorer was evaluated on the data sets of 11 different organisms. Our results suggested that the true-positive rate (TPR) of Zisland Explorer was at least 10.3% higher than that of four other widely used tools. On the other hand, the new tool did not lose overall accuracy with the improvement in the TPR and showed better equilibrium among various evaluation indexes. Also, Zisland Explorer showed better accuracy in the prediction of experimental island data. Overall, the tool provides an alternative solution over other tools, which expands the field of island prediction and offers a supplement to increase the performance of the distinct predicting strategy. We have provided a web service as well as a graphical user interface and open-source code across multiple platforms for Zisland Explorer, which is available at http://cefg.uestc.edu.cn/Zisland_Explorer/ or http://tubic.tju.edu.cn/Zisland_Explorer/.Entities:
Keywords: cumulative GC profile; genomic islands; homogeneity; prediction
Mesh:
Year: 2017 PMID: 26992782 PMCID: PMC5429010 DOI: 10.1093/bib/bbw019
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622
Comparison of core algorithm of each method
| Zisland Explorer | Codon usage bias Amino acid bias | Similarity within a island | ||
| Cumulative GC profile | Similarity within a island | |||
| Islander | tRNA site Mobility gene | |||
| IslandPath-DIMOB | Di-nucleotide bias | Mobility gene | ||
| SIGI-HMM | Codon usage bias | |||
| IslandPick | Comparison with closely related genomes |
Figure 1Zisland Explorer workflow.
Performance of Zisland Explorer
| 0.338 | 0.988 | 0.754 | 0.454 | 0.996 | 0.818 | |
| 0.291 | 0.980 | 0.841 | 0.313 | 0.995 | 0.799 | |
| 0.628 | 0.982 | 0.948 | 0.553 | 0.990 | 0.940 | |
| 0.930 | 0.962 | 0.960 | 0.720 | 0.987 | 0.934 | |
| 1.000 | 0.917 | 0.923 | 0.713 | 1.000 | 0.730 | |
| 0.786 | 0.962 | 0.956 | 0.297 | 0.961 | 0.931 | |
| 0.305 | 0.983 | 0.913 | 0.214 | 1.000 | 0.759 | |
| 0.523 | 0.969 | 0.930 | 0.379 | 0.995 | 0.827 | |
| 0.459 | 0.965 | 0.880 | 0.353 | 0.989 | 0.778 | |
| 0.521 | 0.955 | 0.905 | 0.307 | 0.995 | 0.788 | |
| 0.614 | 0.954 | 0.926 | 0.454 | 0.987 | 0.894 | |
| 0.374 | 0.984 | 0.897 | 0.356 | 1.000 | 0.750 | |
| 0.472 | 0.998 | 0.972 | 0.226 | 0.997 | 0.933 | |
| 1.000 | 0.994 | 0.995 | 0.831 | 1.000 | 0.911 | |
| Average | 0.589 | 0.971 | 0.914 | 0.441 | 0.992 | 0.842 |
Figure 2Performance comparison of Zisland Explorer with other tools. A colour version of this figure is available at BIB online: https://academic.oup.com/bib.
The proportion of ICEs predicted by five tools
| BPGI2 | 21.9% | 42.3% | 48.0% | 0.0% | 0.0% |
| BPGI3 | 0.0% | 56.2% | 59.7% | 0.0% | 0.0% |
| BPGI4 | 0.0% | 56.5% | 26.7% | 0.0% | 0.0% |
| BPGI7 | 30.8% | 48.2% | 42.0% | 0.0% | 0.0% |
| PMGI6 | 100.0% | 0.0% | 49.3% | 59.9% | 0.0% |
| PMGI7 | 54.2% | 0.0% | 39.8% | 95.6% | 100.0% |
| STGI12 (SPI-7) | 90.2% | 74.0% | 18.8% | 22.8% | 99.9% |
| SEGI4 | 100.0% | 93.9% | 0.0% | 19.7% | 0.0% |
| Average | 49.6% | 46.4% | 35.5% | 24.8% | 25.0% |
Improvement of each tool by combing with Zisland Explorer
| IslandPick | 51.59% | –2.60% | 3.63% | 24.50% | 45.86% | 40.58% |
| IslandPath-DIMOB | 29.30% | –1.97% | 1.61% | 13.66% | 16.51% | 15.63% |
| SIGI-HMM | 28.97% | –1.94% | 1.66% | 13.52% | 18.65% | 18.07% |
| Islander | 44.85% | –2.75% | 2.67% | 21.05% | 33.98% | 30.86% |
| IslandPick | 36.73% | –0.73% | 12.69% | 18.00% | 43.14% | 36.04% |
| IslandPath-DIMOB | 19.71% | –0.62% | 6.44% | 9.54% | 17.12% | 14.29% |
| SIGI-HMM | 19.96% | –0.72% | 6.69% | 9.62% | 20.09% | 15.43% |
| Islander | 34.84% | –0.75% | 12.82% | 17.04% | 39.07% | 30.59% |
Figure 3Webserver and GUI.
Figure 4Proportion of islands in each genome. Light red is for bacteria and light blue for archaea. A colour version of this figure is available at BIB online: https://academic.oup.com/bib.