| Literature DB >> 29905770 |
Claire Bertelli1, Fiona S L Brinkman1.
Abstract
Motivation: Genomic islands (GIs) are clusters of genes of probable horizontal origin that play a major role in bacterial and archaeal genome evolution and microbial adaptability. They are of high medical and industrial interest, due to their enrichment in virulence factors, some antimicrobial resistance genes and adaptive metabolic pathways. The development of more sensitive but precise prediction tools, using either sequence composition-based methods or comparative genomics, is needed as large-scale analyses of microbial genomes increase.Entities:
Mesh:
Year: 2018 PMID: 29905770 PMCID: PMC6022643 DOI: 10.1093/bioinformatics/bty095
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Mean GI prediction accuracy assessed using the 104 genomes of the reference C-dataset and overlap with the literature dataset in six genomes
| Method | MCC | F-score | Accuracy | Precision | Recall | |
|---|---|---|---|---|---|---|
| Old genome files | IslandPath-DIMOB v1.0.0 | 0.51 | 0.55 | 0.77 | 0.88 | 0.46 |
| IslandPath-DIMOB v0.2.0 | 0.39 | 0.44 | 0.73 | 0.83 | 0.34 | |
| New genome files | IslandViewer 4 | 0.70 | 0.78 | 0.89 | 0.90 | 0.73 |
| IslandPath-DIMOB v1.0.0 | 0.49 | 0.55 | 0.77 | 0.87 | 0.47 | |
| SIGI-HMM | 0.35 | 0.37 | 0.73 | 0.92 | 0.26 | |
| MTGIpick | 0.32 | 0.56 | 0.70 | 0.55 | 0.68 | |
| Zisland Explorer | 0.20 | 0.23 | 0.69 | 0.85 | 0.18 | |
| Islander | 0.19 | 0.20 | 0.70 | 0.97 | 0.14 | |
| MSGIP | 0.15 | 0.20 | 0.68 | 0.87 | 0.16 | |
| Literature dataset | Literature | 0.89 | 0.94 | 0.94 | 1 | 0.89 |
Old genome files for the reference dataset as available in RefSeq before July 2014.
New genome files for the reference dataset downloaded from RefSeq on February 9, 2017.
Mean GI prediction accuracy and overlap with the C-dataset assessed using the reference L-dataset comprising six genomes
| Method | MCC | F-score | Accuracy | Precision | Recall | |
|---|---|---|---|---|---|---|
| Multiple predictors | IslandViewer 4 | 0.64 | 0.75 | 0.79 | 0.998 | 0.62 |
| Single predictors | IslandPath-DIMOB v1.0.0 | 0.54 | 0.67 | 0.72 | 0.979 | 0.52 |
| MTGIpick | 0.50 | 0.78 | 0.75 | 0.82 | 0.74 | |
| SIGI-HMM | 0.36 | 0.42 | 0.60 | 0.998 | 0.29 | |
| Islander | 0.32 | 0.35 | 0.56 | 1 | 0.23 | |
| MSGIP | 0.31 | 0.44 | 0.62 | 0.95 | 0.35 | |
| Zisland Explorer | 0.18 | 0.26 | 0.52 | 0.83 | 0.17 | |
| Comparative dataset | C-dataset | 0.43 | 0.51 | 0.65 | 1 | 0.37 |
Fig. 1.GI prediction accuracy of IslandPath-DIMOB and other individual methods compared with IslandViewer 4 composite method, based on the C-dataset (comparative genomics-based dataset). Prediction accuracy on the dataset of GIs identified by the comparative genomics approach IslandPick in 104 genomes. The boxplot shows the median, and the first and third quartiles as the lower and upper hinges. Outliers are indicated as black dots, if they exceed 1.5 times the interquartile range. IslandViewer 4 is a composite method, combining IslandPath-DIMOB, SIGI-HMM and IslandPick, whereas other tools are individual methods