| Literature DB >> 26200753 |
Rian Pierneef1, Louis Cronje1, Oliver Bezuidt1, Oleg N Reva2.
Abstract
The Predicted Genomic Islands database (Pre_GI) is a comprehensive repository of prokaryotic genomic islands (islands, GIs) freely accessible at http://pregi.bi.up.ac.za/index.php. Pre_GI, Version 2015, catalogues 26 744 islands identified in 2407 bacterial/archaeal chromosomes and plasmids. It provides an easy-to-use interface which allows users the ability to query against the database with a variety of fields, parameters and associations. Pre_GI is constructed to be a web-resource for the analysis of ontological roads between islands and cartographic analysis of the global fluxes of mobile genetic elements through bacterial and archaeal taxonomic borders. Comparison of newly identified islands against Pre_GI presents an alternative avenue to identify their ontology, origin and relative time of acquisition. Pre_GI aims to aid research on horizontal transfer events and materials through providing data and tools for holistic investigation of migration of genes through ecological niches and taxonomic boundaries.Entities:
Mesh:
Year: 2015 PMID: 26200753 PMCID: PMC5630688 DOI: 10.1093/database/bav058
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Percentage of re-identification of 51 known islands from PAIDB by IslandViewer tools and SWGIS with different threshold parameters.
Figure 2.Proposed donor–recipient relationship by using LingvoCom 2D projection utility. Two dark green spots on the plot represent OUP of X. fastidiosa 9a5c (at centre point) and P. dioxanivorans CB1190 (on first principal axis) chromosomes. Light green circles depict 1/2 of the distance between patterns calculated for the chromosomes. The island of X. fastidiosa is shown as a red small circle and that of P. dioxanivorans as a blue circle. Islands were plotted along the second principal axis according to distance values between OUP of islands and host chromosomes.
Figure 3.Pre_GI visualization of BLASTN comparison of the DNA sequence of a newly identified island against Pre_GI. The query sequence is depicted by the top red line and a hit subject sequence is represented by the bottom red line. Dark green boxes represent genes. Connecting blue strips between query and subject indicate high-scoring sequence pairs.
Figure 4.Distribution of number of identified islands and estimated events of HT. Each data point represents a bacterial genome. The vertical axis indicates the number of islands predicted in a genome. The amount of distinct acquisitions of the islands for each genome is depicted on the horizontal axis.
Figure 5.Distribution of sequence similarity (horizontal axis) and composition similarity (vertical axis) links stored in Pre_GI. Each spot represents an island. The congregation of spots (islands) result in darker groupings. Two groups of islands are visible: one rich in BLASTN links, and the other poor of sequence similarity links.
Figure 6.Boxplots of OUP similarities between islands of different taxonomical levels. OUP similarity links were grouped into eight categories: Genome—compositional similarity links between islands found in the same genome; Strain—links found between islands in genomes of two different strains of the same species; Species—links different species of the same genus; Genus—different genera of the same family; Family—different families of the same order; Order—different orders of the same phylum; Phylum—different phyla of the same domain; and Domain—OU links between islands found in bacteria of different domains. Statistical values were also calculated for all 69 176 627 OU links stored in Pre_GI. Numbers of OU links in each category are indicated. Horizontal dashed lines depict the mean value for each category. Vertically adjacent boxes represent lower and upper inter-quartiles with the median values as the line of adjoining. Vertical dashed lines with bars correspondingly represent lower and upper outer-quartiles of distribution. The links above and below the outer-quartiles are shown as small open circles.
Bacterial genomes with eclectic GIs
| Species and strain | Phylum |
Short description
| No GIs |
|---|---|---|---|
|
| Bacteroidetes | Gut microflora, symbiont, pathogen | 35 |
|
| Bacteroidetes | Gut microflora, symbiont | 39 |
|
| Actinobacteria | Gut microflora, symbiont | 19 |
|
| Gammaproteobacteria | Soil bacterium | 32 |
|
| Actinobacteria | Pathogen | 18 |
|
| Actinobacteria | Pathogen | 16 |
|
| Deferribacteres | Marine bacterium, biodegradation of pollutants | 10 |
|
| Firmicutes | Soil bacterium, biodegradation of pollutants | 14 |
|
| Deltaproteobacteria | Deep-sea sulfate reducer | 20 |
|
| Firmicutes | Carbon monoxide-utilizing acetogen | 33 |
|
| Bacteroidetes | Filamentous marine bacterium | 28 |
|
| Fibrobacteres | Cellulolytic organism | 24 |
|
| Deltaproteobacteria | Uranium bioremediation organism | 30 |
|
| Acidobacteria | Tundra soil organism | 26 |
|
| Gammaproteobacteria | Marine bacterium producing an algicidal agent | 57 |
|
| Gammaproteobacteria | Marine bacterium | 24 |
|
| Gammaproteobacteria | Marine bacterium | 26 |
|
| Alphaproteobacteria | Polar marine bacterium | 17 |
|
| Alphaproteobacteria | Polar marine bacterium | 12 |
|
| Firmicutes | Soil silicate degrading bacterium | 36 |
|
| Bacteroidetes | Gut microflora, symbionts | 45 |
|
| Bacteroidetes | Oral microflora, symbionts | 14 |
|
| Crenarchaeota | Arsenate-reducing hyperthermophile | 20 |
|
| Crenarchaeota | Hyperthermophile | 19 |
|
| Actinobacteria | Oral microflora, symbionts | 13 |
|
| Gammaproteobacteria | Insect endosymbiont | 17 |
|
| Spirochaetales | Soda lake alkaliphilic anaerobe | 12 |
|
| Bacteroidetes | Free-living non-pathogenic bacterium | 53 |
|
| Gammaproteobacteria | Endosymbiont of marine shipworms | 26 |
a Short description was taken from corresponding NCBI and GOLDCARD bioprojects