| Literature DB >> 31771223 |
YoungJae Hur1, Mauricio Chalita1,2, Sung-Min Ha2, Inwoo Baek3, Jongsik Chun2,3,4.
Abstract
Vibrio cholerae is the causative agent of cholera, which is a severe, life-threatening diarrheal disease. The current seventh pandemic has not been eradicated and the outbreak is still ongoing around the world. The evolution of the pandemic-causing strain has been greatly influenced by lateral gene transfer, and the mechanisms of acquisition of pathogenicity in V. cholerae are mainly involved with genomic islands (GIs). Thus, detecting GIs and their comprehensive information is necessary to understand the continuing resurgence and newly emerging pathogenic V. cholerae strains. In this study, 798 V. cholerae strains were tested using the GI-Scanner algorithm, which was developed to detect candidate GIs and identify them in a comparative genomics approach. The algorithm predicted 435 highly possible genomic islands, and we built a database, called Vibrio cholerae Genomic Island Database (VCGIDB). This database shows advanced results that were acquired from a large genome set using phylogeny-based predictions. Moreover, VCGIDB is a highly expendable database that does not require intensive computation, which enables us to update it with a greater number of genomes using a novel genomic island prediction method. The VCGIDB website allows the user to browse the data and presents the results in a visual manner.Entities:
Keywords: Vibrio cholerae; comparative genomics; database; genomic island
Year: 2019 PMID: 31771223 PMCID: PMC6963734 DOI: 10.3390/pathogens8040261
Source DB: PubMed Journal: Pathogens ISSN: 2076-0817
Three groups of genome sets accumulated and expanded prediction results. Note that genomes from a higher assembly level predicted relatively more genomic islands (GIs).
| Assembly Level | Accumulated Number of Strains | Accumulated Number of Predicted GIs |
|---|---|---|
| Complete or Chromosome | 58 | 198 |
| Scaffold | 277 | 312 |
| Contig | 798 | 435 |
Figure 1Comparison of GI prediction results from different databases or methods. (a) Re-identification failures rate for GI databases: False-negative rates were calculated by failures rate of re-identification of 28 reference GIs from previous study [6]. The strains that were not included in all databases were excluded during the comparison. (b) Prediction results overlapping with other GI databases: All prediction results were collected and clustered in the same genomic context. The GI-Scanner algorithm outperformed due to the massive genome comparison.
Figure 2The hypothetical evolutionary history of the (a) VPI-1 and (b) VSP-1 genomic islands. The 171 selected strains were shown on the phylogeny tree. The pandemic strains and the seventh pandemic strains were shaded with colors. Blue dots were marked where the hypothetical insertion event occurred, and red dots were marked where the hypothetical deletion event occurred. GI containing strain were labeled with blue. The reference strain N16961 was marked with dark brown highlight.