| Literature DB >> 25096633 |
Bas E Dutilh, Cristiane C Thompson, Ana C P Vicente, Michel A Marin, Clarence Lee, Genivaldo G Z Silva, Robert Schmieder, Bruno G N Andrade, Luciane Chimetto, Daniel Cuevas, Daniel R Garza, Iruka N Okeke, Aaron Oladipo Aboderin, Jessica Spangler, Tristen Ross, Elizabeth A Dinsdale, Fabiano L Thompson, Timothy T Harkins, Robert A Edwards1.
Abstract
BACKGROUND: Vibrio cholerae is a globally dispersed pathogen that has evolved with humans for centuries, but also includes non-pathogenic environmental strains. Here, we identify the genomic variability underlying this remarkable persistence across the three major niche dimensions space, time, and habitat.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25096633 PMCID: PMC4141962 DOI: 10.1186/1471-2164-15-654
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Overview of the strains analyzed in this study. Approximate geographical origin, isolation source and date of 274 V. cholerae strains. See Additional file 1 for details.
Figure 2genome plot. Circos plots [14] of the two major V. cholerae R-18308 scaffolds representing chromosome 1 (top) and chromosome 2 (bottom). From the outer circle inwards: scale; ORFs per strand (colored by functional category, see legend); prophages (orange: small defective prophage; red: CTX; green: PP1; blue: superintegron, SI); and read mappings of the 38 other sequenced genomes (blue, from the middle circle outwards: R-18246, R-18273, R-18303, R-18304, R-18316, R-18317, R-18327, R-18338, R-18348, VC08, VC1005, VC102, VC111, VC120, VC14, VC150, VC172, VC179, VC200, VC201, VC21, VC214, VC22, VC307, VC311, VC33, VC341, VC434, VC46, VC500, VC504, VC75, VC77, VC83, VC833, VC91, VC95, VC998).
Figure 3Phylogenomic tree of genomes. A phylogenomic tree based on genome-wide marker SNPs illustrates the breadth of 274 V. cholerae genomes included in this study. Four complete V. mimicus genomes were included as an outgroup. Branch lengths indicate the number of substitutions per SNP site. Several clusters mentioned in the text are shown. Branches are colored by the continent where the strains were isolated. The isolation source (habitat) of the strains is indicated. All strains belong to the O1 serogroup unless mentioned otherwise. Note that the transcontinental transmission event from South America to Southeast Asia (labeled “D” in reference [3], see Figure one and Supplementary Figure S3 therein) was not confirmed, and we suspect this is due to switching of the labels between strain A390 (Bangladesh 1987) and strain A316 (Argentina 1993) in that article as the positions of those strains are switched in our phylogeny.
Summary of the genotypic variables
| Attribute | Explanation | Before | After | Annotated |
|---|---|---|---|---|
| Protein families | CD-HIT clusters [ | 21,146 | 17,560 | 9,819 |
| Functions | Level-3 subsystems [ | 4,260 | 3,105 | 1,828 |
| SNPs | Marker SNPs [ | 7,880 | 2,545 | 659 |
| Subsystems | Level-3 subsystems [ | 706 | 444 | 398 |
| Phages | Phages [ | 6 | 4 | 4 |
| Clusters | Remove redundancy [ | 0 | 1,647 | 0 |
| Total | 33,998 | 25,305 | 12,708 |
Number of variables is shown before and after the clustering procedure to remove redundancy [6], as well as the number of variables annotated with level-1 subsystems [9]. The full matrix of 25,305 variables used in the manuscript is provided in Additional file 3 and Additional file 4. See text for details.
Figure 4The important subsystems for each niche dimension. Presence of level-1 subsystem categories in the top 5% most important functionally annotated genotypic variables for RFs in three niche dimensions. See Additional file 5 for details, the percentage can be changed in that file to dynamically update the bar chart.
Modified emulsion PCR conditions for mate pair library
| Stage | Step | Temperature | Time |
|---|---|---|---|
| Hold | Denature | 94°C | 6 minutes |
| Cycle (40) | Denature | 94°C | 30 seconds |
| Anneal | 58°C | 30 seconds | |
| Extend | 72°C | 90 seconds | |
| Cycle (20) | Denature | 94°C | 30 seconds |
| Extend | 58°C | 18 minutes | |
| Hold | 4°C | infinity |
The emulsion PCR conditions of the mate pair library construction for Ion Torrent sequencing of V. cholerae strain R-18308.
Phages present in the set of 31 genomes from the SEED database
| Phage | Source | Initially identified in genome | Genomes |
|---|---|---|---|
| CTX | PhAnToMe id 141904.3, RefSeq id NC_015209 | N16961 | 195 |
| RS | PhAnToMe id 141904.3, RefSeq id NC_015209 | N16961 | 115 |
| Kappa | PhAnToMe id 493906.2; RefSeq id NC_010275 | 1587, AM-19226, B33, MJ-1236, MO10, NCTC-8457, RC9, RC27 | 53 |
| WASA | Ref. [ | INDRE-91/1 | 18 |
| Mu-like | Identified using PhiSpy [ | 12129(1), TM-11079-80, TMA-21, V51 | 10 |
| VP882 | Identified using PhiSpy [ | AM-19226, TM-11079-80 | 5 |
The phages in this table were mapped by homology to all V. cholerae genomes (see Methods).