| Literature DB >> 29306353 |
Inès Levade1, Yves Terrat1, Jean-Baptiste Leducq1, Ana A Weil2,3, Leslie M Mayo-Smith2, Fahima Chowdhury4, Ashraful I Khan4, Jacques Boncy5, Josiane Buteau5, Louise C Ivers3,6,7, Edward T Ryan2,3,8, Richelle C Charles2,3, Stephen B Calderwood2,3,9, Firdausi Qadri4, Jason B Harris2,10, Regina C LaRocque2,3, B Jesse Shapiro1.
Abstract
Cholera is a severe, water-borne diarrhoeal disease caused by toxin-producing strains of the bacterium Vibrio cholerae. Comparative genomics has revealed 'waves' of cholera transmission and evolution, in which clones are successively replaced over decades and centuries. However, the extent of V. cholerae genetic diversity within an epidemic or even within an individual patient is poorly understood. Here, we characterized V. cholerae genomic diversity at a micro-epidemiological level within and between individual patients from Bangladesh and Haiti. To capture within-patient diversity, we isolated multiple (8 to 20) V. cholerae colonies from each of eight patients, sequenced their genomes and identified point mutations and gene gain/loss events. We found limited but detectable diversity at the level of point mutations within hosts (zero to three single nucleotide variants within each patient), and comparatively higher gene content variation within hosts (at least one gain/loss event per patient, and up to 103 events in one patient). Much of the gene content variation appeared to be due to gain and loss of phage and plasmids within the V. cholerae population, with occasional exchanges between V. cholerae and other members of the gut microbiota. We also show that certain intra-host variants have phenotypic consequences. For example, the acquisition of a Bacteroides plasmid and non-synonymous mutations in a sensor histidine kinase gene both reduced biofilm formation, an important trait for environmental survival. Together, our results show that V. cholerae is measurably evolving within patients, with possible implications for disease outcomes and transmission dynamics.Entities:
Keywords: Vibrio cholerae; biofilm; comparative genomics; horizontal gene transfer; within-host evolution
Mesh:
Year: 2017 PMID: 29306353 PMCID: PMC5761273 DOI: 10.1099/mgen.0.000142
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Nucleotide and amino acid changes identified in the V. cholerae core genome
| Region | Type* | Isolates | Ref. nucleotide | Alt. nucleotide | Nucleotide position† | NS/S | Protein | Ref. amino acid | Alt. amino acid | Gene annotation | Patient allele frequency‡ |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Bangladesh | iSNV | B1C19 | C | T | CHR1, 746965 | Intergenic | – | – | – | – | 1/19 |
| Bangladesh | iSNV | B4C12 | C | T | CHR1, 2549743 | S | ACQ61457.1 | D | D | RNA polymerase sigma-70 factor | 1/17 |
| Bangladesh | iSNV | B5C11 | T | G | CHR1, 2764922 | S | ACQ61649.1 | V | V | Toxin co-regulated pilus biosynthesis protein F | 1/20 |
| Haiti | iSNV | H1C5 | G | T | CHR1, 2240431 | NS | ACQ61177.1 | R | L | Sensor histidine kinase | 1/9 |
| Haiti | iSNV | H1C4 H1C8 | G | A | CHR1, 1785021 | NS | ACQ60802.1 | R | H | TetR family transcriptional regulator | 2/9 |
| Haiti | iSNV | H1C6 | G | T | CHR1, 2241580 | NS | ACQ61177.1 | R | L | Sensor histidine kinase | 1/9 |
| Bangladesh | Patient | B2+B3 | T | C | CHR1, 1295440 | Intergenic | – | – | – | – | 20/20 |
| Bangladesh | Patient | B4 | C | T | CHR1, 610212 | NS | ACQ59736.1 | G | R | TatD family hydrolase | 17/17 |
| Bangladesh | Patient | B4 | C | T | CHR2, 1007076 | NS | ACQ62879.1 | A | T | Hypothetical protein | 17/17 |
| Bangladesh | Patient | B2+B3 | G | A | CHR1, 1036008 | Intergenic | – | – | – | – | 20/20 |
| Bangladesh | Patient | B4 | C | T | CHR1, 1400553 | Intergenic | – | – | – | – | 17/17 |
| Bangladesh | Patient | B4 | G | A | CHR2, 350409 | NS | ACQ62264.1 | P | L | PTS system mannitol-specific EIICBA component | 17/17 |
| Bangladesh | Patient | B2+B3 | G | A | CHR1, 2301641 | NS | ACQ61224.1 | P | S | LacI family transcription regulator | 20/20 |
| Bangladesh | Patient | B1+B5 | C | T | CHR1, 359133 | N | ACQ59516.1 | A | V | Bifunctional purine biosynthesis protein PurH | 19/19 |
Abbreviations: Ref, reference allele; Alt, alternative allele; NS, non-synonymous; S, synonymous; CHR1, chromosome 1; CHR2, chromosome 2; iSNVs, intra-host single nucleotide variants.
*Mutations segregating within patients are denoted iSNVs; mutations fixed between patients are denoted 'Patient.'
†Nucleotide positions are based on the reference Vibrio cholerae MJ-1236 (GenBank accession numbers CP001485.1 and CP001486.1).
‡Patient allele frequency shows the allele frequency of the alternative (minor) allele.
Fig. 1.Culture and sequencing of isolates of Vibrio cholerae from eight acutely infected patients. To study within-patient evolution, selective media was used to culture stool samples from five patients from Bangladesh (B1 to B5) and three patients from Haiti (H1 to H3). Between eight and 20 colonies were isolated from each patient and sequenced separately. For patient B1, we performed a sub-culture of one isolate (dotted outline) and sequenced 12 of these new isolates as a control for cultured-induced and sequencing artefacts. We independently called variants, compared them between isolates within each patient to identify the intra single nucleotide variants (iSNVs, coloured circles) and determined whether they were intergenic (i), synonymous (S), or non-synonymous (NS) mutations.
Flexible gene content variation within and between patients
| Patient | No. genes fixed within patients | No. genes variable within patients | No. singletons |
|---|---|---|---|
| B1 | 111 | 11 | 5 |
| B2 | 61 | 35 | 0 |
| B3 | 61 | 51 | 0 |
| B4 | 61 | 49 | 0 |
| B5 | 111 | 5 | 0 |
| H1 | 14 | 68 | 0 |
| H2 | 14 | 103 | 25 |
| H3 | 14 | 63 | 0 |
Singletons are defined as genes only found in one isolate, and are also counted as variable genes within patients. Genes fixed within patients are present in all isolates from a patient, but are absent in at least one other isolate in the study.
Fig. 2.Presence/absence profile and taxonomic affiliation of gene families in the flexible genome. Red in the heatmap indicates gene presence; black indicates absence. Each column shows the presence/absence profile for a unique gene family. The heatmap is ordered by patient along the vertical axis. B1C1 is the control, subcultured from patient B1, and contains no flexible genome variation. The horizontal axis is ordered by hierarchical clustering, yielding four clusters: A, B, C and D. The taxonomic affiliation of each gene family (best blast hit) is indicated with dots above the heatmap.
Fig. 3.Bayesian phylogenic tree of 35 V. cholerae genotypes sampled over 3 years in Bangladesh and Haiti. The maximum clade credibility tree represents the genealogy of sequences in the study, reconstructed from concatenated hqSNVs, using beast. Coloured squares (shades of blue and purple) represent the time-course isolates collected from Bangladeshi patients from March 2011 to December 2013 (one isolate per patient). Patients for whom we measured intra-host variation (B1–B5 and H1–H3) are shown as circles. Filled circles indicate the putative ancestral genotype, and empty circles indicate putatively derived iSNVs. The median node age and divergence date in months and years are indicated at the nodes. The blue bars represent the 95 % HPD intervals for divergence time estimates, and posterior probabilities are represented on the branches.
McDonald–Kreitman test for differential selection within and between patients
| Bangladesh | Haiti | |||
|---|---|---|---|---|
| Population | NS | S | NS | S |
| Polymorphic (within patient) | 0 | 2 | 3 | 0 |
| Fixed (between patients) | 5 | 0 | 0 | 0 |
Counts of non-synonymous (NS) and synonymous (S) polymorphic sites (within patient iSNVs) and fixed sites (between patients) for Bangladeshi and Haitian patients.
Fig. 4.Significant excess of non-synonymous iSNVs in patient H1. (a) Distribution of 122 V. cholerae isolates containing different categories of iSNVs (I, intergenic; S, synonymous; Nsyn, non-synonymous) or no detectable iSNVs, according to geographic region (BGD, Bangladesh; HTI, Haiti). Patients from Haiti have a significant excess of Nsyn iSNVs (red; **P≤0.01; 10 000 random permutations of isolates among regions). (b) Distribution of 122 V. cholerae isolates containing different iSNVs, or no detectable iSNVs, by patient. Patient H1 has a significant excess of isolates with Nsyn iSNVs (**P≤0.001; 10 000 random permutations of mutations across patients; Supplementary Methods). ns, not significant.
Fig. 5.Biofilm formation of isolates from patients H1 and H2. Optical density at 595 nm was measured for four to 12 replicates of each isolate, after growth for 48 h at 30 °C. Statistical comparisons were made using a non-parametric Mann-Whitney test (*, P<0.05; ***, P<0.0001). Circles represent genomes with either variation in gene content (dark triangle) or iSNV variation (cross). (a) Isolates from patient H1. Isolate H1C1 represents the ancestral genotype, H1C4 has a non-synonymous mutation in a transcriptional regulator gene, and H1C5 and H1C6 have different non-synonymous mutations in the same gene, the histidine kinase gene. (b) Isolates from patient H2. Isolate H2C5 represents the ancestral genotype, with no variation in the gene content, and H2C3 harbours a plasmid. RBM is a biofilm knockout strain, and LB and saline are negative controls.