| Literature DB >> 34427503 |
Inès Levade1,2, Ashraful I Khan3, Fahima Chowdhury3, Stephen B Calderwood4,5,6, Edward T Ryan4,5, Jason B Harris4,7, Regina C LaRocque4,5, Taufiqur R Bhuiyan3, Firdausi Qadri3, Ana A Weil8, B Jesse Shapiro1,2,9.
Abstract
Vibrio cholerae can cause a range of symptoms, from severe diarrhea to asymptomatic infection. Previous studies using whole-genome sequencing (WGS) of multiple bacterial isolates per patient showed that V. cholerae can evolve modest genetic diversity during symptomatic infection. To further explore the extent of V. cholerae within-host diversity, we applied culture-based WGS and metagenomics to a cohort of both symptomatic and asymptomatic cholera patients from Bangladesh. While metagenomics allowed us to detect more mutations in symptomatic patients, WGS of cultured isolates was necessary to detect V. cholerae diversity in asymptomatic carriers, likely due to their low V. cholerae load. Using both metagenomics and isolate WGS, we report three lines of evidence that V. cholerae hypermutators evolve within patients. First, we identified nonsynonymous mutations in V. cholerae DNA repair genes in 5 out of 11 patient metagenomes sequenced with sufficient coverage of the V. cholerae genome and in 1 of 3 patients with isolate genomes sequenced. Second, these mutations in DNA repair genes tended to be accompanied by an excess of intrahost single nucleotide variants (iSNVs). Third, these iSNVs were enriched in transversion mutations, a known hallmark of hypermutator phenotypes. While hypermutators appeared to generate mostly selectively neutral mutations, nonmutators showed signs of convergent mutation across multiple patients, suggesting V. cholerae adaptation within hosts. Our results highlight the power and limitations of metagenomics combined with isolate sequencing to characterize within-patient diversity in acute V. cholerae infections, while providing evidence for hypermutator phenotypes within cholera patients. IMPORTANCE Pathogen evolution within patients can impact phenotypes such as drug resistance and virulence, potentially affecting clinical outcomes. V. cholerae infection can result in life-threatening diarrheal disease or asymptomatic infection. Here, we describe whole-genome sequencing of V. cholerae isolates and culture-free metagenomic sequencing from stool of symptomatic cholera patients and asymptomatic carriers. Despite the typically short duration of cholera, we found evidence for adaptive mutations in the V. cholerae genome that occur independently and repeatedly within multiple symptomatic patients. We also identified V. cholerae hypermutator phenotypes within several patients, which appear to generate mainly neutral or deleterious mutations. Our work sets the stage for future studies of the role of hypermutators and within-patient evolution in explaining the variation from asymptomatic carriage to symptomatic cholera.Entities:
Keywords: Vibrio cholerae; asymptomatic carriage; cholera; convergent evolution; genomics; hypermutation; intrahost diversity; metagenomics; natural selection; population genetics; within-patient evolution
Year: 2021 PMID: 34427503 PMCID: PMC8407408 DOI: 10.1128/mSystems.00889-21
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
FIG 1Summary of the culture-dependent and culture-free metagenomics workflows for the characterization of Vibrio cholerae within-patient diversity. Stool or rectal swab samples were collected from symptomatic and asymptomatic Vibrio cholerae-infected individuals and processed using two different approaches: culture, DNA extraction, and whole-genome sequencing of multiple isolates per patient (A); and genome-resolved metagenomics involving DNA extraction directly from a microbiome sample, followed by sequencing, assembly, genome binning, and dereplication to generate metagenome-assembled genomes (MAGs), and within-host diversity profiling by mapping reads back to the MAGs (B).
Within-patient V. cholerae diversity profiles from 15 metagenomes
| Patient and/or day | Total no. of iSNVs | No. of nonsynonymous iSNVs | No. of synonymous iSNVs | No. of intergenic iSNVs | Mean coverage (×) | iRep value | DNA repair and proofreading genes with NS mutation |
|---|---|---|---|---|---|---|---|
| A | 93 | 6 | 0 | 87 | 451.3 | 3.34 | |
| B | 18 | 7 | 5 | 6 | 111.4 | 1.7 | |
| C | 6 | 0 | 1 | 5 | 111.8 | 1.7 | |
| D | 41 | 22 | 9 | 10 | 10 | 5.43 | DNA polymerase II |
| E | |||||||
| Day 1 | 8 | 2 | 1 | 5 | 351 | 3.25 | |
| Day 2 | 21 | 7 | 1 | 13 | 258 | 1.23 | |
| F | 207 | 133 | 47 | 27 | 18.2 | 2.48 | DNA mismatch repair endonuclease MutL; nuclease SbcCD subunit C |
| G | 16 | 12 | 3 | 1 | 7.7 | 1.73 | |
| H | 32 | 21 | 11 | 0 | 98.5 | 4.75 | Excinuclease ABC subunit UvrB |
| I | 75 | 55 | 20 | 0 | 13 | 2.79 | MutT/nudix family protein |
| J | 6 | 1 | 0 | 5 | 424.6 | 1.84 | |
| K | 25 | 13 | 6 | 6 | 18 | 1.69 | Formamidopyrimidine-DNA glycosylase mutM |
| L | 13 | 9 | 1 | 3 | 164.4 | 2.67 | |
| M | 2 | 0 | 1 | 1 | 113 | 2.65 | |
| N | 7 | 2 | 1 | 3 | 6.7 | 2.27 |
Mutations segregating within patients are denoted iSNVs. The number of iSNVs and mean coverage values were computed with InStrain (27), and replication rates were determined with iRep (39).
FIG 2Within-patient Vibrio cholerae diversity quantified from metagenomic data. (A) Minor allele frequency and distribution of intergenic, synonymous, and nonsynonymous iSNVs across the two Vibrio cholerae chromosomes for 14 patients with over 5× coverage of the V. cholerae genome. (B) Numbers of intergenic, synonymous, and nonsynonymous iSNVs for each patient. (C) Spectrum of within-patient mutation in hypermutators compared to nonmutators. Error bars represent standard errors of the mean. Only samples with 6 or more iSNVs were included to reduce noise from low counts.
Set of genes with convergent mutations identified in more than one patient
| Protein (UniProt ID) | Mutation(s) | |||||||
|---|---|---|---|---|---|---|---|---|
| Patient A | Patient B |
| Patient E |
|
|
|
| |
| Hemolysin (VC cytolysin) ( | NS (0.22) | 3 NS (0.22-0.43) | ||||||
| 2-Aminoethylphosphonate ABC transporter ferric-binding protein ( | NS (0.05) | NS (0.05) | ||||||
| Peptidase B ( | NS (0.33) | NS (0.09) | ||||||
| Nuclease SbcCD subunit C ( | S (0.28) | NS (0.09) | ||||||
| C4-dicarboxylate transport sensor protein ( | NS (0.08) | NS (0.11) | ||||||
| Zinc/cadmium/mercury/lead-transporting ATPase ( | NS (0.08) | NS (0.06) | ||||||
| Hypothetical protein ( | NS (0.14) | NS (0.14) | ||||||
| Hypothetical protein ( | NS (0.33) | NS (0.11) | ||||||
| Formamidopyrimidine-DNA glycosylase mutM ( | S (0.18) | NS (0.08) | ||||||
| Phosphoribosylformylglycinamidine synthase ( | NS (0.06) | S (0.08) | ||||||
The presence of a synonymous or nonsynonymous iSNV in each gene and each patient is indicated by S or NS, respectively, and the minor allele frequency is shown in parentheses. None of the mutations were found at the same nucleotide or codon position. Underlined patient designations indicate patients containing likely hypermutators. Only genes and patients containing more than one mutated gene are shown.
FIG 3Phylogeny and pangenome of 48 Vibrio cholerae isolates from index cases and their asymptomatic contacts. The phylogeny was inferred using maximum parsimony. The percentages of replicate trees in which the associated taxa clustered together in the bootstrap test (1,000 replicates) are shown next to the branches. Filled circles represent isolates from index cases, and empty circles represent isolates from their asymptomatic contacts. Patient N is the index case of household 58. All other index cases are labeled “.00”, with household contacts labeled “.01” and “.02”. The heat map of gene presence-absence is based on 106 genes in the flexible genome. Colored blocks in the heat map indicate gene presence; white indicates gene absence. Each row corresponds to an isolate from the phylogenetic tree, and each column represents an orthologous gene family. Each unique color represents a different individual. Different contigs are represented as alternating blocks of black and light gray.
Nucleotide changes identified in core genes of the V. cholerae isolates from index cases [patients 56.00, 57.00, and 58.00 (N)] and their asymptomatic contacts
| Type | Isolate(s) | Mutation type | Nucleotide position in MJ-1236 | Ref nucleotide | Alt nucleotide | Gene annotation | Patients with metagenomic samples with same variant |
|---|---|---|---|---|---|---|---|
| iSNV | 58.01d7C1 | NS | Chr1, 53054 | G | A | DNA mismatch repair protein MutS | |
| SNP | Households 56 and 57 | S | Chr1, 198988 | G | A | MSHA biogenesis protein MshQ | |
| iSNV | 58.01d7C1 | NS | Chr1, 209665 | G | A | MSHA biogenesis protein MshN | |
| iSNV | 56.00C4 | NS | Chr1, 374172 | C | T | UDP- | |
| SNP | Household 58 | NS | Chr1, 410638 | G | A | Phosphopantetheine adenylyltransferase | M, N |
| SNP | Households 56 and 57 | NS | Chr1, 754154 | C | T | 1,4-Dihydroxy-2-naphthoate polyprenyltransferase | |
| SNP | Household 58 | S | Chr1, 841538 | C | T | SSU ribosomal protein S4p | L, M, N |
| SNP | Household 58 | S | Chr1, 1315021 | T | G | Exported zinc metalloprotease YfgC precursor | L, M, N |
| iSNV | 58.02C1 | S | Chr1, 1576083 | C | A | Periplasmic thiol:disulfide oxidoreductase DsbB | |
| SNP | Patient N | NS | Chr1, 1689779 | A | C | Sigma-54 dependent transcriptional regulator | |
| SNP | Contacts 58.01 and 58.02 | NS | Chr1, 2301641 | G | A | Putative membrane protein | |
| iSNV | 58.01d7C1 | NS | Chr1, 1744854 | C | T | Hypothetical protein | |
| SNP | Contacts 58.01 and 58.02 | NS | Chr1, 2262202 | A | G | Serine transporter | |
| SNP | Households 56 and 57 | NS | Chr1, 2301641 | C | T | LacI family DNA-binding transcriptional regulator | D, J, K |
| iSNV | 57.00C5 | NS | Chr1, 2509468 | C | T | Cyclic-di-GMP-modulating response regulator | |
| iSNV | 56.01C1 | NS | Chr1, 2588496 | C | T | Amidophosphoribosyltransferase | |
| iSNV | 58.01d7C1 | NS | Chr1, 2693815 | C | T | PTS system, trehalose-specific IIB component | |
| SNP | Household 58 | NS | Chr1, 2806858 | A | T | Citrate lyase alpha chain | L, M, N |
| iSNV | 56.00C1 | S | Chr1, 3037471 | A | G | Hypothetical protein | |
| SNP | Patient N | NS | Chr1, 3059131 | C | T | DNA polymerase V (UmuC) | |
| SNP | Households 56 and 57 | NS | Chr1, 3095039 | G | A | Outer membrane protein OmpU | D, F, G, I, J, K |
| SNP | Contacts 58.01 and 58.02 | S | Chr1, 3105102 | C | T | Glutamate-1-semialdehyde aminotransferase | |
| iSNV | 58.01d7C1 | NS | Chr2, 528409 | C | T | Vibriolysin, extracellular zinc protease |
Genome position is according to the MJ-1236 reference genome (CP001485.1, CP001486.1). Mutations segregating within patients are denoted iSNVs; mutations fixed between patients are denoted SNPs. SNPs fixed within all members of one or more households are also designated household SNPs. Patient allele frequency shows the allele frequency of the alternative (minor) allele. Ref, reference allele; Alt, alternative allele; NS, nonsynonymous; S, synonymous; Chr1, chromosome 1; Chr2, chromosome 2; MSHA, mannose-sensitive hemagglutinin; SSU, small subunit; PTS, phosphotransferase system.
Flexible gene content variation within and between patients
| Patient | No. of genes fixed within patients | No. of genes variable within patients | No. of singletons |
|---|---|---|---|
| 56.00 | 88 | 6 | 0 |
| 56.01 | 86 | 10 | 0 |
| 57.00 | 87 | 8 | 0 |
| 57.01 | 87 | 8 | 0 |
| 58.00 (N) | 62 | 9 | 0 |
| 58.01 | 36 | 65 | 2 |
| 58.02 | 39 | 67 | 1 |
Singletons are defined as genes found in only one isolate and are also counted as variable genes within patients. Genes fixed within patients are present in all isolates from a patient but are absent in at least one other isolate in the study.