| Literature DB >> 30777818 |
Nicole E Wheeler1,2,3, Timothy Blackmore4, Angela D Reynolds5, Anne C Midwinter5, Jonathan Marshall5, Nigel P French5,6, Matthew S Savoian7, Paul P Gardner2,3,8, Patrick J Biggs5,9,10.
Abstract
Campylobacter jejuni is the most common cause of bacterial diarrheal disease in the world. Clinical outcomes of infection can range from asymptomatic infection to life-threatening extraintestinal infections. This variability in outcomes for infected patients has raised questions as to whether genetic differences between C. jejuni isolates contribute to their likelihood of causing severe disease. In this study, we compare the genomes of ten C. jejuni isolates that were implicated in extraintestinal infections with reference gastrointestinal isolates, in order to identify unusual patterns of sequence variation associated with infection outcome. We identified a collection of genes that display a higher burden of uncommon mutations in invasive isolates compared with gastrointestinal close relatives, including some that have been previously linked to virulence and invasiveness in C. jejuni. Among the top genes identified were mreB and pgp1, which are both involved in determining cell shape. Electron microscopy confirmed morphological differences in isolates carrying unusual sequence variants of these genes, indicating a possible relationship between extraintestinal infection and changes in cell morphology.Entities:
Keywords: Campylobacter jejuni; bacteremia; random forest
Mesh:
Year: 2019 PMID: 30777818 PMCID: PMC6421344 DOI: 10.1099/mgen.0.000251
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.NeighborNet diagram of the genetic similarity of strains included in the study. The scale bar indicates 0.01 substitutions per site in a core gene alignment. Invasive strains are highlighted in red type. Each invasive strain was paired with two gastrointestinal strains, shown in black type, from the Oxfordshire surveillance project from the same clonal complex. Clonal complexes are highlighted in bold blue type.
Fig. 2.A random forest model identifies a subset of genes that show increased sequence divergence in invasive isolates (a): Performance of the random forest model compared with that of 100 models built using permuted labels. Accuracy of the classifier built using true classes is indicated with a red line, and accuracies for the permuted models are shown as a histogram. (b): Feature importance for each gene in a representative random forest model. Grey lines correspond to empirical P-values of 0.05 (dashed line) and 0.1 (dotted line), calculated using feature importance values derived from permuted data.
Genes with the greatest value in discriminating between invasive and gastrointestinal
Reference locus ID refers to the locus ID for the homologous sequence in ATCC 700819. VI: variable importance, as measured by mean decrease in classification accuracy upon permuting a variable. % top: percentage of models that listed this gene among its top predictors.
| Cj0276 | Rod shape-determining protein MreB | 0.007 | 0.011 | 100 | |
| Cj0803 | Lipid export ABC transport protein | 0.003 | 0.026 | 100 | |
| Cj1568c | NADH dehydrogenase subunit L | 0.003 | 0.027 | 100 | |
| Cj0723c | Peptidase, M48 family protein | 0.003 | 0.036 | 100 | |
| Cj1236 | COG1565: Uncharacterized conserved protein | 0.002 | 0.038 | 100 | |
| Cj0584 | Cytochrome c552 precursor | 0.002 | 0.040 | 100 | |
| Cj1357c | DNA polymerase III subunit delta' | 0.002 | 0.040 | 100 | |
| Cj1345c | Glutamate synthase [NADPH] small chain | 0.002 | 0.046 | 100 | |
| Cj1706c | Purine nucleoside phosphorylase | 0.002 | 0.048 | 100 | |
| Cj0612c | LSU ribosomal protein L4p (L1e) | 0.002 | 0.050 | 100 | |
| Cj1476c | Non-heme iron-containing ferritin | 0.002 | 0.054 | 100 | |
| Cj1572c | Pyruvate-flavodoxin oxidoreductase | 0.002 | 0.057 | 100 | |
| Cj1131c | NADH-ubiquinone oxidoreductase chain H | 0.002 | 0.062 | 99 | |
| Cj0041 | Xaa-Pro aminopeptidase | 0.002 | 0.066 | 97 | |
| Cj0653c | UDP-glucose 4-epimerase | 0.002 | 0.067 | 99 | |
| Cj0002 | Flagellar hook-length control protein FliK | 0.002 | 0.067 | 95 | |
| Cj0477 | LSU ribosomal protein L7/L12 (P1/P2) | 0.002 | 0.069 | 96 | |
| Cj1288c | Glutamyl-tRNA synthetase | 0.001 | 0.069 | 94 | |
| Cj0002 | DNA polymerase III subunit beta | 0.001 | 0.073 | 94 | |
| Cj0500 | tRNA 2-selenouridine synthase | 0.001 | 0.082 | 86 | |
| Cj0530 | Periplasmic protein | 0.001 | 0.081 | 84 |
Fig. 3.Partial sequence alignment highlighting unique mutations in nrfA in invasive strains, compared with lineage-specific mutations observed in other regions. The sequence alignment was visualised using Jalview (http://www.jalview.org) and coloured by BLOSUM62 score. The maximum-likelihood tree was reconstructed using RAxML (v.8.2.8) using the GTR-GAMMA model and 100 bootstraps. The PDB structure 1FS7 was visualised using Jmol (www.jmol.org). The highlighted residues are the only unique mutations in sequence alignment and occur in a conserved haem binding site. Haem molecules are highlighted in red.
Fig. 4.Sequence changes in cell shape genes relate to changes in cell morphology. Bitscore values are plotted for extraintestinal and gastrointestinal isolates for two cell-shape-determining proteins identified as associated with invasiveness which appeared in the top scoring genes. High bitscores correspond to strong agreement between individual protein sequences and modelled sequence constraints for that gene. Large differences in bitscore within an ST indicate the accumulation of a mutation/mutations that are not frequently observed in that gene within the . The position of the points is offset slightly (up to 0.3 bits) to allow discrimination of strains with identical scores for each gene. Electron micrographs for representative invasive isolates from this study are shown to illustrate the cell morphologies described. Size scale is the same for each image. Note: one ST50 sample has been excluded from the plot because its score for pgp1 was much lower and distorted the figure.