| Literature DB >> 28713351 |
Cody J Buchanan1,2, Andrew L Webb1, Steven K Mutschall1, Peter Kruczkiewicz1, Dillon O R Barker1,2, Benjamin M Hetman1, Victor P J Gannon1, D Wade Abbott3, James E Thomas2, G Douglas Inglis3, Eduardo N Taboada1.
Abstract
Campylobacter jejuni is a leading human enteric pathogen worldwide and despite an improved understanding of its biology, ecology, and epidemiology, limited tools exist for identifying strains that are likely to cause disease. In the current study, we used subtyping data in a database representing over 24,000 isolates collected through various surveillance projects in Canada to identify 166 representative genomes from prevalent C. jejuni subtypes for whole genome sequencing. The sequence data was used in a genome-wide association study (GWAS) aimed at identifying accessory gene markers associated with clinically related C. jejuni subtypes. Prospective markers (n = 28) were then validated against a large number (n = 3,902) of clinically associated and non-clinically associated genomes from a variety of sources. A total of 25 genes, including six sets of genetically linked genes, were identified as robust putative diagnostic markers for clinically related C. jejuni subtypes. Although some of the genes identified in this study have been previously shown to play a role in important processes such as iron acquisition and vitamin B5 biosynthesis, others have unknown function or are unique to the current study and warrant further investigation. As few as four of these markers could be used in combination to detect up to 90% of clinically associated isolates in the validation dataset, and such markers could form the basis for a screening assay to rapidly identify strains that pose an increased risk to public health. The results of the current study are consistent with the notion that specific groups of C. jejuni strains of interest are defined by the presence of specific accessory genes.Entities:
Keywords: Campylobacter jejuni; clinical association; genome sequence; genome-wide association study; linkage analysis; molecular marker discovery; molecular risk assessment
Year: 2017 PMID: 28713351 PMCID: PMC5492696 DOI: 10.3389/fmicb.2017.01224
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Epidemiological characteristics of 34 CGF subtypes targeted for whole-genome sequencingbased on the Canadian Campylobacter Comparative Genomic Fingerprinting Database (C3GFdb).
| Proportion of isolates in subtype (%)4 | |||||||
|---|---|---|---|---|---|---|---|
| CGF Subtype | Cohort1 | Cluster Size2 | Cluster Rank3 | H | A | E | U |
| 0169.001.002 | CA | 837 | 1 | 30.8% | 62.7% | 6.3% | 0.1% |
| 0695.006.001 | UN | 709 | 2 | 13.7% | 80.4% | 5.9% | 0.0% |
| 0083.001.002 | UN | 655 | 3 | 15.9% | 83.2% | 0.8% | 0.2% |
| 0926.002.001 | UN | 636 | 4 | 9.0% | 74.2% | 16.8% | 0.0% |
| 0044.003.001 | CA | 373 | 6 | 59.0% | 40.5% | 0.5% | 0.0% |
| 0957.001.001 | UN | 349 | 7 | 5.7% | 69.6% | 24.6% | 0.0% |
| 0853.011.001 | UN | 272 | 9 | 12.5% | 87.1% | 0.4% | 0.0% |
| 0882.005.001 | UN | 265 | 10 | 6.8% | 81.1% | 9.4% | 2.6% |
| 0982.001.002 | CA | 249 | 11 | 26.9% | 68.3% | 4.8% | 0.0% |
| 0811.009.002 | NCA | 237 | 12 | 4.2% | 43.9% | 51.9% | 0.0% |
| 0735.005.001 | UN | 232 | 13 | 19.8% | 66.8% | 13.4% | 0.0% |
| 0253.004.001 | UN | 230 | 14 | 21.7% | 75.2% | 3.0% | 0.0% |
| 0960.007.001 | UN | 224 | 15 | 15.2% | 76.8% | 5.4% | 2.7% |
| 0731.001.005 | UN | 221 | 16 | 13.6% | 81.9% | 4.5% | 0.0% |
| 0923.002.001 | UN | 206 | 18 | 8.3% | 61.7% | 30.1% | 0.0% |
| 0269.004.001 | CA | 154 | 24 | 36.4% | 63.6% | 0.0% | 0.0% |
| 0811.008.001 | NCA | 135 | 26.5 | 0.7% | 45.9% | 53.3% | 0.0% |
| 0173.004.001 | CA | 123 | 31 | 30.9% | 57.7% | 11.4% | 0.0% |
| 0173.002.004 | UN | 116 | 32.5 | 23.3% | 76.7% | 0.0% | 0.0% |
| 0933.004.002 | CA | 108 | 36 | 29.6% | 65.7% | 4.6% | 0.0% |
| 0893.001.001 | UN | 102 | 37 | 9.8% | 82.4% | 7.8% | 0.0% |
| 0933.008.001 | UN | 98 | 40 | 22.4% | 75.5% | 2.0% | 0.0% |
| 0949.001.002 | UN | 94 | 41 | 16.0% | 72.3% | 11.7% | 0.0% |
| 0960.003.002 | UN | 92 | 42.5 | 13.0% | 67.4% | 19.6% | 0.0% |
| 0904.002.002 | UN | 87 | 44 | 11.5% | 74.7% | 12.6% | 1.1% |
| 0103.001.002 | CA | 82 | 48.5 | 32.9% | 67.1% | 0.0% | 0.0% |
| 0077.001.003 | CA | 81 | 51 | 33.3% | 66.7% | 0.0% | 0.0% |
| 0238.007.002 | UN | 75 | 55.5 | 20.0% | 80.0% | 0.0% | 0.0% |
| 0260.007.001 | UN | 75 | 55.5 | 21.3% | 78.7% | 0.0% | 0.0% |
| 0844.001.001 | NCA | 68 | 63 | 1.5% | 23.5% | 75.0% | 0.0% |
| 0253.001.002 | CA | 61 | 69.5 | 29.5% | 68.9% | 0.0% | 1.6% |
| 0535.001.003 | UN | 55 | 76.5 | 9.1% | 36.4% | 54.5% | 0.0% |
| 0817.003.001 | NCA | 55 | 76.5 | 0.0% | 20.0% | 80.0% | 0.0% |
| 0083.007.001 | CA | 51 | 81 | 58.8% | 41.2% | 0.0% | 0.0% |
Significant genes observed after GWAS analysis of genome sequences from representative Clinically Associated (CA) and Non-Clinically Associated (NCA) C. jejuni subtypes.
| Marker | Raw | Holm-corrected2 | 11168 Ortholog | Gene name | Function | Linkage group |
|---|---|---|---|---|---|---|
| 11168_00051 | 4.29E-10 | 8.39E-07 | Hypothetical protein | LG1 | ||
| 11168_00052 | 5.28E-10 | 1.03E-06 | Hypothetical protein | |||
| 11168_00169 | 3.36E-11 | 6.61E-08 | Putative iron transport protein | LG2 | ||
| 11168_00170 | 3.36E-11 | 6.61E-08 | Putative TonB-denpendent outer membrane receptor | |||
| 11168_00171 | 3.36E-11 | 6.60E-08 | Biopolymer transport protein | |||
| 11168_00172 | 3.36E-11 | 6.60E-08 | Biopolymer transport protein | |||
| 11168_00173 | 3.36E-11 | 6.60E-08 | TonB transport protein | |||
| 11168_00230 | 6.12E-19 | 1.21E-15 | Putative MCP-domain signal transduction protein | |||
| 11168_00243 | 6.48E-34 | 1.28E-30 | Putative dihydroorotase | LG3 | ||
| 11168_00244 | 3.10E-27 | 6.14E-24 | Small hydrophobic protein | |||
| 11168_00248 | 6.57E-25 | 1.30E-21 | Putative molybdopterin containing oxidoreductase | LG4 | ||
| 11168_00249 | 6.57E-25 | 1.30E-21 | Putative cytochrome C-type haem-binding Periplasmic protein | |||
| 11168_00277 | 1.30E-17 | 2.57E-14 | Putative acetyltransferease | LG5 | ||
| 11168_00278 | 1.35E-18 | 2.66E-15 | Aspartate 1-decarboxylase precursor | |||
| 11168_00279 | 1.35E-18 | 2.66E-15 | Pantoate-beta-alanine ligase | |||
| 11168_00280 | 1.35E-18 | 2.66E-15 | 3-methyl-2-oxobutanoate hydroxymethyltransferase | |||
| 11168_00281 | 1.09E-16 | 2.15E-13 | Putative periplasmic beta-lactamase | |||
| 11168_00703 | 6.98E-24 | 1.38E-20 | Putative ABC transport system permease | |||
| 11168_00718 | 3.36E-11 | 6.59E-08 | TonB transport protein | LG6 | ||
| 11168_00719 | 3.36E-11 | 6.59E-08 | Ferric enterobactin uptake receptor | |||
| 11168_01072 | 4.90E-11 | 9.59E-08 | Putative integral membrane protein. | |||
| 11168_01201 | 6.12E-19 | 1.21E-15 | Putative isomerase | |||
| 11168_01309 | 5.30E-15 | 1.04E-11 | Putative secreted serine protease | |||
| 11168_01519 | 4.29E-10 | 8.39E-07 | Putative oxidoreductase | |||
| 11168_01610 | 4.29E-10 | 8.38E-07 | Hypothetical protein | |||
| 06_2866_00597 | 6.89E-28 | 1.36E-24 | Di-/tripeptide transporter | |||
| 06_7515_00723 | 4.19E-16 | 8.24E-13 | Prophage Lp2 protein 6 | |||
| 07_0675_00227 | 2.62E-11 | 5.15E-08 | Elongation factor G | |||