| Literature DB >> 25381664 |
William G Miller1, Emma Yee2, Mary H Chapman2, Timothy P L Smith3, James L Bono3, Steven Huynh2, Craig T Parker2, Peter Vandamme4, Khai Luong5, Jonas Korlach5.
Abstract
The Campylobacter lari group is a phylogenetic clade within the epsilon subdivision of the Proteobacteria and is part of the thermotolerant Campylobacter spp., a division within the genus that includes the human pathogen Campylobacter jejuni. The C. lari group is currently composed of five species (C. lari, Campylobacter insulaenigrae, Campylobacter volucris, Campylobacter subantarcticus, and Campylobacter peloridis), as well as a group of strains termed the urease-positive thermophilic Campylobacter (UPTC) and other C. lari-like strains. Here we present the complete genome sequences of 11 C. lari group strains, including the five C. lari group species, four UPTC strains, and a lari-like strain isolated in this study. The genome of C. lari subsp. lari strain RM2100 was described previously. Analysis of the C. lari group genomes indicates that this group is highly related at the genome level. Furthermore, these genomes are strongly syntenic with minor rearrangements occurring only in 4 of the 12 genomes studied. The C. lari group can be bifurcated, based on the flagella and flagellar modification genes. Genomic analysis of the UPTC strains indicated that these organisms are variable but highly similar, closely related to but distinct from C. lari. Additionally, the C. lari group contains multiple genes encoding hemagglutination domain proteins, which are either contingency genes or linked to conserved contingency genes. Many of the features identified in strain RM2100, such as major deficiencies in amino acid biosynthesis and energy metabolism, are conserved across all 12 genomes, suggesting that these common features may play a role in the association of the C. lari group with coastal environments and watersheds. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution 2014. This work is written by US Government employees and is in the public domain in the US.Entities:
Keywords: UPTC; flagella; hemagglutination; methylome
Mesh:
Year: 2014 PMID: 25381664 PMCID: PMC4986449 DOI: 10.1093/gbe/evu249
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Strains Sequenced in This Study
| Coverage | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Strain | Type Strain | Source | Location | Optical Map | 454 | Illumina | PacBio | Total | Reference | Accession Number(s) | |
| Y | Marine mammal | Scotland | Y | 107× | H; 1,347× | N/A | 1,454× | CP007770 | |||
| N | Human | Canada (ONT) | Y | 71× | H; 1,287× | N/A | 1,358× | CP007771 | |||
| UPTC CCUG 22395 | N | Human | France | N | 61× | H; 1,478× | N/A | 1,539× | N/A | CP007776 | |
| UPTC NCTC 11845 | N | River water | United Kingdom | Y | 123× | H; 1,048× | 182× | 1,352× | N/A | CP007775 | |
| UPTC RM16701 | N | River water | United States (CA) | N | 82× | M; 708× | N/A | 790× | N/A | CP007777 | |
| UPTC RM16712 | N | River water | United States (CA) | N | 110× | M; 551× | N/A | 661× | N/A | CP007778 | |
| Y | Shellfish | The Netherlands | Y | 63× | H; 1,175× | 222× | 1,459× | CP007766CP007767 (pPEL1)CP007768 (pPEL2) | |||
| N | Gentoo penguin | S. Georgia, Antarctica | Y | 35× | H; 1,146× | 141× | 1,323× | CP007772 | |||
| Y | Gray-headed albatross | S. Georgia, Antarctica | Y | 136× | H; 1,086× | 138× | 1,360× | CP007773 | |||
| N | Black-headed gull | Sweden | Y | 57× | H; 1,381× | N/A | 1,438× | CP007774 | |||
| N | River water | United States (CA) | N | 84× | M; 475× | 164× | 723× | N/A | CP007769 | ||
Note.—H, HiSeq sequencing; M, MiSeq sequencing; N/A, not applicable.
aIf Campylobacter spp. strain RM16704 represents a novel species, then it would likely be designated as the type strain.
General Features of the Campylobacter lari Group Genomes
| UPTC NCTC 11845 | UPTC CCUG 22395 | UPTC RM16701 | UPTC RM16712 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Chromosome | |||||||||||||
| Size (kb) | 1,525.46 | 1,502.10 | 1,791.51 | 1,523.03 | 1,516.46 | 1,565.01 | 1,782.54 | 1,853.00 | 1,557.54 | 1,711.37 | 1,517.95 | 1,465.08 | |
| % G+C content | 29.70 | 29.73 | 29.36 | 29.86 | 29.90 | 29.74 | 29.94 | 29.75 | 28.47 | 28.51 | 28.57 | 28.19 | |
| CDS numbers | 1,495 | 1,451 | 1,702 | 1,481 | 1,475 | 1,507 | 1,675 | 1,770 | 1,490 | 1,591 | 1,478 | 1,440 | |
| Assigned function (% CDS) | 836 (56) | 821 (57) | 837 (49) | 835 (56) | 822 (56) | 834 (55) | 840 (50) | 832 (47) | 817 (55) | 827 (52) | 830 (56) | 812 (56) | |
| Pseudogenes | 18 | 20 | 21 | 15 | 9 | 20 | 51 | 56 | 30 | 19 | 19 | 22 | |
| General function (% CDS) | 373 (25) | 378 (26) | 426 (25) | 390 (26) | 387 (26) | 382 (25) | 458 (27) | 485 (27) | 388 (26) | 404 (25) | 383 (26) | 360 (25) | |
| Hypothetical (% CDS) | 286 (19) | 252 (17) | 439 (26) | 256 (17) | 266 (18) | 291 (19) | 377 (23) | 453 (26) | 285 (19) | 360 (23) | 265 (18) | 268 (19) | |
| Prophage/genetic islands | 1 | 1 | 2 | 2 | 1 | 2 | 2 | 3 | 0 | 2 | 1 | 1 | |
| Ribosomal RNA operons | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | |
| CRISPR | N | N | Y | Y | N | N | Y | N | Y | Y | N | N | |
| G:C tracts ≥8 nt (# HV) | 15 (15) | 15 (14) | 23 (22) | 10 (10) | 14 (14) | 18 (16) | 50 (46) | 44 (43) | 41 (39) | 26 (26) | 17 (17) | 27 (23) | |
| Plasmids (size kb) | 46.2 | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | 47.8; 3.6 | N/A | N/A | |
| Gene classes | |||||||||||||
| Signal transduction | |||||||||||||
| Che/Mot proteins | 8 | 8 | 8 | 8 | 8 | 8 | 8 | 8 | 8 | 8 | 8 | 8 | |
| MCP | 12 | 11 | 14 | 14 | 14 | 11 | 10 | 10 | 9 | 13 | 9 | 8 | |
| 2CS response regulator | 6 | 6 | 6 | 6 | 5 | 6 | 6 | 6 | 6 | 6 | 6 | 4 | |
| 2CS histidine kinase | 6 | 6 | 6 | 6 | 5 | 6 | 6 | 6 | 6 | 6 | 6 | 4 | |
| Other | 7 | 7 | 7 | 7 | 8 | 6 | 7 | 6 | 6 | 6 | 7 | 6 | |
| R/M systems | |||||||||||||
| Type I ( | 0 | 0 | 0 | 1 | 0 | 1 | 2 | 0 | 1 | 1 | 0 | 0 | |
| Type II/IIS | 2 | 1 | 1 | 1 | 2 | 2 | 2 | 3 | 2 | 3 | 4 | 3 | |
| Type III | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | |
| Other ( | 1 | 1 | 1 | 0 | 2 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | |
| DNA methylases | 2 | 3 | 3 | 4 | 4 | 3 | 4 | 4 | 2 | 3 | 1 | 3 | |
| Transcription | |||||||||||||
| Regulatory proteins | 18 | 17 | 16 | 16 | 17 | 17 | 13 | 18 | 15 | 15 | 12 | 14 | |
| σ factors | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | |
| Hag proteins | 1 | 3 | 7 | 2 | 3 | 2 | 3 | 3 | 10 | 15 | 0 | 0 | |
| Motility | |||||||||||||
| FlaAB (class; orient.) | 1;→→ | 1;→→ | 2;→→ | 2;←→ | 2;←→ | 2;←→ | 1;→→ | 1;→→ | 2;←→ | 2;←→ | 1;→→ | 1;→→ | |
| MAF (class) | 1 | 1 | 2 | 2 | 2 | 2 | 1 | 1 | 2 | 2 | 1 | 1 | |
| Pse/Leg | Y/Y | Y/(Y) | Y/N | Y/N | Y/N | Y/N | Y/Y | Y/Y | Y/N | Y/N | Y/Y | Y/Y | |
Note.—HV, hypervariable; Che/Mot, chemotaxis/motility; MCP, methyl-accepting chemotaxis protein; 2CS, two component system; MAF, motility accessory factor; Pse/Leg, pseudaminic acid/legionaminic acid.
aCDS numbers do not include pseudogenes.
bcas1 gene nonfunctional.
FBRIG plot of the Campylobacter lari group. The BRIG image was created using BLASTN and a minimum default threshold of 50%. The reference strain was C. lari subsp. lari strain RM2100. White areas correspond to sequences with similarity values below the minimum threshold.
FColinearity of the Campylobacter lari group genomes. Each core protein in the C. lari subsp. lari strain RM2100 genome (x axis) was compared with the core proteins of other C. lari group members and with those of C. jejuni strain RM11168 (y axis) by BLASTP analysis. Each protein represents a match above 50% similarity. The x and y axis values represent gene numbers.
Average Amino Acid Identities of the Campylobacter lari Group Core Proteome
Note.—To more easily visualize amino acid similarities of the proteomes, we utilized a gradient heat map with black = 100%.
Characteristics of the Campylobacter lari Group Flagellar and Flagellar Modification Genes
Note.—aa, amino acids; (1) indicates a partial maf gene; truncated flagellar subunits are shaded light grey; Y (shaded grey), putative flagellar modification genes encoded by the C. lari group genomes. In five genomes, the missing LegC function may be substituted by PglE, thus these are also shaded grey.