| Literature DB >> 18159241 |
William G Miller1, Craig T Parker, Marc Rubenfield, George L Mendz, Marc M S M Wösten, David W Ussery, John F Stolz, Tim T Binnewies, Peter F Hallin, Guilin Wang, Joel A Malek, Andrea Rogosin, Larry H Stanker, Robert E Mandrell.
Abstract
BACKGROUND: Arcobacter butzleri is a member of the epsilon subdivision of the Proteobacteria and a close taxonomic relative of established pathogens, such as Campylobacter jejuni and Helicobacter pylori. Here we present the complete genome sequence of the human clinical isolate, A. butzleri strain RM4018. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2007 PMID: 18159241 PMCID: PMC2147049 DOI: 10.1371/journal.pone.0001358
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Features of the Arcobacter butzleri RM4018 genome
| General features | Number or % of total |
| Chromosome size (bp) | 2,341,251 |
| G+C content | 27.05% |
| CDS numbers | 2259 |
| Assigned function | 1011 (45%) |
| Pseudogenes | 5 |
| General function | 505 (22%) |
| Conserved hypothetical/hypothetical | 743 (33%) |
| Prophage | 1 |
| Genetic islands | 3 |
| Ribosomal RNA operons | 5 |
| Plasmids | 0 |
| IS elements | 0 |
| Poly GC tracts | 0 |
|
| |
| Chemotaxis proteins | 46 |
| Che/Mot proteins | 11 |
| Methyl-accepting chemotaxis proteins | 29 |
| Redox-sensing PAS domain proteins | 3 |
| Cyclic diguanylic acid proteins | 25 |
| Restriction/modification systems | |
| Type I | 0 |
| Type II/IIS | 0 |
| Type III | 0 |
| Transcriptional regulators | |
| Regulatory proteins | 36 |
| Non-ECF family σ factors | 1 |
| ECF family σ/anti-σ factor pairs | 7 |
| Two-component systems | |
| Response regulator | 42 |
| Sensor histidine kinase | 37 |
|
| |
|
| |
| Within | 1754 (77.6%) |
|
| 98 (4.3%) |
|
| 10 (0.4%) |
|
| 133 (5.9%) |
|
| 56 (24.8%) |
| Only in non | 190 (8.4%) |
| Unique | 315 (13.9%) |
Total does not include pseudogenes.
One hsdM pseudogene and an hsdS/hsdM pair are present but no complete Type I R/M system (including hsdR) is present.
Similarity of predicted A. butzleri proteins to proteins from other taxa.
| Taxon | Best match | Within best 5 matches | |
| # | % | % | |
|
| 1390 | 71.50 | 63.43 |
|
| 258 | 13.27 | 22.54 |
|
| 19 | 0.98 | 1.48 |
|
| 47 | 2.42 | 3.88 |
|
| 50 | 2.57 | 4.73 |
|
| 60 | 3.09 | 4.24 |
|
| 19 | 0.98 | 1.90 |
|
| 35 | 1.80 | 3.55 |
|
| 15 | 0.77 | 1.59 |
|
| 10 | 0.51 | 1.00 |
| Other | 3 | 0.15 | 0.17 |
|
| 603 | 31.02 | 21.21 |
|
| 4 | 0.21 | 0.59 |
|
| 21 | 1.08 | 1.98 |
|
| 3 | 0.15 | 0.75 |
|
| 335 | 17.23 | 9.56 |
|
| 240 | 12.35 | 8.31 |
| Other | 0 | 0.00 | 0.02 |
|
| 47 | 2.42 | 3.33 |
| Unclassified | 482 | 24.79 | 16.34 |
|
| 240 | 12.35 | 7.98 |
|
| 235 | 12.09 | 7.96 |
| Other | 7 | 0.36 | 0.41 |
| Other | 395 | 20.32 | 24.53 |
| α- | 30 | 1.54 | 2.47 |
|
| 47 | 2.42 | 3.11 |
|
| 34 | 1.75 | 2.15 |
| Other | 13 | 0.67 | 0.96 |
|
| 242 | 12.45 | 14.30 |
|
| 60 | 3.09 | 4.06 |
|
| 19 | 0.98 | 1.60 |
|
| 36 | 1.85 | 1.58 |
|
| 34 | 1.75 | 2.27 |
|
| 22 | 1.13 | 0.75 |
|
| 26 | 1.34 | 1.68 |
| Other | 45 | 2.31 | 2.35 |
|
| 68 | 3.50 | 4.23 |
|
| 46 | 2.37 | 2.64 |
| Other | 22 | 1.13 | 1.59 |
|
| 3 | 0.15 | 0.35 |
|
| 5 | 0.26 | 0.38 |
|
| 55 | 2.83 | 3.82 |
|
| 14 | 0.72 | 1.22 |
| Other | 41 | 2.11 | 2.60 |
|
| 1 | 0.05 | 0.25 |
|
| 3 | 0.15 | 0.25 |
|
| 11 | 0.57 | 0.94 |
|
| 2 | 0.10 | 0.06 |
|
| 2 | 0.10 | 0.17 |
|
| 48 | 2.47 | 3.19 |
|
| 14 | 0.72 | 1.28 |
|
| 30 | 1.54 | 1.61 |
| Other | 4 | 0.21 | 0.30 |
|
| 4 | 0.21 | 0.29 |
|
| 0 | 0.00 | 0.05 |
|
| 3 | 0.15 | 0.35 |
|
| 7 | 0.36 | 0.48 |
|
| 1 | 0.05 | 0.13 |
| Other Bacteria | 0 | 0.00 | 0.04 |
|
| 7 | 0.36 | 0.68 |
|
| 1 | 0.05 | 0.06 |
|
| 5 | 0.26 | 0.55 |
| Other | 1 | 0.05 | 0.07 |
| Phage/Plasmid | 1 | 0.05 | 0.29 |
|
| 6 | 0.31 | 0.33 |
Top matches are derived from Table S3. Predicted Arcobacter butzleri proteins were compared to proteins in the NCBI non-redundant (nr) database by BLASTP. Matches with an Expect (E) value of >1×10−5, an identity of <25%, and an alignment length across either the query or match sequences of <75% were excluded. The top five matches (where applicable) were identified at each locus (Table S3) and classified by taxon. Individual orders are listed if they constitute >1% of the total.
A total of 1944 loci contained matches to proteins in the nr database, according to the criteria listed above. Percentage values for the defined taxa do not include the 315 loci (13.9% of the 2259 identified loci) for which no homologs were identified.
Multiple strains of the same species (e.g. C. jejuni RM1221 and C. jejuni NCTC 11168) at a given locus were counted once. Percentage values for the defined taxa do not include the 315 loci for which no homologs were identified.
Figure 1Genome BLAST atlas of the A. butzleri strain RM4018.
Arcobacter butzleri strain RM4018 is the reference genome and is compared to a set of 15 other epsilonproteobacterial genomes, including different Campylobacter (rings 7–14 from center) and Helicobacter (rings 15–17 from center) strains, as well as the UniProt database (outermost ring in black). A web-based “zoomable atlas” can be found at [147].
Divergence within the major functional categories
| Divergent genes | |||
| Functional category | ε-proteobacteria |
|
|
| Glycolysis/TCA cycle/gluconeogenesis | 5/34 (15%) | 5/34 (15%) | 6/34 (18%) |
| Respiration/electron transport | 10/90 (11%) | 24/90 (27%) | 20/90 (22%) |
| Sulfur/nitrogen metabolism | 10/29 (34%) | 22/29 (76%) | 12/29 (41%) |
| Amino acid biosynthesis | 3/85 (4%) | 11/85 (13%) | 3/85 (4%) |
| Purine/pyrimidine biosynthesis | 2/29 (7%) | 5/29 (17%) | 2/29 (7%) |
| Biosynthesis of cofactors | 6/86 (7%) | 16/86 (19%) | 10/86 (12%) |
| Fatty acid biosynthesis | 0/17 (0%) | 1/17 (6%) | 0/17 (0%) |
| Transcriptional regulators/σ factors | 24/51 (47%) | 41/51 (80%) | 29/51 (57%) |
| Signal transduction | 51/104 (49%) | 77/104 (74%) | 58/104 (56%) |
| Protein translation/modification | 2/143 (1%) | 9/143 (6%) | 5/143 (3%) |
| DNA replication/repair | 13/70 (19%) | 26/70 (37%) | 16/70 (23%) |
| RNA synthesis | 1/17 (6%) | 5/17 (29%) | 1/17 (6%) |
| Macromolecule degradation | 4/45 (9%) | 13/45 (29%) | 7/45 (16%) |
| Cell envelope | 70/190 (37%) | 91/190 (48%) | 85/190 (45%) |
| Transport/secretion | 47/191 (25%) | 88/191 (46%) | 71/191 (37%) |
| Chemotaxis | 32/46 (70%) | 38/46 (83%) | 33/46 (72%) |
| Antibiotic resistance | 10/28 (36%) | 20/28 (71%) | 10/28 (36%) |
| General function | 458/782 (59%) | 597/782 (76%) | 527/782 (67%) |
Arcobacter butzleri strain RM4018 proteins that have either no homologs or homologs of low similarity (≤ 35% identity) within the given taxa/taxon. Categories and values are derived from Table S2.
Includes [Fe-S] centers.
Includes biosynthesis of prosthetic groups and carriers.
LOS, flagella and membrane proteins.
Includes conserved hypothetical and hypothetical proteins. Does not include general function proteins assigned to the other functional categories.
Figure 2Sulfur assimilation and biosynthesis of the sulfur-containing amino acids.
Genes/proteins in strain RM4018 unique within Campylobacteraceae or proteins with Campylobacteraceae orthologs of low similarity are labeled/shaded in blue.
Figure 3Predicted TCA, methylcitrate and citramalate cycles of A. butzleri strain RM4018.
Genes unique within the Campylobacteraceae are labeled in green. Genes in parentheses encode multi-subunit proteins or a protein and its cognate accessory protein. 1: pyruvate dehydrogenase; 2: L-lactate dehydrogenase; 3: phosphate transacetylase; 4: acetate kinase; 5: citrate synthase; 6: aconitase; 7: isocitrate dehydrogenase; 8: 2-oxoglutarate:acceptor oxidoreductase; 9: fumarate reductase; 10: fumarase; 11: malate dehydrogenase; 12: malate:quinone oxidoreductase; 13: 2-methylcitrate synthase; 14: 2-methylcitrate dehydratase; 15: aconitase; 16: 2-methylisocitrate lyase; 17: malic enzyme; 18: malyl-CoA hydrolase; 19: malyl-CoA lyase; 20: β-methylmalyl-CoA lyase; 21: mesaconyl-CoA hydratase; 22: mesaconyl-CoA synthetase; 23: citramalate hydrolase; 24: citramalate lyase.
Figure 4Respiratory pathways in A. butzleri strain RM4018.
Transfer of electrons to the menaquinone (MQ) pool is represented by blue arrows; transfer of electrons from the menaquinone pool is represented by red arrows. b- and c-type cytochromes are shaded yellow. Genes unique within the Campylobacteraceae are labeled in green. NADH:Q OR: NADH:quinone oxidoreductase; M:Q OR: malate:quinone oxidoreductase; FormDH: formate dehydrogenase; LacDH: L-lactate dehydrogenase; SulfDH: sulfur dehydrogenase; Ubi cyt c OR: Ubiquinol cytochrome c oxidoreductase; TMAO RD: TMAO reductase; NO3RD: nitrate reductase; NO2RD: nitrite reductase; FumRD: fumarate reductase; QHAmDH: quinohemoprotein amine dehydrogenase; AldDH: aldehyde dehydrogenase; PX: peroxidase; CTQ: cysteine tryptophylquinone.
Figure 5Phylogenetic analysis of four representative flagellar proteins and flagellar clustering in strain RM4018.
A. Each dendrogram was constructed using the neighbor-joining algorithm and the Kimura two-parameter distance estimation method. Bootstrap values of >75%, generated from 500 replicates, are shown at the nodes. The scale bar represents substitutions per site. Cjj: Campylobacter jejuni subsp. jejuni; Cjd: C. jejuni subsp. doylei; Cc: C. coli; Cu: C. upsaliensis; Cl: C. lari; Cff: C. fetus subsp. fetus; Ccur: C. curvus; Ccon: C. concisus; Ab: A. butzleri; Ws: W. succinogenes; Sd: S. denitrificans; Hh: H. hepaticus; Hac: H. acinonychis; Hp: H. pylori; Nit: Nitratiruptor sp.; Ec: Escherichia coli. B. Location of the flagellar genes of C. jejuni strain RM1221 and A. butzleri strain RM4018.
Strains used in the comparative genomic indexing analysis.
| Strain | Description | Location | Source |
| RM4018 | ATCC 49616, type strain | USA (Calif.) | Human diarrheal stool |
| RM1588 | NADC 5276 | USA | Chicken |
| RM1591 | NADC 5377 | USA | Turkey carcass |
| RM4128 | 29.97 | South Africa | Human stool |
| RM4462 | NADC 3553 | USA (Tex.) | Human stool |
| RM4467 | NADC 3566 | USA | Primate rectal swab |
| RM4596 | NADC 5262 | USA (Iowa) | Turkey |
| RM4843 | NADC 5278 | USA (Iowa) | Chicken carcass |
| RM4850 | NADC 6830 | USA (Iowa) | Horse |
| RM5516 | NADC 3156 | USA (Iowa) | Pig |
| RM5538 | CDC D2725 | USA (Mass.) | Human stool |
| RM5541 | CDC D2901 | USA (Colo.) | Human stool |
| RM5544 | CDC D2778 | Thailand | Human stool |
All test strains selected had unique MLST sequence types (data not shown).
Assignment of absent CDS to functional categories of A. butzleri.
| Genes in data set | Absent genes (%) | |
|
| ||
| I.A Degradation | 12 | 8.3 |
| I.B Energy metabolism | 129 | 3.8 |
| I.C Central intermediary metabolism | 37 | 40.5 |
| I.D Amino acid biosynthesis | 85 | 3.5 |
| I.E Polyamine biosynthesis | 4 | 0 |
| I.F Purines, pyrimidines, nucleosides and nucleotides | 41 | 0 |
| I.G Biosynthesis of cofactors, prosthetic groups and carriers | 96 | 7.3 |
| I.H Fatty acid biosynthesis | 17 | 11.8 |
|
| 12 | 8.3 |
| II.A Transcriptional regulation | 51 | 33.3 |
| II.B Signal transduction | 104 | 23.1 |
|
| ||
| III.A Synthesis and modification of macromolecules | 240 | 7.1 |
| III.A.2-3 Ribosome and ribosomal protein synthesis, modification and maturation | 63 | 4.8 |
| III.A.5 Aminoacyl tRNA synthetases and tRNA modification | 53 | 0 |
| III.A.7 DNA replication, restriction/modification, repair, and recombination | 69 | 13 |
| III.A.8 Protein translation and modification | 27 | 0 |
| III.A.9 RNA synthesis, and RNA modification, and DNA transcription | 18 | 5.6 |
| III.A.11 Phospholipids | 11 | 27.2 |
| III.B Degradation of macromolecules | 45 | 8.9 |
| III.C Cell envelope | ||
| III.C.1 Membrane proteins, lipoproteins, and porins | 93 | 16.1 |
| III.C.2 Surface polysaccharides, lipopolysaccharides, and antigens | 57 | 36.8 |
| III.C.3 Surface structures | 41 | 34.1 |
| III.C.4 Murein sacculus and peptidoglycan | 21 | 9.5 |
|
| ||
| IV.A Transport/binding proteins | 169 | 24.9 |
| IV.B Chaperones | 16 | 12.5 |
| IV.C Cell division | 14 | 7.1 |
| IV.D Chemotaxis and mobility | 46 | 21.7 |
| IV.E Protein and peptide secretion | 21 | 0 |
| IV.G Detoxification | 9 | 11.1 |
| IV.I Pathogenicity | 12 | 25 |
| IV.J DNA uptake/competence | 8 | 12.5 |
|
| ||
| V.A Phage-related functions and prophage | 54 | 96.3 |
| V.D Drug/analog sensitivity and antibiotic resistance | 28 | 17.9 |
| V.F Adaptations and atypical conditions | 9 | 11.1 |
|
| ||
| VI.A Miscellaneous/General function | 143 | 16.1 |
| VI.B Domain of unknown function (DUF) proteins | 61 | 9.8 |
| VI.C Conserved hypothetical proteins-no conserved domains | 303 | 19.9 |
| VI.D Hypothetical proteins | 262 | 39.7 |
Genes absent in at least 2 of 12 additional strains of A. butzleri.
Primarily sox and ure genes.