
Complete Genome Sequence of the Arcobacter suis Type Strain LMG 26152.

William G Miller, Emma Yee, James L Bono

Abstract

Arcobacter species are prevalent in pigs, and strains have been isolated from pig feces and pork meat; some Arcobacter strains may be porcine abortifacients. Arcobacter suis was recovered from pork meat in Spain. This study describes the whole-genome sequence of the A. suis type strain LMG 26152 (=F41T =CECT 7833T).

Year:  2018        PMID: 30533764      PMCID: PMC6256499          DOI: 10.1128/MRA.01307-18

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Arcobacter species are often isolated from swine, pig manure, and pork meat (1–6), and some species or strains are possible porcine abortifacients (7, 8). Arcobacter suis, represented by a single strain (the type strain), was originally isolated from retail market pork (9). Subsequent to the initial description, A. suis was also recovered from water buffalo milk (10), operational taxonomic units similar to A. suis were identified in samples from a spinach-processing plant (11), and A. suis was potentially identified in sewage (12). In this study, we report the first closed genome sequence of the A. suis type strain LMG 26152 (=F41T =CECT 7833T), isolated in 2008 from pork meat in Catalonia, Spain (9).

Arcobacter suis strain LMG 26152T was grown at 30°C for 48 h aerobically on anaerobe basal agar (Oxoid) plus 5% horse blood. A loop of cells (∼5 μl) was taken from the plate, and genomic DNA was isolated with the Wizard genomic DNA purification kit (Promega, Madison, WI). Shotgun and paired-end Roche 454 libraries were constructed following the manufacturer’s protocols and with standard methods. PacBio SMRTbell libraries were prepared from 10 μg of genomic DNA with the standard 20-kb PacBio protocol (13).

Shotgun and paired-end 454 libraries were sequenced on a GS-FLX+ instrument with Titanium chemistry and standard protocols. The resulting reads were assembled into 79 total contigs and a chromosomal scaffold of 33 contigs with Newbler v. 2.6 (Roche); Roche standard flowgram format (SFF) files were not processed before assembly, and 454 reads were quality controlled within the Newbler assembler. Low-quality contigs were deleted, and the remaining 25 contigs were positioned at one or more locations within the scaffold gaps with the Perl script contig_extender3 (14). PacBio sequencing was performed as described previously (14); the single chromosomal contig was assembled along with the 454 contigs with SeqMan v. 8.0 (DNASTAR, Madison, WI).
Chromosomal assembly was also validated with an optical restriction map (restriction enzyme XbaI; OpGen, Gaithersburg, MD). Illumina HiSeq reads were obtained from SeqWright (Houston, TX) and assembled with Newbler v. 2.6. HiSeq reads were not processed before assembly, and any quality trimming of the reads was performed within Newbler. The HiSeq reads and contigs were used to verify and error correct the 454 and PacBio base calls, as described previously (15). The final coverage across the genome was 917×. Sequencing metrics and genomic data for A. suis strain LMG 26152T are presented in Table 1.

A. suis strain LMG 26152T has a circular genome of 2,639,269 bp, with an average G+C content of 27.4%. Putative coding sequences (CDSs), tRNA/transfer-messenger RNA (tmRNA) genes, and rRNA loci were identified with GeneMark, ARAGORN, and RNAmmer, respectively (16–18). A preliminary GenBank-formatted file was created with the genome sequence and the GeneMark-derived CDS coordinates. Identification of putative pseudogenes and genes missed in the original GeneMark analysis and manual curation of each putative CDS were performed with the GenBank-formatted file and Artemis v. 16 (19). Annotation was accomplished with BLASTP to compare the proteome of strain LMG 26152T to proteins in both the NCBI nonredundant (nr) database and a custom protein database constructed from the proteomes of all current completed Arcobacter and Campylobacter genomes. Annotation was further refined with an analysis of Pfam motifs (20).
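The 917× total coverage can be cross-checked against the per-platform read and base counts reported in Table 1. The following is a minimal sketch, assuming that coverage is simply total sequenced bases divided by the chromosome size and that mean read length is bases divided by reads; the function and variable names are illustrative and do not come from the authors' pipeline.

```python
# Sketch: recompute mean read length and coverage from the Table 1 values.
# Assumption: coverage = sequenced bases / chromosome size (2,639,269 bp).

GENOME_SIZE_BP = 2_639_269  # chromosome size reported in Table 1

# platform -> (number of reads, number of bases), copied from Table 1
PLATFORMS = {
    "454 shotgun": (185_325, 103_818_564),
    "454 paired-end": (76_546, 24_718_237),
    "Illumina HiSeq 2000": (17_658_830, 1_765_883_000),
    "PacBio": (174_492, 524_997_072),
}


def mean_read_length(n_reads, n_bases):
    """Average read length in bases."""
    return n_bases / n_reads


def coverage(n_bases, genome_size=GENOME_SIZE_BP):
    """Fold coverage contributed by n_bases of sequence."""
    return n_bases / genome_size


def total_coverage(platforms=PLATFORMS):
    """Sum of the fold coverage over all platforms."""
    return sum(coverage(n_bases) for _, n_bases in platforms.values())


if __name__ == "__main__":
    for name, (n_reads, n_bases) in PLATFORMS.items():
        length = mean_read_length(n_reads, n_bases)
        print(f"{name}: mean length {length:.1f} bases, "
              f"coverage {coverage(n_bases):.1f}x")
    print(f"total coverage: {total_coverage():.0f}x")
```

Under these assumptions the per-platform values round to the figures listed in Table 1, and their sum rounds to the 917× stated in the text.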
TABLE 1

Sequencing metrics and genomic data for Arcobacter suis strain LMG 26152T

Feature | Value(s)^a
Sequencing metrics
    454 (shotgun) platform
        No. of reads | 185,325
        No. of bases | 103,818,564
        Average length (bases) | 560.2
        Coverage (×) | 39.3
    454 (paired-end) platform
        No. of reads | 76,546
        No. of bases | 24,718,237
        Average length (bases) | 322.9
        Coverage (×) | 9.4
    Illumina HiSeq 2000 platform
        No. of reads | 17,658,830
        No. of bases | 1,765,883,000
        Average length (bases) | 100
        Coverage (×) | 669.1
    PacBio platform
        No. of reads | 174,492
        No. of bases | 524,997,072
        Average length (bases) | 3,008.7^b
        Coverage (×) | 198.9
    Newbler metrics^c
        N50ContigSize (454) (bases) | 142,251
        Q40PlusBases (454) (%) | 99.84
        N50ContigSize (HiSeq pool 1) (bases) | 142,383
        Q40PlusBases (HiSeq pool 1) (%) | 99.99
        N50ContigSize (HiSeq pool 2) (bases) | 142,376
        Q40PlusBases (HiSeq pool 2) (%) | 99.99
        N50ContigSize (HiSeq pool 3) (bases) | 142,376
        Q40PlusBases (HiSeq pool 3) (%) | 99.99
Genomic data
    Chromosome
        Size (bp) | 2,639,269
        G+C content (%) | 27.39
        No. of CDSs^d | 2,523
            Assigned function (% CDSs) | 976 (38.7)
            General function annotation (% CDSs) | 923 (36.6)
            Domain/family annotation only (% CDSs) | 173 (6.9)
            Hypothetical (% CDSs) | 451 (17.9)
        Pseudogenes | 34
    Genomic islands/CRISPR
        No. of genetic islands | 9
        No. of CDSs in genetic islands | 157, [7]
        No. of CRISPR-Cas loci | 0
    Gene content/pathways
        IS elements, mobile elements, or transposases | 3, [2] (IS3)
        Signal transduction
            Che proteins | cheABCDRVW(Y)2
            No. of methyl-accepting chemotaxis proteins | 25
            No. of response regulators | 42, [1]
            No. of histidine kinases | 53, [1]
            No. of response regulator/histidine kinase fusions | 6
            No. of diguanylate cyclases | 17
            No. of diguanylate phosphodiesterases (HD-GYP, EAL) | 7, 22
            No. of diguanylate cyclase/phosphodiesterases | 13
            No. of other | 8, [1]
        Motility
            Flagellin genes | fla
        Restriction/modification
            No. of type I systems (hsd) | 0
            No. of type II systems | 1
            No. of type III systems | 1
        Transcription/translation
            No. of transcriptional regulatory proteins | 54
            Non-ECF^e σ factors | σ54, σ70
            No. of ECF σ factors | 3
            No. of tRNAs | 59
            No. of ribosomal loci | 5
        CO dehydrogenase (coxSLF) | Yes
        Ethanolamine utilization (eutBCH) | Yes
        Nitrogen fixation (nif) | Yes
        Osmoprotection | betA
        Pyruvate → acetyl-CoA^f
            Pyruvate dehydrogenase (E1/E2/E3) | Yes
            Pyruvate:ferredoxin oxidoreductase | porABDG
        Urease | ureAB
        Vitamin B12 biosynthesis | Yes

^a Numbers in square brackets indicate pseudogenes or fragments.

^b Maximum length, 24,428 bases.

^c Features and values taken from largeContigMetrics within 454NewblerMetrics.txt for each assembly. Large contigs were defined as those ≥500 bases. Due to the large number of HiSeq reads, the total reads were split into 3 pools and assembled independently.

^d Numbers do not include pseudogenes. CDSs, coding sequences.

^e ECF, extracytoplasmic function.

^f CoA, coenzyme A.
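The functional-category breakdown of the CDSs in Table 1 can be checked for internal consistency. The sketch below assumes that each percentage is the category count divided by the 2,523 CDSs (pseudogenes excluded) and rounded to one decimal place; the names are illustrative only.

```python
# Sketch: verify that the CDS categories in Table 1 add up and that the
# reported percentages follow from the counts.
# Assumption: percentage = 100 * category count / total CDSs, 1 decimal place.

CDS_TOTAL = 2_523  # No. of CDSs (pseudogenes excluded), from Table 1

CDS_CATEGORIES = {
    "Assigned function": 976,
    "General function annotation": 923,
    "Domain/family annotation only": 173,
    "Hypothetical": 451,
}


def category_percentages(categories, total):
    """Return each category's share of the total, in percent."""
    return {name: round(100 * count / total, 1)
            for name, count in categories.items()}


if __name__ == "__main__":
    # The four categories should partition the full CDS set.
    print("sum of categories:", sum(CDS_CATEGORIES.values()))
    for name, pct in category_percentages(CDS_CATEGORIES, CDS_TOTAL).items():
        print(f"{name}: {pct}%")
```

With these inputs the four categories sum to the total CDS count, and the computed shares match the percentages in the table.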

The LMG 26152T genome is predicted to encode 2,523 putative protein-coding genes, 34 pseudogenes, 5 rRNA operons, and 59 tRNA-encoding genes, and it contains 9 genomic islands ranging from 5.5 to 34.3 kb in size. The LMG 26152T genome contains a nif/rpoN nitrogen fixation gene cluster (21; GenBank accession number CP001999) and the same set of adenosylcobalamin biosynthesis genes identified previously in A. bivalviorum (22). The A. suis genome also encodes the B12-dependent EutBC ethanolamine ammonia-lyase and the EutH ethanolamine permease. The acetaldehyde produced by EutBC would presumably be converted to ethanol and acetyl-coenzyme A by a putative AdhE dehydrogenase (Asuis2568). Two large genes encoding T1SS repeat domain-containing proteins were identified in the A. suis genome: asuis0242 (9,252 bp) and asuis0243 (16,326 bp). Similarly to A. mytili (23), these genes contain tandemly repeated internal motifs (5 × 639 bp, asuis0242; 39 × 647 bp, asuis0243). A 34,282-bp gene with no internal repeats encoding a putative repeats-in-toxin (RTX) family calcium-binding protein was also identified.

Data availability.

The complete genome sequence of A. suis strain LMG 26152T has been deposited in GenBank under the accession number CP032100. 454, HiSeq, and PacBio sequencing reads have been deposited in the NCBI Sequence Read Archive (SRA) under the accession number SRP155204.
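Readers who want to retrieve the deposited chromosome and recompute its G+C content could do so along the following lines. This is a sketch only: it assumes access to the public NCBI E-utilities `efetch` endpoint with the `nuccore` database, the helper names are hypothetical, and the download function is defined but not called here because it needs network access.

```python
# Sketch: build an NCBI E-utilities efetch URL for a nucleotide accession
# and compute G+C content from a sequence string.
# Assumption: helper names are illustrative; fetch_sequence() needs network
# access and is therefore not invoked in this example.
from urllib.parse import urlencode
from urllib.request import urlopen

EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def efetch_fasta_url(accession):
    """URL that returns the FASTA record for a nucleotide accession."""
    params = {"db": "nuccore", "id": accession,
              "rettype": "fasta", "retmode": "text"}
    return f"{EFETCH}?{urlencode(params)}"


def gc_content(sequence):
    """Percent G+C among the unambiguous bases (A, C, G, T) of a sequence."""
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    if gc + at == 0:
        raise ValueError("sequence contains no A, C, G, or T bases")
    return 100 * gc / (gc + at)


def fetch_sequence(accession):
    """Download a FASTA record and return its sequence without the header."""
    with urlopen(efetch_fasta_url(accession)) as response:
        text = response.read().decode("ascii")
    return "".join(line.strip() for line in text.splitlines()
                   if not line.startswith(">"))


if __name__ == "__main__":
    print(efetch_fasta_url("CP032100"))
    print(gc_content("GCAT"))  # toy sequence, half G+C
```

Calling `gc_content(fetch_sequence("CP032100"))` on a networked machine would be expected to give a value close to the 27.39% listed in Table 1, although that call was not run for this example.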

1.  Prevalence and diversity of Arcobacter spp. isolated from the internal organs of spontaneous porcine abortions in Denmark.

Authors:  Stephen L W On; Tim K Jensen; Vivi Bille-Hansen; Sven E Jorsal; Peter Vandamme
Journal:  Vet Microbiol       Date:  2002-03-01       Impact factor: 3.293

2.  ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences.

Authors:  Dean Laslett; Bjorn Canback
Journal:  Nucleic Acids Res       Date:  2004-01-02       Impact factor: 16.971

3.  Characterization of Arcobacter suis isolated from water buffalo (Bubalus bubalis) milk.

Authors:  Federica Giacometti; Nuria Salas-Massó; Andrea Serraino; Maria José Figueras
Journal:  Food Microbiol       Date:  2015-06-17       Impact factor: 5.516

4.  Arcobacter trophiarum sp. nov., isolated from fattening pigs.

Authors:  Sarah De Smet; Peter Vandamme; Lieven De Zutter; Stephen L W On; Laid Douidah; Kurt Houf
Journal:  Int J Syst Evol Microbiol       Date:  2010-03-19       Impact factor: 2.747

5.  Arcobacter thereius sp. nov., isolated from pigs and ducks.

Authors:  Kurt Houf; Stephen L W On; Tom Coenye; Lies Debruyne; Sarah De Smet; Peter Vandamme
Journal:  Int J Syst Evol Microbiol       Date:  2009-07-21       Impact factor: 2.747

6.  Occurrence and genetic diversity of Arcobacter spp. in a spinach-processing plant and evaluation of two Arcobacter-specific quantitative PCR assays.

Authors:  Lena Hausdorf; Maria Neumann; Ingo Bergmann; Kerstin Sobiella; Kerstin Mundt; Antje Fröhling; Oliver Schlüter; Michael Klocke
Journal:  Syst Appl Microbiol       Date:  2013-04-03       Impact factor: 4.022

7.  The Pfam protein families database.

Authors:  Marco Punta; Penny C Coggill; Ruth Y Eberhardt; Jaina Mistry; John Tate; Chris Boursnell; Ningze Pang; Kristoffer Forslund; Goran Ceric; Jody Clements; Andreas Heger; Liisa Holm; Erik L L Sonnhammer; Sean R Eddy; Alex Bateman; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2011-11-29       Impact factor: 16.971

8.  First multi-locus sequence typing scheme for Arcobacter spp.

Authors:  William G Miller; Irene V Wesley; Stephen L W On; Kurt Houf; Francis Mégraud; Guilin Wang; Emma Yee; Apichai Srijan; Carl J Mason
Journal:  BMC Microbiol       Date:  2009-09-14       Impact factor: 3.605

9.  RNAmmer: consistent and rapid annotation of ribosomal RNA genes.

Authors:  Karin Lagesen; Peter Hallin; Einar Andreas Rødland; Hans-Henrik Staerfeldt; Torbjørn Rognes; David W Ussery
Journal:  Nucleic Acids Res       Date:  2007-04-22       Impact factor: 16.971

10.  Comparative Genomic Analysis Identifies a Campylobacter Clade Deficient in Selenium Metabolism.

Authors:  William G Miller; Emma Yee; Bruno S Lopes; Mary H Chapman; Steven Huynh; James L Bono; Craig T Parker; Norval J C Strachan; Ken J Forbes
Journal:  Genome Biol Evol       Date:  2017-07-01       Impact factor: 3.416
