Literature DB >> 30533823

Complete Genome Sequences of the Arcobacter cryaerophilus Strains ATCC 43158T and ATCC 49615.

William G Miller1, Emma Yee1, James L Bono2.   

Abstract

Arcobacter cryaerophilus was originally recovered from aborted bovine and porcine fetuses, but it has been subsequently isolated from meat, water, and human clinical samples. This study describes the complete whole-genome sequences of two A. cryaerophilus strains, ATCC 43158T (=A 169/BT =LMG 24291T) and ATCC 49615 (=CDC D2610 =LMG 10829).

Entities:  

Year:  2018        PMID: 30533823      PMCID: PMC6256622          DOI: 10.1128/MRA.01463-18

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Arcobacter cryaerophilus (formerly Campylobacter cryaerophila [1]) was originally recovered from aborted bovine and porcine fetuses (1–3), from the milk of cows with mastitis (4), and from bovine and porcine feces (1). Subsequent studies reported the recovery of A. cryaerophilus isolates from meat (5, 6), water (7), and human clinical samples (8, 9). Additionally, based on the results of whole-cell protein and rRNA restriction fragment length polymorphism analyses (10, 11), A. cryaerophilus strains have been divided into two subgroups, 1A and 1B. In this study, we report the first closed genome sequences of two A. cryaerophilus strains, the type strain ATCC 43158 (subgroup 1A; =A 169/BT [1] =LMG 24291T), isolated in the United Kingdom from the brain of an aborted bovine fetus, and strain ATCC 49615 (subgroup 1B; =CDC D2610 [12] =LMG 10829), isolated in Illinois from human blood. A draft genome sequence (91 scaffolds) was deposited previously for A. cryaerophilus strain ATCC 43158T (GenBank accession number NXGK01000000). Additionally, draft genome sequences for 12 other A. cryaerophilus strains have been deposited previously in GenBank (https://www.ncbi.nlm.nih.gov/genome/genomes/11530). Both A. cryaerophilus strains were grown aerobically at 30°C for 48 h on anaerobe basal agar (Oxoid) amended with 5% horse blood. Cells were separately removed from each plate using sterile 5-μl inoculating loops, and genomic DNA was prepared from each loopful of cells using the Promega Wizard genomic DNA purification kit (Madison, WI). Roche 454 shotgun and paired-end libraries were constructed for both strains, using the manufacturer’s standard protocols, and sequenced on a GS-FLX+ instrument using the Titanium chemistry. For each strain, the shotgun and paired-end reads were assembled together, using Newbler v. 2.6 (Roche) and default parameters; 454 reads were quality controlled within Newbler. Sequencing metrics for both strains are presented in Table 1. Two scaffolds were obtained for strain ATCC 43158T, one chromosomal scaffold of 20 contigs and a megaplasmid scaffold of 4 contigs. One chromosomal scaffold of 16 contigs was obtained for strain ATCC 49615. Each scaffold was closed by using the custom Perl script contig_extender3 (13) to place within the scaffold gaps unique, nonscaffolded contigs and/or contigs representing sequences present at two or more locations. The two chromosomes and the megaplasmid were assembled manually within SeqMan Pro v. 8.0 (DNASTAR, Madison, WI), using the contig order defined above and contig-spanning 454 reads. Assembly of both genomes was verified using an optical restriction map (restriction enzyme XbaI; OpGen, Gaithersburg, MD). Illumina HiSeq reads were obtained from SeqWright (Houston, TX) and assembled and quality controlled within Newbler v. 2.6, as above. The Illumina contigs were assembled automatically (unique contigs) or positioned manually (repeated contigs) within the 454 SeqMan assembly. The Illumina contigs and reads were used to verify the 454 base calls, as described previously (14). A repeat region within the type strain chromosome required long reads for validation. Therefore, a 20-kb PacBio library was constructed, using standard methods (15), and sequenced on an RS II instrument, using standard protocols, as described previously (14). PacBio reads were assembled with RS_HGAP_Assembly v. 3 (Pacific Biosciences) and default settings, which yielded single chromosomal and megaplasmid contigs. The PacBio chromosomal contig was manually placed within the SeqMan assembly to confirm the 454 contig order across the repeat region.
TABLE 1

Sequencing metrics and genomic data for Arcobacter cryaerophilus strains ATCC 43158T and ATCC 49615

Featureb Value(s)a
ATCC 43158TATCC 49615
Sequencing metrics
    454 (shotgun) platform
        No. of reads215,12987,759
        No. of bases90,272,11636,134,765
        Avg length (bases)419.6411.7
        Coverage (×)45.017.6
    454 (paired-end) platform
        No. of reads118,785161,571
        No. of bases38,763,67549,505,772
        Avg length (bases)326.3306.4
        Coverage (×)19.324.1
    llumina HiSeq 2000 platform
        No. of reads18,366,93218,769,966
        No. of bases1,836,693,2001,895,766,566
        Avg length (bases)100101
        Coverage (×)915.2922.1
    PacBio platform
        No. of reads117,659NA
        No. of bases437,690,629NA
        Avg length (bases)3,720cNA
        Coverage (×)218.1NA
    Newbler metricsd
        Total no. of contigs4532
        N50ContigSize (454)195,432346,474
        Q40PlusBases (454) (%)99.9099.87
        N50ContigSize (HiSeq pool 1)195,198301,090
        Q40PlusBases (HiSeq pool 1) (%)99.9899.96
        N50ContigSize (HiSeq pool 2)195,524165,874
        Q40PlusBases (HiSeq pool 2) (%)99.9699.96
        N50ContigSize (HiSeq pool 3)195,196301,081
        Q40PlusBases (HiSeq pool 3) (%)99.9799.99
Genomic data
    Size (bp)2,006,909; 101,4312,055,914
    G+C content (%)27.46; 24.6327.51
    No. of CDSe1,951; 941,994
        Assigned function (% CDS)847 (43.4); 3 (3.2)857 (43.0)
        General function annotation (% CDS)675 (34.6); 34 (36.2)694 (34.8)
        Domain/family annotation only (% CDS)131 (6.7); 4 (4.3)136 (6.8)
        Hypothetical (% CDS)298 (15.3); 53 (56.4)307 (15.4)
    Pseudogenes23; 68
    Genomic islands/CRISPR
        No. of genetic islands8; 05
        No. of CDS in genetic islands108 [2]; 085 [2]
        No. of CRISPR/Cas loci00
    Gene content/pathways
        IS elements, mobile elements, or transposases2; 12
        Signal transduction
            No. of Che proteins9; 09
            No. of methyl-accepting chemotaxis proteins20; 023
            No. of response regulators23; 225
            No. of histidine kinases25; 127
            No. of response regulator/histidine kinase fusions03
            No. of diguanylate cyclases14; 015
            No. of diguanylate phosphodiesterases (HD-GYP, EAL)4, 1; 05, 1
            No. of diguanylate cyclase/phosphodiesterases9; 09
            No. of other10; 110
        Motility
            Flagellin genesflaABflaAB
        Restriction/modification
            No. of type I systems (hsd)[2]; 02
            No. of type II systems1; 03
            No. of type III systems02
        Transcription/translation
            No. of transcriptional regulatory proteins32; 432
            Non-ECF σ factorsσ70σ70
            No. of ECF σ factors00
            No. of tRNAs49; 051
            No. of ribosomal loci5; 05
        CO dehydrogenase (coxSLF)NoNo
        Ethanolamine utilization (eutBCH)NoNo
        Nitrogen fixation (nif)NoNo
        OsmoprotectionectABCNo
        Pyruvate → acetyl-CoA
            Pyruvate dehydrogenase (E1/E2/E3)YesYes
            Pyruvate:ferredoxin oxidoreductaseNoNo
        UreaseNoNo
        Vitamin B12 biosynthesisNoNo

Numbers in square brackets indicate pseudogenes or fragments. Strain ATCC 43158T values before a semicolon are for the chromosome, while values after the semicolon are for the pACRY43158 plasmid. NA, not applicable.

CDS, coding sequences; ECF, extracytoplasmic function; acetyl-CoA, acetyl coenzyme A.

Maximum length, 25,867 bases.

Features and values taken from largeContigMetrics within 454NewblerMetrics.txt for each assembly. Large contigs were defined as having ≥500 bases. Due to the large number of HiSeq reads, the total reads were split into three pools and assembled independently. N50ContigSize value is in number of bases.

Numbers do not include pseudogenes.

Sequencing metrics and genomic data for Arcobacter cryaerophilus strains ATCC 43158T and ATCC 49615 Numbers in square brackets indicate pseudogenes or fragments. Strain ATCC 43158T values before a semicolon are for the chromosome, while values after the semicolon are for the pACRY43158 plasmid. NA, not applicable. CDS, coding sequences; ECF, extracytoplasmic function; acetyl-CoA, acetyl coenzyme A. Maximum length, 25,867 bases. Features and values taken from largeContigMetrics within 454NewblerMetrics.txt for each assembly. Large contigs were defined as having ≥500 bases. Due to the large number of HiSeq reads, the total reads were split into three pools and assembled independently. N50ContigSize value is in number of bases. Numbers do not include pseudogenes. Genomic data for the two A. cryaerophilus strains are presented in Table 1. The average genome size for the two strains is approximately 2 Mbp, with an average G+C content of 27.5%. Protein-, rRNA-, and tRNA-encoding genes were identified using GeneMark, RNAmmer, and ARAGORN (16–18), respectively, and annotated as described previously (14). The ATCC 43158T genome also contains the 101,431-bp megaplasmid pACRY43158.

Data availability.

The complete genome sequence of A. cryaerophilus strain ATCC 43158T has been deposited in GenBank under the accession numbers CP032823 (chromosome) and CP032824 (pACRY43158), and the complete genome sequence of A. cryaerophilus strain ATCC 49615 has been deposited in GenBank under the accession number CP032825. All 454, HiSeq, and PacBio sequencing reads for strain ATCC 43158T and 454 and HiSeq reads for strain ATCC 49615 have been deposited in the NCBI Sequence Read Archive (SRA) under the accession numbers SRP164716 and SRP164722, respectively.
  14 in total

1.  Isolation of spirillum-like organisms from pig and bovine fetuses.

Authors:  R Higgins; R Degre
Journal:  Vet Rec       Date:  1979-03-24       Impact factor: 2.695

2.  ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences.

Authors:  Dean Laslett; Bjorn Canback
Journal:  Nucleic Acids Res       Date:  2004-01-02       Impact factor: 16.971

3.  Restriction fragment length polymorphisms in the ribosomal genes for species identification and subtyping of aerotolerant Campylobacter species.

Authors:  J A Kiehlbauch; B D Plikaytis; B Swaminathan; D N Cameron; I K Wachsmuth
Journal:  J Clin Microbiol       Date:  1991-08       Impact factor: 5.948

4.  Campylobacter butzleri sp. nov. isolated from humans and animals with diarrheal illness.

Authors:  J A Kiehlbauch; D J Brenner; M A Nicholson; C N Baker; C M Patton; A G Steigerwalt; I K Wachsmuth
Journal:  J Clin Microbiol       Date:  1991-02       Impact factor: 5.948

5.  Mastitis in dairy cows associated with an aerotolerant campylobacter.

Authors:  E F Logan; S D Neill; D P Mackie
Journal:  Vet Rec       Date:  1982-03-06       Impact factor: 2.695

6.  Presence of Arcobacter spp. in environmental waters correlates with high levels of fecal pollution.

Authors:  Luis Collado; Isabel Inza; Josep Guarro; Maria Jose Figueras
Journal:  Environ Microbiol       Date:  2008-01-21       Impact factor: 5.491

7.  Characterization of the Arcobacter contamination on Belgian pork carcasses and raw retail pork.

Authors:  Ellen Van Driessche; Kurt Houf
Journal:  Int J Food Microbiol       Date:  2007-05-31       Impact factor: 5.277

8.  Polyphasic taxonomic study of the emended genus Arcobacter with Arcobacter butzleri comb. nov. and Arcobacter skirrowii sp. nov., an aerotolerant bacterium isolated from veterinary specimens.

Authors:  P Vandamme; M Vancanneyt; B Pot; L Mels; B Hoste; D Dewettinck; L Vlaes; C van den Borre; R Higgins; J Hommez
Journal:  Int J Syst Bacteriol       Date:  1992-07

9.  RNAmmer: consistent and rapid annotation of ribosomal RNA genes.

Authors:  Karin Lagesen; Peter Hallin; Einar Andreas Rødland; Hans-Henrik Staerfeldt; Torbjørn Rognes; David W Ussery
Journal:  Nucleic Acids Res       Date:  2007-04-22       Impact factor: 16.971

10.  Genetic Diversity and Incidence of Virulence-Associated Genes of Arcobacter butzleri and Arcobacter cryaerophilus Isolates from Pork, Beef, and Chicken Meat in Poland.

Authors:  Iwona Zacharow; Jarosław Bystroń; Ewa Wałecka-Zacharska; Magdalena Podkowik; Jacek Bania
Journal:  Biomed Res Int       Date:  2015-10-11       Impact factor: 3.411

View more
  1 in total

1.  Complete Genome Sequencing of Four Arcobacter Species Reveals a Diverse Suite of Mobile Elements.

Authors:  William G Miller; Emma Yee; James L Bono
Journal:  Genome Biol Evol       Date:  2020-02-01       Impact factor: 3.416

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.