Literature DB >> 31699768

Complete Genome Sequences of the Campylobacter fetus subsp. venerealis, Campylobacter lari subsp. concheus, Campylobacter sputorum bv. sputorum, and Campylobacter volucris Type Strains.

William G Miller1, Emma Yee2.   

Abstract

Campylobacter spp. are recovered from a wide variety of sources, including birds, livestock, shellfish, and human clinical samples. We present here the complete genomic data for the type strains of Campylobacter fetus subsp. venerealis, Campylobacter lari subsp. concheus, Campylobacter sputorum bv. sputorum, and Campylobacter volucris.

Entities:  

Year:  2019        PMID: 31699768      PMCID: PMC6838626          DOI: 10.1128/MRA.01157-19

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

The genus Campylobacter currently contains 37 taxa comprising 31 validly described species, with five of these species further divided into 11 subspecies. Genomic data exist for the type strains of 34 Campylobacter taxa (e.g., see references 1–8). Such information would be valuable as baseline reference data for each species. Although genomic data exist for strains of the remaining three taxa (i.e., Campylobacter lari subsp. concheus, Campylobacter sputorum, and Campylobacter volucris [7, 9]), no genomic data, draft or complete, exist for their type strains. Additionally, while the type genome of Campylobacter fetus subsp. venerealis has been sequenced (10), a complete genome (one gapless contig) is recommended due to the presence of both mobile elements and a suite of S-layer genes with repeated motifs in Campylobacter fetus subsp. venerealis strains (11, 12). In this study, we report the closed genome sequences of Campylobacter lari subsp. concheus LMG 21009T, Campylobacter sputorum bv. sputorum LMG 7795T, Campylobacter volucris LMG 24380T, and Campylobacter fetus subsp. venerealis NCTC 10354T. The four type strains were obtained from the NCTC (C. fetus subsp. venerealis) or BCCM/LMG (C. lari subsp. concheus, C. sputorum bv. sputorum, and C. volucris) culture collections. All strains were grown, both initially and for one subculture, microaerobically, at 37°C for 48 h on brain heart infusion agar (Thermo Fisher Scientific, Waltham, MA) amended with 5% laked horse blood. Genomic DNAs (gDNAs) were prepared from loopfuls (∼5 μl) of cells using the Promega Wizard genomic DNA purification kit. For each strain, a single preparation of gDNA was used to construct the Illumina and PacBio libraries. Each Illumina library was prepared using the Illumina Nextera DNA Flex library prep kit with 20 ng of gDNA. Pooled libraries were sequenced on a MiSeq instrument at 8.0 pM with dual-index paired-end reads using the MiSeq reagent kit v2 (300 cycles). Each 20-kb PacBio library was prepared using 15 μg of gDNA, the PacBio SMRTbell template prep kit v1.0, and the manufacturer’s protocols. Single-molecule real-time (SMRT) sequencing was performed on an RS II sequencer. PacBio reads were assembled using the Hierarchical Genome Assembly Process (HGAP) v3.0 in SMRT Analysis software v2.3.0. Default parameters were used for all software unless otherwise specified. The sequencing metrics are presented in Table 1. SMRT sequencing resulted in a single chromosomal contig for each of the four strains; for two strains (NCTC 10354T and LMG 24380T), single plasmid contigs were also present. Almost all of the low-quality (Q < 40) bases in these contigs were at the contig ends. Therefore, contig circularization removed these low-quality bases; all contigs were circularized manually with Geneious Prime v2019.1.3 (Biomatters Ltd., Auckland, New Zealand). Base calling was further improved by mapping the quality-trimmed MiSeq reads onto the circularized PacBio contigs within Geneious Prime. Using the Geneious “find variations/SNPs” module with a default minimum variation of 0.3, variations in each genome were identified and corrected to the MiSeq consensus sequence. The final coverages ranged from 280 to 1,233×.
TABLE 1

Sequencing metrics and genomic data

FeatureaValue(s) forb:
C. fetus subsp. venerealis NCTC 10354TC. lari subsp. concheus LMG 21009TC. sputorum bv. sputorum LMG 7795TC. volucris LMG 24380T
SourceHeifer, vaginalShellfishHuman, oralBlack-headed gull
Sequencing metrics
    Illumina MiSeq platform
        No. of reads1,460,7381,699,5321,651,4581,702,578
        No. of bases216,732,966253,024,312243,920,520252,662,467
        Avg length (bases)148149148148
        Coverage (×)115170139166
    PacBio platform
        No. of reads188,065328,08445,750198,092
        No. of bases1,331,563,1781,582,521,489246,714,4481,189,853,205
        Avg length (bases)7,080.34,823.55,392.76,006.6
        Coverage (×)7061,063141780
Accession no.
    GenBank (chromosome)CP043435CP043426CP043427CP043428
    GenBank (plasmid)CP043436NANACP043429
    NCBI SRASRP219907SRP219934SRP219932SRP219757
Genomic data
    Chromosome
        Size (kbp)c 1,885.71,488.51,753.41,524.9
        G+C content (%)33.2929.7729.7128.57
        No. of CDSd 1,8081,4431,6621,482
            Assigned function (% CDS)915 (50.6)865 (59.9)914 (55.0)873 (58.9)
            General function annotation (% CDS)510 (28.2)345 (23.9)430 (25.9)357 (24.1)
            Domain/family annotation only (% CDS)106 (5.9)96 (6.7)99 (6.0)87 (5.9)
            Hypothetical (% CDS)277 (15.3)137 (9.5)219 (13.2)165 (11.1)
        No. of pseudogenes50185219
        No. of GC tracts ≥8 bp (no. hypervariable)33 (28)13 (13)17 (11)17 (17)
    Plasmid (size [bp])p3226 (27,915)NoNop9726 (7,145)
    Genomic island(s)/CRISPR
        No. of genetic islands5011
        No. of CDS in genetic islands165 [9]022 [1]48 [1]
        CRISPR/Cas lociType III0Type I-B0
    No. of insertion sequence elements6; 30[1]0
    Gene content/pathways
        Signal transduction
            No. of Che proteins6797
            No. of methyl-accepting chemotaxis proteins10 [5]; 011 [2]10 [5]13 [1]; 0
            No. of RRs13; 09159; 0
            No. of HKs9 [1]; 0612 [2]6; 0
            No. of RR/HK fusions1; 0000
            No. of diguanylate cyclases/phosphodiesterases9; 0272; 0
            No. of other signal transduction genes2; 023 [1]2; 0
        Motility
            Flagellin genesflaABflaABfla1-2flaAB
        Restriction/modification systems
            No. of type I systems (hsd)1; 011[1]; 0
            No. of type II systems1; 0233; 0
            No. of type III systems0000
        Transcription/translation
            No. of transcriptional regulatory proteins25 [2]; 022 [1]24 [1]18; 0
            Sigma factorsσ28, σ54, σ70σ28, σ54, σ70σ28, σ54, σ70σ28, σ54, σ70
            No. of tRNAs, ribosomal loci44, 346, 347, 346, 3
        Catalase gene (katA)YesYesNoYes
        Cytolethal distending toxin genes (cdtABC)Yes (3×)YesNoYes
        Histidine biosynthesis genese IIIIII
    Nuo clusterf IIIIIIII
    Tryptophan biosynthesis [trpABC(DG)EF]YesNoYesNo

RRs, response regulators; HKs, histidine kinases.

Numbers in brackets indicate pseudogenes or fragments. Values following a semicolon are plasmid borne. NA, not applicable.

Size listed in kilobase pairs due to length variation at the hypervariable GC tracts.

Numbers do not include pseudogenes. CDS, coding sequences.

I represents genes hisA, hisB, hisC, hisD, hisF, hisG(S), hisH, hisIE, hisJ, and hisZ; II represents genes hisA, hisBJ, hisC, hisD, hisF, hisG(L), hisH, and hisIE.

I represents nuoA-nuoB-nuoCD-nuoE-nuoF-nuoG-nuoH-nuoI-nuoJ-nuoK-nuoL-nuoM-nuoN; II represents genes nuoA-nuoB-nuoC-nuoD-ORF-ORF-nuoG-nuoH-nuoI-nuoJ-nuoK-nuoL-nuoM-nuoN; and III represents genes nuoA-nuoB-nuoC-nuoD-nuoE-nuoF-nuoG-nuoH-nuoI-nuoJ-nuoK-nuoL-nuoM-nuoN.

Sequencing metrics and genomic data RRs, response regulators; HKs, histidine kinases. Numbers in brackets indicate pseudogenes or fragments. Values following a semicolon are plasmid borne. NA, not applicable. Size listed in kilobase pairs due to length variation at the hypervariable GC tracts. Numbers do not include pseudogenes. CDS, coding sequences. I represents genes hisA, hisB, hisC, hisD, hisF, hisG(S), hisH, hisIE, hisJ, and hisZ; II represents genes hisA, hisBJ, hisC, hisD, hisF, hisG(L), hisH, and hisIE. I represents nuoA-nuoB-nuoCD-nuoE-nuoF-nuoG-nuoH-nuoI-nuoJ-nuoK-nuoL-nuoM-nuoN; II represents genes nuoA-nuoB-nuoC-nuoD-ORF-ORF-nuoG-nuoH-nuoI-nuoJ-nuoK-nuoL-nuoM-nuoN; and III represents genes nuoA-nuoB-nuoC-nuoD-nuoE-nuoF-nuoG-nuoH-nuoI-nuoJ-nuoK-nuoL-nuoM-nuoN. The genomic data are presented in Table 1. Protein-, rRNA-, and tRNA-encoding genes were identified using GeneMark, RNAmmer, and ARAGORN (13–15), respectively, and annotated as described previously (16). The genome sizes, G+C content values, and gene contents of these four strains are similar to those reported for other strains of the same taxa (7, 9, 12). With these genome sequences, genomic data are now available for type strains representing all current taxa in the genus Campylobacter.

Data availability.

The complete genome sequences of the four Campylobacter type strains have been deposited in GenBank, and all MiSeq and PacBio sequencing reads have been deposited in the NCBI Sequence Read Archive (SRA). The accession numbers are provided in Table 1.
  16 in total

1.  Complete genome sequence of type strain Campylobacter fetus subsp. venerealis NCTC 10354T.

Authors:  Ana Paula Reinato Stynen; Andrey Pereira Lage; Robert J Moore; Antonio Mauro Rezende; Vivian D'Afonseca da Silva de Resende; Patricia de Cássia Ruy; Nesley Daher; Daniela de Melo Resende; Sintia Silva de Almeida; Siomar de Castro Soares; Vinicius Augusto Carvalho de Abreu; Aryane Aparecida C Magalhães Rocha; Anderson Rodrigues dos Santos; Eudes Guilherme Vieira Barbosa; Danielle Fonseca Costa; Fernanda Alves Dorella; Anderson Miyoshi; Alex Ranieri Jerônimo de Lima; Frederico Davi da Silva Campos; Pablo Gomes de Sá; Thiago Souza Lopes; Ryan Mauricio Araujo Rodrigues; Adriana Ribeiro Carneiro; Thiago Leão; Louise Teixeira Cerdeira; Rommel Thiago Jucá Ramos; Artur Silva; Vasco Azevedo; Jerônimo C Ruiz
Journal:  J Bacteriol       Date:  2011-10       Impact factor: 3.490

2.  Inconsistency of phenotypic and genomic characteristics of Campylobacter fetus subspecies requires reevaluation of current diagnostics.

Authors:  Linda van der Graaf-van Bloois; William G Miller; Emma Yee; Martine Rijnsburger; Jaap A Wagenaar; Birgitta Duim
Journal:  J Clin Microbiol       Date:  2014-09-17       Impact factor: 5.948

3.  GeneMark: web software for gene finding in prokaryotes, eukaryotes and viruses.

Authors:  John Besemer; Mark Borodovsky
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

4.  Complete Genome Sequence of Campylobacter fetus subsp. testudinum Strain 03-427T.

Authors:  Maarten J Gilbert; William G Miller; Emma Yee; Martin J Blaser; Jaap A Wagenaar; Birgitta Duim
Journal:  Genome Announc       Date:  2013-12-12

5.  First Closed Genome Sequence of Campylobacter fetus subsp. venerealis bv. intermedius.

Authors:  Linda van der Graaf-van Bloois; William G Miller; Emma Yee; James L Bono; Martine Rijnsburger; Carlos Campero; Jaap A Wagenaar; Birgitta Duim
Journal:  Genome Announc       Date:  2014-02-06

6.  Comparative genomics of the Campylobacter lari group.

Authors:  William G Miller; Emma Yee; Mary H Chapman; Timothy P L Smith; James L Bono; Steven Huynh; Craig T Parker; Peter Vandamme; Khai Luong; Jonas Korlach
Journal:  Genome Biol Evol       Date:  2014-11-08       Impact factor: 3.416

7.  Complete Genome Sequence of the Campylobacter helveticus Type Strain ATCC 51209.

Authors:  William G Miller; Emma Yee; James L Bono
Journal:  Genome Announc       Date:  2017-05-25

8.  Comparative Genomics of All Three Campylobacter sputorum Biovars and a Novel Cattle-Associated C. sputorum Clade.

Authors:  William G Miller; Emma Yee; Mary H Chapman; James L Bono
Journal:  Genome Biol Evol       Date:  2017-06-01       Impact factor: 3.416

9.  Complete Genome Sequence of the Arcobacter molluscorum Type Strain LMG 25693.

Authors:  William G Miller; Emma Yee; James L Bono
Journal:  Microbiol Resour Announc       Date:  2018-10-25

10.  Complete Genome Sequence of Campylobacter gracilis ATCC 33236T.

Authors:  William G Miller; Emma Yee
Journal:  Genome Announc       Date:  2015-09-17
View more
  1 in total

1.  Complete Genome Sequencing of Four Arcobacter Species Reveals a Diverse Suite of Mobile Elements.

Authors:  William G Miller; Emma Yee; James L Bono
Journal:  Genome Biol Evol       Date:  2020-02-01       Impact factor: 3.416

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.