Literature DB >> 25197460

High quality draft genome sequence of Staphylococcus cohnii subsp. cohnii strain hu-01.

XinJun Hu1, Ang Li1, LongXian Lv1, Chunhui Yuan1, Lihua Guo1, Xiawei Jiang1, Haiyin Jiang1, GuiRong Qian1, BeiWen Zheng1, Jing Guo1, LanJuan Li1.   

Abstract

Staphylococcus cohnii subsp. cohnii belongs to the family Staphylococcaceae in the order Bacillales, class Bacilli and phylum Firmicutes. The increasing relevance of S. cohnii to human health prompted us to determine the genomic sequence of Staphylococcus cohnii subsp. cohnii strain hu-01, a multidrug-resistant isolate from a hospital in China. Here we describe the features of S. cohnii subsp. cohnii strain hu-01, together with the genome sequence and its annotation. This is the first genome sequence of the species Staphylococcus cohnii.

Entities:  

Keywords:  Hiseq2000; Staphylococcus cohnii subsp. cohnii; genome

Year:  2014        PMID: 25197460      PMCID: PMC4149010          DOI: 10.4056/sigs.5429581

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

belongs to the Coagulase-Negative Staphylococci group. It was described by Schleifer and Kloos (1975) and was named for Ferdinand Cohn, a German botanist and bacteriologist [1]. Recently, more cases of infection have been reported in the literature. This organism may be responsible for brain abscess, pneumonia, acute cholecystitis, endocarditis, bacteremia, urinary tract infection and septic arthritis [2]. is comprised of two subspecies that are defined on the basis of their phenotypic characteristics: and [3]. Gram-positive coccus, coagulase negative and catalase positive, that behaves like a commensal mucocutaneous bacterium [4]. It has more frequently been isolated in hospital than in non-hospital environments [2]. Here we report this draft genome of strain hu-01, the first genome of this species to be sequenced.

Classification and features

Strain hu-01 was isolated from a hospital environment in Zhejiang province, China, in October 2012. It is a Gram-positive, coccus-shaped bacterium that can grow on 5% sheep blood enriched Columbia agar (BioMérieux, Marcyl’Etoile, France) at 37°C. Growth occurs under either aerobic or anaerobic conditions. The optimum temperature for growth is 37 ºC, with a temperature range of 15-45 ºC (Table 1). Cell morphology, motility and sporulation were examined by using transmission electron (H-600, Hitachi) microscopy. Cells of strain hu-01 are coccoidal, 0.6 to 1.2 μm in diameter, occurring predominantly singly or in pairs (Figure 1 and Figure 2).
Table1

Classification and general features of strain hu-01 according to the MIGS recommendations [9].

MIGS ID    Property     Term   Evidence codea
     Domain Bacteria   TAS [20]
     Phylum Firmicutes   TAS [21-23]
     Class Bacilli   TAS [24,25]
    Current classification     Order Bacillales   TAS [26,27]
     Family Staphylococcaceae   TAS [24,28]
     Genus Staphylococcus   TAS [26,29-31]
     Species Staphylococcus cohnii subsp. cohnii   TAS [1,3]
     Strain hu-01   IDA
    Gram stain     Positive   IDA
    Cell shape     coccus   IDA
    Motility     Nonmotile   IDA
    Sporulation     Nonsporulating   IDA
    Temperature range     15-45°C   IDA
    Optimum temperature     37°C   IDA
MIGS-6.3    Salinity     Tolerates 10% NaCl   IDA
MIGS-22    Oxygen     Facultatively anaerobic   IDA
    Carbon source     D-mannitol, fructose, trehalose   IDA
    Energy source     fructose, trehalose   IDA
MIGS-6    Habitat     Hospital environment   IDA
MIGS-15    Biotic relationship     Free living   IDA
MIGS-14    Pathogenicity     Opportunistic pathogen   IDA
    Isolation     Hospital   IDA
MIGS-4    Geographic location     Hangzhou, China   IDA
MIGS-5    Sample collection time     October, 2012   IDA
MIGS-4.1    Latitude     30°16’N   IDA
MIGS-4.2    Longitude     120°12’E   IDA
MIGS-4.3    Depth     unknown   IDA
MIGS-4.4    Altitude     50 (meters)   IDA

aEvidence codes-IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [32]. If the evidence code is IDA, then the property should have been directly observed, for the purpose of this specific publication, for a live isolate by one of the authors, or an expert or reputable institution mentioned in the acknowledgements.

Figure1

Gram staining of strain hu-01

Figure 2

Transmission electron micrograph of cells of strain hu-01. Bar: 0.5 µm

aEvidence codes-IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [32]. If the evidence code is IDA, then the property should have been directly observed, for the purpose of this specific publication, for a live isolate by one of the authors, or an expert or reputable institution mentioned in the acknowledgements. Gram staining of strain hu-01 Transmission electron micrograph of cells of strain hu-01. Bar: 0.5 µm Comparative 16S rRNA gene sequence analysis by BLASTN [5,6] using the NCBI-NR/NT database revealed 94-99% sequence similarity to members of genus . Neighbor-Joining phylogenetic analysis based on Kimura 2-parameter model indicated the strain hu-01 is most closely related the strain (AB009936.1) (Figure 3).
Figure 3

Phylogenetic tree depicting the relationship between and other members of the genus . The strains and their corresponding Genbank accession numbers are shown following the organism name and indicated in parentheses. The phylogenetic tree uses 16S rRNA gene sequences aligned by the CLUSTALW [7], and phylogenetic inferences were made using Neighbor-joining method based on Kimura 2-parameter model within the MEGA5 software [8] and rooted with . Bootstrap consensus trees were inferred from 100 replicates, only bootstrap values > 50% were indicated.

Phylogenetic tree depicting the relationship between and other members of the genus . The strains and their corresponding Genbank accession numbers are shown following the organism name and indicated in parentheses. The phylogenetic tree uses 16S rRNA gene sequences aligned by the CLUSTALW [7], and phylogenetic inferences were made using Neighbor-joining method based on Kimura 2-parameter model within the MEGA5 software [8] and rooted with . Bootstrap consensus trees were inferred from 100 replicates, only bootstrap values > 50% were indicated. Biochemical features were tested by using two automated systems, the Vitek2 Compact (bioMérieux, Marcy l'Etoile, France) and Phoenix 100 ID/AST system (Becton Dickinson Company [BD], Sparks, Maryland, USA). Positive reactions were obtained for D-fructose, trehalose, D-gluconic acid and D-mannitol. Negative reactions were observed for glucose, D-trehalose, D-sucrose, maltose, urea, cellobiose, glucoside, D-tagatose and maltotriose. This strain was susceptible to gentamicin, ciprofloxacin, levofloxacin, moxifloxacin, quinupristin, linezolid, vancomycin, tetracycline, tigecycline, nitrofurantoin, rifampicin, trimethoprim and resistant to cefoxitin, benzylpenicillin, oxacillin, erythromycin, clindamycin.

Genome sequencing information

Genome project history

S. conhii subsp. cohnii strain hu-01 was selected for sequencing because of its increasing relevance to human health. The strain was isolated from a hospital environment in China. This whole genome shotgun project of S. conhii subsp. cohnii strain hu-01 was deposited at DDBJ/EMBL/GenBank under the accession AYOS00000000. Table 2 presents the project information and its association with MIGS version 2.0 compliance [9].
Table 2

Project information

MIGS ID   Property    Term
MIGS-31   Finishing quality    High-quality draft
MIGS-28   Libraries used    One pair-end 500 bp library
MIGS-29   Sequencing platforms    Illumina HiSeq 2000
MIGS-31.2   Fold coverage    150×(based on 500 bp library)
MIGS-30   Assemblers    Velvet 1.2.07
MIGS-32   Gene calling method    Glimmer 3.0
   Genbank ID    AYOS00000000
   Genbank Date of Release    Jan 06, 2014
   GOLD ID    Gi0062613
MIGS-13   Project relevance    Biotechnology, Pathway, Pathogenic

Growth conditions and DNA isolation

S. conhii subsp .cohnii strain hu-01 was grown aerobically on Columbia blood agar base, at 37°C for 24h. Genomic DNA was extracted using the DNeasy blood and tissue kit (Qiagen, Germany), according to the manufacturer’s recommended protocol. The quantity of DNA was measured by the NanoDrop Spectrophotometer and Cubit. Then 10μg of DNA was sent to the State Key Laboratory for Diagnosis and Treatment of Infectious Disease at Zhejiang University for sequencing on a Hiseq2000 (Illumina, CA) sequencer.

Genome sequencing and assembly

One DNA library was generated (500 bp insert size, with the Illumina adapter at both ends, detected by Agilent DNA analyzer 2100), then sequencing was performed by using an Illumina Hieseq 2000 genomic sequencer, with a 2×100 pair end sequencing strategy. A total of 1,103 M bp of sequence data was produced which was assessed for quality by the following criteria: 1) Reads linked to adapters at both end were considered as sequencing artifacts then removed. 2) Bases with a quality index lower than Q20 at both ends were trimmed. 3) Reads with ambiguous bases (N) were removed. 4) Single qualified reads were discarded (In this situation, one read is qualified but its mate is not). A total of 867.94 M clean filtered reads were assembled into scaffolds using the Velvet version 1.2.07 with parameters "-scaffolds no" [10], then we used a PAGIT flow [11] to prolong the initial contigs and correct sequencing errors. to arrive at a set of improved scaffolds.

Genome annotation

Predict genes were identified using Glimmer version 3.0 [12],tRNAscan-SE version 1.21 [13] was used to find tRNA genes, whereas ribosomal RNAs were found by using RNAmmer version 1.2 [14]. To annotate predicted genes, we used HMMER version 3.0 [15], with parameters 'hmmscan -E 0.01 -domainE 0.01' to align genes against Pfam version 27.0 [16] (only pfam-A was used) to find genes with conserved domains. The KAAS server [17] was used to assign translated amino acids (with genetic code table 11) into KEGG Orthology with SBH (single-directional best hit) method. Translated genes were aligned with the COG database using NCBI blastp (hits should have scores no less than 60, e-value is no more than 1e-6). To find genes with hypothetical or putative function, we aligned genes against NCBI nucleotide sequence database (nt database was downloaded at Sep 20, 2013) by using NCBI blastn, only if hits have an identity of no less than 0.95, coverage no less than 0.9, and the reference gene had an annotation of putative or hypothetical. To define genes with signal peptide, we use signalp version 4.1 [18] to identify genes with signal peptide with default parameters except " -t gram+ ". TMHMM2.0 [19] was used to identify genes with transmembrane helices.

Genome properties

The draft genome sequence of S. conhii subsp. cohnii strain hu-01 revealed a genome size of 5,761,489 bp and a G+C content of 34.85% (521 scaffolds with N50 is 39,926 bp). These scaffolds contain 5,820 coding sequences (CDSs), 61 tRNAs (excluding 6 Pseudo tRNAs) and incomplete rRNA operons (10 small subunit rRNA and 3 large subunit rRNAs). A total of 1,840 protein-coding genes were assigned as putative function or hypothetical proteins. 3,734 genes were categorized into COGs functional groups. The properties and the statistics of the genome are summarized in Table 3 and Table 4.
Table 3

Genome statistics of strain hu-01

Attribute     Value     % of totala
Genome size (bp)     5,761,489     --
DNA coding region (bp)     4,751,472     82.469
DNA G+C content (bp)     1,697,984     29.471
Total genes     5,833     --
RNA genes     13     0.221
Protein-coding genes     5,820     99.777
Genes with function prediction     1,840     31.544
Genes assigned to COGs     3,734     64.015
Genes assigned to Pfam domains     4,943     84.741
Genes with signal peptides     431     7.388
Genes with transmembrane helices     1,629     27.927

a) The total is based on either the size of the genome in base pairs or the total number of genes in the annotated genome.

Table 4

Number of genes associated with the general COG functional categories

Code     Valuea     %ageb    Description
J     230     3.95    Translation, ribosomal structure and biogenesis
K     452     7.77    Transcription
L     184     3.16    Replication, recombination and repair
B     3     0.05    Chromatin structure and dynamics
D     72     1.24    Cell cycle control, cell division, chromosome partitioning
V     187     3.21    Defense mechanisms
T     238     4.09    Signal transduction mechanisms
M     254     4.36    Cell wall/membrane/envelope biogenesis
N     70     1.20    Cell motility
Z     1     0.02    Cytoskeleton
W     1     0.02    Extracellular structures
U     57     0.98    Intracellular trafficking, secretion, and vesicular transport
O     147     2.53    Posttranslational modification, protein turnover, chaperones
C     292     5.02    Energy production and conversion
G     384     6.60    Carbohydrate transport and metabolism
E     640     11.0    Amino acid transport and metabolism
F     140     2.41    Nucleotide transport and metabolism
H     234     4.02    Coenzyme transport and metabolism
I     165     2.84    Lipid transport and metabolism
P     389     6.68    Inorganic ion transport and metabolism
Q     197     3.38    Secondary metabolites biosynthesis, transport and catabolism
R     841     14.45    General function prediction only
S     403     6.92    Function unknown
--c     483     8.30    Not archived in COGs
--d     1603     27.54    No hits

a) For some genes, qualified alignments can occur with several genes belonging to different COG categories. In such cases only the best match to a single COG category is considered. b) The total is based on the total number of protein coding genes(5,820) in the annotated genome. c) These genes have alignments with reference genes archived in COG, but these reference genes do not have COG categories. d) Genes without a qualified hit to a reference genes.

a) The total is based on either the size of the genome in base pairs or the total number of genes in the annotated genome. a) For some genes, qualified alignments can occur with several genes belonging to different COG categories. In such cases only the best match to a single COG category is considered. b) The total is based on the total number of protein coding genes(5,820) in the annotated genome. c) These genes have alignments with reference genes archived in COG, but these reference genes do not have COG categories. d) Genes without a qualified hit to a reference genes.

Conclusion

are part of the normal flora of human skin and mucous membranes which, in particular conditions, may become opportunistic pathogens [4]. The genome sequence of strain hu-01 will provide the basis to elucidate the molecular principles of host colonization and insight into the genetic background of this organism’s pathogenesis.
  21 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

2.  SignalP 4.0: discriminating signal peptides from transmembrane regions.

Authors:  Thomas Nordahl Petersen; Søren Brunak; Gunnar von Heijne; Henrik Nielsen
Journal:  Nat Methods       Date:  2011-09-29       Impact factor: 28.547

3.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

4.  Multiple sequence alignment using ClustalW and ClustalX.

Authors:  Julie D Thompson; Toby J Gibson; Des G Higgins
Journal:  Curr Protoc Bioinformatics       Date:  2002-08

5.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

6.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

Review 7.  Bacteremia due to Staphylococcus cohnii ssp. urealyticus caused by infected pressure ulcer: case report and review of the literature.

Authors:  Jonathan Soldera; Wagner Luis Nedel; Paulo Ricardo Cerveira Cardoso; Pedro Alves d'Azevedo
Journal:  Sao Paulo Med J       Date:  2013       Impact factor: 1.044

8.  The minimum information about a genome sequence (MIGS) specification.

Authors:  Dawn Field; George Garrity; Tanya Gray; Norman Morrison; Jeremy Selengut; Peter Sterk; Tatiana Tatusova; Nicholas Thomson; Michael J Allen; Samuel V Angiuoli; Michael Ashburner; Nelson Axelrod; Sandra Baldauf; Stuart Ballard; Jeffrey Boore; Guy Cochrane; James Cole; Peter Dawyndt; Paul De Vos; Claude DePamphilis; Robert Edwards; Nadeem Faruque; Robert Feldman; Jack Gilbert; Paul Gilna; Frank Oliver Glöckner; Philip Goldstein; Robert Guralnick; Dan Haft; David Hancock; Henning Hermjakob; Christiane Hertz-Fowler; Phil Hugenholtz; Ian Joint; Leonid Kagan; Matthew Kane; Jessie Kennedy; George Kowalchuk; Renzo Kottmann; Eugene Kolker; Saul Kravitz; Nikos Kyrpides; Jim Leebens-Mack; Suzanna E Lewis; Kelvin Li; Allyson L Lister; Phillip Lord; Natalia Maltsev; Victor Markowitz; Jennifer Martiny; Barbara Methe; Ilene Mizrachi; Richard Moxon; Karen Nelson; Julian Parkhill; Lita Proctor; Owen White; Susanna-Assunta Sansone; Andrew Spiers; Robert Stevens; Paul Swift; Chris Taylor; Yoshio Tateno; Adrian Tett; Sarah Turner; David Ussery; Bob Vaughan; Naomi Ward; Trish Whetzel; Ingio San Gil; Gareth Wilson; Anil Wipat
Journal:  Nat Biotechnol       Date:  2008-05       Impact factor: 54.908

9.  The Pfam protein families database.

Authors:  Marco Punta; Penny C Coggill; Ruth Y Eberhardt; Jaina Mistry; John Tate; Chris Boursnell; Ningze Pang; Kristoffer Forslund; Goran Ceric; Jody Clements; Andreas Heger; Liisa Holm; Erik L L Sonnhammer; Sean R Eddy; Alex Bateman; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2011-11-29       Impact factor: 16.971

10.  NCBI BLAST: a better web interface.

Authors:  Mark Johnson; Irena Zaretskaya; Yan Raytselis; Yuri Merezhuk; Scott McGinnis; Thomas L Madden
Journal:  Nucleic Acids Res       Date:  2008-04-24       Impact factor: 16.971

View more
  4 in total

1.  Draft Genome Sequence of Staphylococcus gallinarum DSM 20610T, Originally Isolated from the Skin of a Chicken.

Authors:  Ding Shi; Daiqiong Fang; Xinjun Hu; Ang Li; Longxian Lv; Jing Guo; Yanfei Chen; Wenrui Wu; Feifei Guo; Lanjuan Li
Journal:  Genome Announc       Date:  2015-06-04

2.  Genome sequencing, assembly, annotation and analysis of Staphylococcus xylosus strain DMB3-Bh1 reveals genes responsible for pathogenicity.

Authors:  Gurwinder Kaur; Amit Arora; Shanmugam Mayilraj; Javed N Agrewala; Sathyaseelan Sathyabama; Nida Mubin; Sheenam Verma
Journal:  Gut Pathog       Date:  2016-11-08       Impact factor: 4.181

3.  Phenotypic and molecular assessment of antimicrobial resistance profile of airborne Staphylococcus spp. isolated from flats in Kraków.

Authors:  Anna Lenart-Boroń; Katarzyna Wolny-Koładka; Katarzyna Juraszek; Andrzej Kasprowicz
Journal:  Aerobiologia (Bologna)       Date:  2017-04-10       Impact factor: 2.410

4.  Draft genome sequences of two opportunistic pathogenic strains of Staphylococcus cohnii isolated from human patients.

Authors:  Soraya Mendoza-Olazarán; José F Garcia-Mazcorro; Rayo Morfín-Otero; Licet Villarreal-Treviño; Adrián Camacho-Ortiz; Eduardo Rodríguez-Noriega; Paola Bocanegra-Ibarias; Héctor J Maldonado-Garza; Scot E Dowd; Elvira Garza-González
Journal:  Stand Genomic Sci       Date:  2017-08-31
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.