Literature DB >> 25197487

Non-contiguous finished genome sequence of Staphylococcus capitis CR01 (pulsetype NRCS-A).

H Lemriss1, S Lemriss2, M Butin3, A Ibrahimi4, S El Kabbaj2, Jp Rasigade5, F Laurent6.   

Abstract

Staphylococcus capitis is a coagulase-negative staphylococcus (CoNS) commonly found in the human microflora. Recently, a clonal population of Staphylococcus capitis (denominated NRCS-A) was found to be a major cause of late-onset sepsis (LOS) in several neonatal intensive care units in France. Here, we report the complete genome sequence and annotation of the prototype Staphylococcus capitis NCRS-A strain CR01. The 2,504,472 bp long genome (1 chromosome and no plasmids) exhibits a G+C content of 32.81%, and contains 2,468 protein-coding and 59 tRNA genes and 4 rRNA genes.

Entities:  

Keywords:  Staphylococcus capitis (NCRS-A); draft-genome; late-onset sepsis; methicillin resistance

Year:  2014        PMID: 25197487      PMCID: PMC4149024          DOI: 10.4056/sigs.5491045

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

A frequent cause of low-weight newborns mortality and morbidity in Neonatal Intensive Care Units (NICUs) are late-onset sepsis (LOS), that are defined as sepsis occurring after 3 days of age. The most frequently encountered pathogens are coagulase-negative staphylococci (CoNS) and within those has been shown to be the most prevalent [1,2]. However, a few studies have reported the emergence of as a main CoNS- and LOS- causative pathogen in NICU settings [2-4]. A study in French NICUs [2] has demonstrated the spread of a single clonal population of methicillin-resistant (pulsotype NRCS-A) associated to reduced susceptibility to vancomycin, the first line of antibiotics used in cases of LOS. Moreover, this clone has also been recently identified in NICUs in Belgium, United Kingdom and Australia, which suggests a worldwide distribution. In contrast, in adult bacteremia, are rarely found and when detected, it presents a bigger diversity in terms of genotypes as well as antimicrobial susceptibility profiles than neonates bacteremia. In order to elucidate the molecular mechanisms behind the wide spreading of the NRCS-A clone in NICUs throughout the world, we sequenced a prototype strain (CR01).

Classification and information

A strain belonging to the clonal population of NCRS-A pulsetype (Table 1) was isolated from the blood culture of a preterm infant with LOS, hospitalized in the NICU of the Northern Hospital Group Center (Hospices Civils de Lyon, Lyon, France) and suffering of LOS.
Table 1

Classification and general features of strain CR01, pulsetype-NRCS-A according the MIGS recommendation [5].

MIGS IDProperty   Term   Evidence codea
Current classification   Domain Bacteria   Phylum Firmicutes   Class Bacilli   Order Bacillales   Family Staphylococcaceae   Genus Staphylococcus   Species Staphylococcus capitis   Strain CR01, pulsetype-NRCS-A   TAS [6]   TAS [7,8]   TAS [9,10]   TAS [11,12]   TAS [13,14]   TAS [11,15,16]   TAS [17]   TAS [2,18]
Gram stain   Positive   TAS [19]
Cell shape   Coccoid   TAS [17]
Motility   Non-motile   TAS [19]
Sporulation   Non-sporuating   TAS [19]
Temperature range   Mesophilic   IDA
Optimum temperature   37°C   TAS [17]
Carbon source   Carbohydrates (glucose,sacharose,fructose,manitol,mannose)   TAS [17]
Energy source   Chemoorganotropic   TAS [17]
Terminal electron receptor   O2   TAS [17]
MIGS-6Habitat   Skin of humans   TAS [17]
MIGS-6.3Salinity   Physiological   TAS [17]
MIGS-22Oxygen   Facultative anaerobes   TAS [17]
MIGS-15Biotic relationship   Free-living   TAS [17]
MIGS-14Pathogenicity   Opportunistic pathogen (Nosocomial bacteremia in premature neonates)   TAS [2]
MIGS-4Geographic location   NICU Lyon, France   TAS [2]
MIGS-5Sample collection time   2007   IDA
MIGS-4.1 MIGS-4.2Latitude – Longitude   45° 45' 35" N 4° 50' 32" E   IDA
MIGS-4.3Depth   Not applicable   IDA
MIGS-4.4Altitude   162 m   IDA

a) Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [20].

a) Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [20]. Species identification of the bacterial isolates and antimicrobial susceptibility testing (AST) were performed, respectively, using Vitek MS (bioMérieux, Marcy l’Etoile), 16S rDNA sequencing, the automated BD Phoenix system (Becton Dickinson, Sparks, MD) and with Shimadzu-MALDI-TOF MS system (Shimadzu Corporation), as implemented on [21]. The strain was identified as being a Staphylococcus capitis by VITEK MS with 99.9% and at 93.7% by the MALDI-TOF MS, using the Shimadzu Launchpad software program and the SARAMIS database application (AnagnosTec GmbH) for automatic measurement and identification (Figure 1). Based on the information provided by the manufacture, when the score is ≥70%, identification is considered of high confidence.
Figure 1

Reference mass spectrum from strain (CR01).

Reference mass spectrum from strain (CR01). The antimicrobial susceptibility test (AST) results were analyzed according to the recommendations of the French Microbiology Society [22]. The bacteremia was considered positive based on a single positive blood culture [2,23]. The NCRS-A isolate CR01, as all isolates from this clone, is resistant to penicillin, methicillin, gentamicin, rifampicin, hetero-resistant to vancomycin and sensitive to fusidic acid and fluoroquinolones. Table 1, Figure 2 and Figure 3 show detailed information concerning general features of strain (CR01) and position within the genus .
Figure 2

Transmission electron microscopy of Staphylococcus capitis strain (CR01) using a JEOL 1400. The scale bar represents 200 nm.

Figure 3

16S rRNA Phylogenetic tree highlighting the position of strain CR01 (indicated by the yellow circle) relative to other type strains within the genus . All 16S rRNA sequences were obtained from the RDP database using as filtering criteria: sequences with more than 1200 nt and classified as “good” quality sequences. The tree uses sequences aligned with the MUSCLE software, with the default parameters as implemented on Seaview version 4 [24], and a tree was inferred based on 1285 sites using the distance model of observed divergence, as implemented in the BioNJ algorithm.

Transmission electron microscopy of Staphylococcus capitis strain (CR01) using a JEOL 1400. The scale bar represents 200 nm. 16S rRNA Phylogenetic tree highlighting the position of strain CR01 (indicated by the yellow circle) relative to other type strains within the genus . All 16S rRNA sequences were obtained from the RDP database using as filtering criteria: sequences with more than 1200 nt and classified as “good” quality sequences. The tree uses sequences aligned with the MUSCLE software, with the default parameters as implemented on Seaview version 4 [24], and a tree was inferred based on 1285 sites using the distance model of observed divergence, as implemented in the BioNJ algorithm. The 16S rRNA sequences were aligned using the MUSCLE software, with the default parameters as implemented on Seaview version 4 [24], and a tree was inferred based on 1285 sites using the distance model of observed divergence, as implemented in the BioNJ algorithm, and a bootstrapping process repeated 500 times. The final tree was rooted using the 16S rRNA sequence of Type strain that belongs to a closely-related sister genus.

Genome sequencing information

The genome sequence of strain CR01 was determined by high-throughput sequencing performed on a Genome Sequencer FLX + system (454 Life Sciences/Roche) using FLX Titanium reagents according to the manufacturer's protocols and instructions, with approximately 47-fold coverage of the genome. This platform provides longer read lengths than other sequencing platforms to obtain raw sequences. De novo assemblies were performed using the Roche Newbler (v 2.7) software package.

Genome project history

Table 2 presents the project information and its association with MIGS version 2.0 compliance [5].
Table 2

Project information

MIGS IDProperty   Term
MIGS-31Finishing quality   Non-contiguous finished
MIGS-28Libraries used   454 pyrosequence rapid library
MIGS-29Sequencing platforms   454 GS FLX+
MIGS-31.2Fold coverage   47.0 × pyrosequence
MIGS-30Assemblers   Newbler Assembler 2.7
GenBank   CBUB000000000.1

Growth conditions and DNA isolation

The sample was prepared for sequencing by growing CR01, aerobically at 37°C in Blood Agar for 24-48 hours. Genomic DNA was extracted using the PureLinkTM genomic DNA kit (InvitrogenTM) according to the manufacturer’s recommended protocol. The quantity of DNA obtained was determined using a NanoVue TM Plus (HVD Life Sciences), and 1 µg of DNA was used for sequencing of whole-genome of this strain.

Genome sequencing and assembly

The isolated DNA of CR01, was used to create 454-shotgun libraries following the GS Rapid library protocol (Roche 454, Roche). The resulting 454 DNA libraries were sequenced using a whole-genome shotgun strategy by GS FLX Titanium sequencing kit XL+ [25] (202,108 reads totaling 2.5 Mb, X48 fold coverage of the genome). Genome sequences were processed by Roche’s sequencing software according to the manufacturer's instructions (454 Life Science). The resulting shotgun reads were assembled de novo using the Roche Newbler assembly software 2.7 (454 Life Science) and 26 large contigs (Contig00001 to Contig00026) were obtained. The N50 was 176239 bp.

Genome annotation

An automatic syntactic and functional annotation of the draft genome was performed using the MicroScope platform pipeline [26,27]. The syntactic analysis combines a set of programs including AMIGene [28], tRNAscan-SE [29], RNAmmer [30], Rfam scan [31] and Prodigal software [32] to predict genomic objects that are mainly CDSs and RNA genes. More than 20 bioinformatics methods are then used for functional and relational analyses: homology search in the generalist databank UniProt [33] and in more specialized databases as COG [34], InterPro [35], PRIAM profiles for enzymatic classification [36], prediction of protein localization using TMHMM [37], SignalP [38] and PsortB [39] tools.

Genome properties

The genome includes one circular chromosome of 2,504,472 bp (32.81% GC content). A total of 2,565 genes were predicted with 2,453 being protein-coding genes, 59 tRNA-enconding genes, 4 rRNA-encoding genes (including 2 copies of 5S rRNA, 1 copy of both the large and the small-subunits, respectively, 23S and 16S rRNA) and 34 other RNA related ORFs. No plasmid was detected. Of the 2,453 protein-coding genes, 1,892 genes (76.7%) were assigned to a putative function with the remaining annotated as hypothetical proteins. The predicted coding density in strain CR01 was 86%. Table 3 and 4 and Figure 4 detailed description of the properties and the statistics of strain CR01 genome. The distribution of the genes into COGs functional categories is presented in Table 5.
Table 4

Nucleotide content and gene count levels of the genome

Attribute   Value   % of totala
Genome size (bp)   2.504.472   100.00%
DNA G+C content (bp)   821.717   32.81%
DNA coding region (bp)   2.158.855   6.2%
Number of Scaffolds   26   -
Total genesb   2566   100.00%
RNA genes   97   4.00%
tRNA-enconding genes   59   2.30%
rRNA-encoding genes   4   0.20%
Protein-coding genes (CDS)   2454   96.00%
Genes assigned to COGs   1999   81.00%
Genes of unknown function   561   23.34%
Genes with transmembrane helicesc   630   25.70%
CRISPR repeats   1   -

a) The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.

b) Total number of genes includes CDS, RNA genes and pseudogenes.

c) Detection of transmembrane helices was performed using TMHMM v. 2.0 [40]

Figure 4

Graphical circular map of the chromosome. From outside to the center: Genes on the forward strand (colored by COG categories), genes on the reverse strand colored by COG categories), RNA genes (tRNAs green, rRNAs blue), GC content, and GC skew

Table 5

Number of genes associated with general COG functional categories.

CodeValue%agea   Description
J1787.21   Translation
K1847.46   Transcription
L1696.85   Replication, recombination and repair
D291.18   Cell cycle control, mitosis and meiosis
V853.44   Defense mechanisms
T893.61   Signal transduction mechanisms
M1134.58   Cell wall/membrane biogenesis
N230.9   Cell motility
W10.04   Extracellular structures
U331.34   Intracellular trafficking and secretion
O863.48   Posttranslational modification, protein turnover, chaperones
C1445.83   Energy production and conversion
G2108.51   Carbohydrate transport and metabolism
E37014.99   Amino acid transport and metabolism
F923.73   Nucleotide transport and metabolism
H1094.42   Coenzyme transport and metabolism
I923.73   Lipid transport and metabolism
P27010.94   Inorganic ion transport and metabolism
Q562.27   Secondary metabolites biosynthesis, transport and catabolism
R42517.22   General function prediction only
S2038.23   Function unknown
-46919.00   Not in COGs

a) The total is based on the total number of protein coding genes in the annotated genome.

a) The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome. b) Total number of genes includes CDS, RNA genes and pseudogenes. c) Detection of transmembrane helices was performed using TMHMM v. 2.0 [40] Graphical circular map of the chromosome. From outside to the center: Genes on the forward strand (colored by COG categories), genes on the reverse strand colored by COG categories), RNA genes (tRNAs green, rRNAs blue), GC content, and GC skew a) The total is based on the total number of protein coding genes in the annotated genome.

Conclusion

Here, we described a new genome sequence of (strain CR01 belonging to NRCS-A clone) as a first step toward comparing its content with other sequenced genomes as well as CoNS genomes of species associated with late-onset sepsis. Detailed analyses are in progress to identify virulence factors and mobile genetic elements (MBE), such as the staphylococcal chromosome cassette (SCCmec) [18], potentially related to the high specificity of the NRCS-A clone to the NICU environment.
  29 in total

1.  AMIGene: Annotation of MIcrobial Genes.

Authors:  Stéphanie Bocs; Stéphane Cruveiller; David Vallenet; Grégory Nuel; Claudine Médigue
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  Improved prediction of signal peptides: SignalP 3.0.

Authors:  Jannick Dyrløv Bendtsen; Henrik Nielsen; Gunnar von Heijne; Søren Brunak
Journal:  J Mol Biol       Date:  2004-07-16       Impact factor: 5.469

Review 3.  Updated review of blood culture contamination.

Authors:  Keri K Hall; Jason A Lyman
Journal:  Clin Microbiol Rev       Date:  2006-10       Impact factor: 26.132

4.  A hidden Markov model for predicting transmembrane helices in protein sequences.

Authors:  E L Sonnhammer; G von Heijne; A Krogh
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1998

5.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

6.  The ica operon and biofilm production in coagulase-negative Staphylococci associated with carriage and disease in a neonatal intensive care unit.

Authors:  G D I de Silva; M Kantzanou; A Justice; R C Massey; A R Wilkinson; N P J Day; S J Peacock
Journal:  J Clin Microbiol       Date:  2002-02       Impact factor: 5.948

7.  Distribution and persistence of Staphylococcus and Micrococcus species and other aerobic bacteria on human skin.

Authors:  W E Kloos; M S Musselwhite
Journal:  Appl Microbiol       Date:  1975-09

8.  The minimum information about a genome sequence (MIGS) specification.

Authors:  Dawn Field; George Garrity; Tanya Gray; Norman Morrison; Jeremy Selengut; Peter Sterk; Tatiana Tatusova; Nicholas Thomson; Michael J Allen; Samuel V Angiuoli; Michael Ashburner; Nelson Axelrod; Sandra Baldauf; Stuart Ballard; Jeffrey Boore; Guy Cochrane; James Cole; Peter Dawyndt; Paul De Vos; Claude DePamphilis; Robert Edwards; Nadeem Faruque; Robert Feldman; Jack Gilbert; Paul Gilna; Frank Oliver Glöckner; Philip Goldstein; Robert Guralnick; Dan Haft; David Hancock; Henning Hermjakob; Christiane Hertz-Fowler; Phil Hugenholtz; Ian Joint; Leonid Kagan; Matthew Kane; Jessie Kennedy; George Kowalchuk; Renzo Kottmann; Eugene Kolker; Saul Kravitz; Nikos Kyrpides; Jim Leebens-Mack; Suzanna E Lewis; Kelvin Li; Allyson L Lister; Phillip Lord; Natalia Maltsev; Victor Markowitz; Jennifer Martiny; Barbara Methe; Ilene Mizrachi; Richard Moxon; Karen Nelson; Julian Parkhill; Lita Proctor; Owen White; Susanna-Assunta Sansone; Andrew Spiers; Robert Stevens; Paul Swift; Chris Taylor; Yoshio Tateno; Adrian Tett; Sarah Turner; David Ussery; Bob Vaughan; Naomi Ward; Trish Whetzel; Ingio San Gil; Gareth Wilson; Anil Wipat
Journal:  Nat Biotechnol       Date:  2008-05       Impact factor: 54.908

9.  MaGe: a microbial genome annotation system supported by synteny results.

Authors:  David Vallenet; Laurent Labarre; Zoé Rouy; Valérie Barbe; Stéphanie Bocs; Stéphane Cruveiller; Aurélie Lajus; Géraldine Pascal; Claude Scarpelli; Claudine Médigue
Journal:  Nucleic Acids Res       Date:  2006-01-10       Impact factor: 16.971

10.  The COG database: an updated version includes eukaryotes.

Authors:  Roman L Tatusov; Natalie D Fedorova; John D Jackson; Aviva R Jacobs; Boris Kiryutin; Eugene V Koonin; Dmitri M Krylov; Raja Mazumder; Sergei L Mekhedov; Anastasia N Nikolskaya; B Sridhar Rao; Sergei Smirnov; Alexander V Sverdlov; Sona Vasudevan; Yuri I Wolf; Jodie J Yin; Darren A Natale
Journal:  BMC Bioinformatics       Date:  2003-09-11       Impact factor: 3.169

View more
  2 in total

1.  Whole genome sequence and comparative genomic analysis of multidrug-resistant Staphylococcus capitis subsp. urealyticus strain LNZR-1.

Authors:  Xiaoxia Li; Min Lei; Yanli Song; Kunwei Gong; Ling Li; Hongyan Liang; Xiaofeng Jiang
Journal:  Gut Pathog       Date:  2014-12-20       Impact factor: 4.181

2.  Single-Molecule Sequencing (PacBio) of the Staphylococcus capitis NRCS-A Clone Reveals the Basis of Multidrug Resistance and Adaptation to the Neonatal Intensive Care Unit Environment.

Authors:  Patrícia Martins Simões; Hajar Lemriss; Yann Dumont; Sanâa Lemriss; Jean-Philippe Rasigade; Sophie Assant-Trouillet; Azeddine Ibrahimi; Saâd El Kabbaj; Marine Butin; Frédéric Laurent
Journal:  Front Microbiol       Date:  2016-12-15       Impact factor: 5.640

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.