| Literature DB >> 26594310 |
Saima Zubair1, Anne Fischer2, Anne Liljander3, Jochen Meens4, Jan Hegerman5, Hadrien Gourlé1, Richard P Bishop3, Ina Roebbelen3, Mario Younan6, Mudassir Imran Mustafa7, Mamoona Mushtaq1, Erik Bongcam-Rudloff1, Joerg Jores3.
Abstract
We report the genome of a Staphylococcus aureus strain (ILRI_Eymole1/1) isolated from a nasal swab of a dromedary camel (Camelus dromedarius) in North Kenya. The complete genome sequence of this strain consists of a circular chromosome of 2,874,302 bp with a GC-content of 32.88 %. In silico annotation predicted 2755 protein-encoding genes and 76 non-coding genes. This isolate belongs to MLST sequence type 30 (ST30). Phylogenetic analysis based on a subset of 283 core genes revealed that it falls within the human clonal complex 30 (CC30) S. aureus isolate cluster but is genetically distinct. About 79 % of the protein encoding genes are part of the CC30 core genome (genes common to all CC30 S. aureus isolates), ~18 % were within the variable genome (shared among multiple but not all isolates) and ~ 3 % were found only in the genome of the camel isolate. Among the 85 isolate-specific genes, 79 were located within putative phages and pathogenicity islands. Protein encoding genes associated with bacterial adhesion, and secretory proteins that are essential components of the type VII secretion system were also identified. The complete genome sequence of S. aureus strain ILRI_Eymole1/1 has been deposited in the European Nucleotide Archive under the accession no LN626917.1.Entities:
Keywords: Camel; Core genome; Pathogenicity islands; ST30; Staphylococcus aureus
Year: 2015 PMID: 26594310 PMCID: PMC4654806 DOI: 10.1186/s40793-015-0098-6
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Scanning electron microscopy of ILRI Eymole 1/1 S. aureus grown on agar. Left: overview of cells grown in a colony; right: single cells in higher magnification
Classification and general features of Staphylococcus aureus ILRI_Eymole1/1
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | IDA | ||
| Species | IDA | ||
| Strain: ILRI_Eymole1/1 (Accession # LN626917.1) | |||
| Gram stain | Positive | IDA | |
| Cell shape | Grape-like coccus | IDA | |
| Motility | Nonmotile | IDA | |
| Sporulation | Nonspore-forming | IDA | |
| Temperature range | 15–42 °C | IDA | |
| Optimum temperature | 37 °C | IDA | |
| pH | 4.5–9.5 | IDA | |
| Optimum pH | 7 | ||
| Carbon Source | Glucose, fructose, mannose, maltose, lactose, trehalose, sucrose, turanose | IDA | |
| MIGS-6 | Habitat | Nasopharyngeal microflora | |
| MIGS-6.3 | Salinity | 1 to 2.5 M NaCl | IDA |
| MIGS-22 | Oxygen requirement | Facultative anaerobe | TAS [ |
| MIGS-15 | Biotic relationship | Free living | |
| MIGS-14 | Pathogenicity | – | |
| MIGS-4 | Geographic location | Kenya | |
| MIGS-5 | Sample collection | 01 February 2004 | |
| MIGS-4.1 | Latitude | 3.916667 | |
| MIGS-4.2 | Longitude | 41.833333 | |
| MIGS-4.3 | Depth | – | |
| MIGS-4.4 | Altitude | 220 |
aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [62]
Fig. 2Phylogenetic tree showing the position of camel S. aureus strain ILRI_Eymole1/1 relative to other species of the genus Staphylococcus based on Muscle alignment of 1384 bp of 16S rRNA gene. The tree was constructed using MEGA v 6.06 [60, 63] implementing a Neighbor-Joining method with 1000 bootstrap replications and a Kimura 2-parameter model
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Illumina Paired End; Average read length 300 bp; Average insert size 550 bp. |
| MIGS-29 | Sequencing platforms | Illumina GA-II |
| MIGS-31.2 | Fold coverage | 109 × |
| MIGS-30 | Assemblers | MIRA 4.0 |
| MIGS-32 | Gene calling method | RAST server, Basys |
| Locus Tag | – | |
| Genbank ID | LN626917.1 | |
| Genbank Date of Release | October 31, 2014 | |
| GOLD ID | Gp0109422 | |
| BIOPROJECT | PRJEB6577 | |
| MIGS-13 | Source Material Identifier | ILRI_Azizi_biobank |
| Project relevance | Bacterial pathogen in camels |
Nucleotide content and gene count levels of the genome
| Attribute | Value | % of totala |
|---|---|---|
| Genome Size (bp) | 2,874,302 | 100.00 |
| DNA coding | 2,404,314 | 83.65 |
| DNA G + C (bp) | 945,066 | 32.88 |
| Total genes | 2831 | 100.00 |
| Protein-coding genes | 2755 | 97.32 |
| RNA genes | 76 | 2.68 |
| Pseudo genes | 0 | 0.00 |
| Genes in internal clusters | n/a | n/a |
| Genes with function prediction | 2170 | 76.65 |
| Genes assigned to COGs | 2054 | 74.56 |
| Genes with Pfam domains | 1688 | 59.63 |
| Genes with signal peptides | 162 | 5.88 |
| Genes with transmembrane helices | 1046 | 37.97 |
| CRISPR repeats | 0 | 0.00 |
a The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Fig. 3Circular map of the S. aureus ILRI_Eymole1/1 genome. From outer to inner circle; 1) Protein encoding genes in forward orientation are shown in dark blue, 2) Protein encoding genes in reverse orientation are shown in light blue, 3) Ribosomal RNAs are depicted in green while tRNAs are in red, 4) G + C-content plot and 5) GC-skew graph. The Graph was generated using DNAPlotter [64]
Number of genes associated with the 25 general COG functional categories
| Code | Value | % of totala | Description |
|---|---|---|---|
| J | 145 | 5.26 | Translation |
| A | 0 | 0.00 | RNA processing and modification |
| K | 120 | 4.36 | Transcription |
| L | 148 | 5.37 | Replication, recombination and repair |
| B | 0 | 0.00 | Chromatin structure and dynamics |
| D | 24 | 0.87 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.00 | Nuclear structure |
| V | 45 | 1.63 | Defense mechanisms |
| T | 35 | 1.27 | Signal transduction mechanisms |
| M | 90 | 3.27 | Cell wall/membrane biogenesis |
| N | 1 | 0.04 | Cell motility |
| Z | 0 | 0.00 | Cytoskeleton |
| W | 0 | 0.00 | Extracellular structures |
| U | 20 | 0.73 | Intracellular trafficking and secretion |
| O | 61 | 2.21 | Posttranslational modification, protein turnover, chaperones |
| C | 91 | 3.30 | Energy production and conversion |
| G | 106 | 3.85 | Carbohydrate transport and metabolism |
| E | 174 | 6.32 | Amino acid transport and metabolism |
| F | 64 | 2.32 | Nucleotide transport and metabolism |
| H | 77 | 2.80 | Coenzyme transport and metabolism |
| I | 50 | 1.82 | Lipid transport and metabolism |
| P | 124 | 4.50 | Inorganic ion transport and metabolism |
| Q | 23 | 0.84 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 228 | 8.28 | General function prediction only |
| S | 220 | 7.99 | Function unknown |
| – | 208 | 7.55 | Other COG categories |
| – | 701 | 25.44 | Not in COGs |
aThe total is based on the total number of protein coding genes in the annotated genome
Fig. 4COG functional classification of CC30 S. aureus core genome
Fig. 5a Maximum likelihood tree of the concatenated sequence of selected 283 core genes in 20 CC30 S. aureus isolates; one ST1 and one ST5 S. aureus isolate Mu50 and N315 respectively. General Time Reverse model was used with 100 bootstrap replications. The bootstrap values are represented above the nodes. ST1 and ST5 are out grouped. b Maximum likelihood unrooted tree of 20 CC30 S. aureus isolates using set of 283 core genes. General Time Reverse model and 100 bootstrap replications were used. The values indicated are the bootstrap values