| Literature DB >> 23991252 |
Songling Zhu1, Hong-Liang Wang, Chunxiao Wang, Le Tang, Xiaoyu Wang, Kai-Jiang Yu, Shu-Lin Liu.
Abstract
Salmonella enterica subsp. houtenae serovar 16:z4, z32:-- str. RKS3027 was isolated from a human in Illinois, USA. S. enterica subsp. houtenae is a facultative aerobic rod-shaped Gram-negative bacterium. Here we describe the features of this organism, together with the draft genome sequence and annotation. The 4,404,136 bp long genome (97 contigs) contains 4,335 protein-coding gene and 28 RNA genes.Entities:
Keywords: Salmonella enterica; genome; houtenae; subspecies
Year: 2013 PMID: 23991252 PMCID: PMC3746422 DOI: 10.4056/sigs.3767427
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of strain RKS3027 relative to the other types and strains of . GenBank accession numbers are indicated in the parentheses. The tree was built based on the comparison of concatenated nucleotide sequences of 2,500 orthologs conserved in all strains. Individual orthologous sequences were aligned by the MAFFT [9] and phylogenetic tree was constructed by using the neighbor-joining method within the MEGA software [10].
Classification and general features of RKS3027 according to the MIGS recommendations [12]
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Subspecies | TAS [ | ||
| Strain RKS3027 | IDA | ||
| Serovar 16:z3, z32:-- | IDA | ||
| Gram stain | Negative | IDA | |
| Cell shape | Rod-shaped | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Non-sporulating | IDA | |
| Temperature range | Mesophilic | IDA | |
| Optimum temperature | 35 °C - 37 °C | IDA | |
| Carbon source | Glucose | IDA | |
| Energy source | Chemoorganotrophic | IDA | |
| MIGS-6 | Habitat | Reptiles | IDA |
| MIGS-6.3 | Salinity | Medium | IDA |
| MIGS-22 | Oxygen | Facultative anaerobes | IDA |
| MIGS-15 | Biotic relationship | Endophyte | IDA |
| MIGS-14 | Pathogenicity | Pathogenic | IDA |
| MIGS-4 | Geographic location | Illinois, USA | NAS |
| MIGS-5 | Sample collection time | 1986 | NAS |
| MIGS-4.1 | Latitude | Not report | NAS |
| MIGS-4.2 | Longitude | Not report | NAS |
| MIGS-4.3 | Depth | Not report | NAS |
| MIGS-4.4 | Altitude | Not report | NAS |
a) Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [25].
Project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Draft |
| MIGS-28 | Libraries used | Illumina Paired-End library |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000 |
| MIGS-31.2 | Fold coverage | 100 × |
| MIGS-30 | Assemblers | SOAPdenovo v1.05 |
| MIGS-32 | Gene calling method | RAST |
| Genbank ID | ANHR00000000 | |
| GOLD ID | Gi21447 | |
| Project relevance | Evolution in bacteria, human pathogen |
Nucleotide content and gene count levels of the genome
| Value | % of totala | |
|---|---|---|
| Genome size (bp) | 4,404,136 | |
| DNA coding region (bp) | 3,824,952 | 86.85 |
| DNA G+C content (bp) | 2,276,005 | 51.68 |
| Total genes | 4,363 | 100 |
| RNA genes | 28 | 0.06 |
| Protein-coding genes | 4,335 | 99.36 |
| Genes assigned to COGs | 3,378 | 77.42 |
a) The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
Figure 2Graphical circular map of the strain RKS 3027 genome. From the outside to the center: genes on forward strand (color by COG categories), genes on reverse strand (color by COG categories), GC content, GC skew. The map was generated with the CGviewer software.
Number of genes associated with the 25 general COG functional categories
| | | | |
|---|---|---|---|
| J | 163 | 3.76 | Translation |
| A | 1 | 0.02 | RNA processing and modification |
| K | 281 | 6.48 | Transcription |
| L | 176 | 4.06 | Replication, recombination and repair |
| B | 0 | 0.00 | Chromatin structure and dynamics |
| D | 32 | 0.74 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.00 | Nuclear structure |
| V | 48 | 1.11 | Defense mechanisms |
| T | 103 | 2.38 | Signal transduction mechanisms |
| M | 235 | 5.42 | Cell wall/membrane biogenesis |
| N | 95 | 2.19 | Cell motility |
| Z | 0 | 0.00 | Cytoskeleton |
| W | 0 | 0.00 | Extracellular structures |
| U | 41 | 0.95 | Intracellular trafficking and secretion |
| O | 138 | 3.18 | Posttranslational modification, protein turnover, chaperones |
| C | 254 | 5.86 | Energy production and conversion |
| G | 343 | 7.91 | Carbohydrate transport and metabolism |
| E | 319 | 7.36 | Amino acid transport and metabolism |
| F | 77 | 1.78 | Nucleotide transport and metabolism |
| H | 131 | 3.02 | Coenzyme transport and metabolism |
| I | 89 | 2.05 | Lipid transport and metabolism |
| P | 175 | 4.04 | Inorganic ion transport and metabolism |
| Q | 47 | 1.08 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 318 | 7.34 | General function prediction only |
| S | 312 | 7.20 | Function unknown |
| - | 957 | 22.08 | Not in COGs |
a) The total is based on the total number of protein coding genes in the annotated genome.