| Literature DB >> 23408795 |
Syed Shah Hassan1, Luis Carlos Guimarães, Ulisses de Pádua Pereira, Arshad Islam, Amjad Ali, Syeda Marriam Bakhtiar, Dayana Ribeiro, Anderson Rodrigues Dos Santos, Siomar de Castro Soares, Fernanda Dorella, Anne Cybelle Pinto, Maria Paula Cruz Schneider, Maria Silvanira Barbosa, Síntia Almeida, Vinícius Abreu, Flávia Aburjaile, Adriana Ribeiro Carneiro, Louise Teixeira Cerdeira, Karina Fiaux, Eudes Barbosa, Carlos Diniz, Flavia S Rocha, Rommel Thiago Jucá Ramos, Neha Jain, Sandeep Tiwari, Debmalya Barh, Anderson Miyoshi, Borna Müller, Artur Silva, Vasco Azevedo.
Abstract
The Actinobacteria, Corynebacterium pseudotuberculosis strain P54B96, a nonmotile, non-sporulating and a mesophile bacterium, was isolated from liver, lung and mediastinal lymph node lesions in an antelope from South Africa. This strain is interesting in the sense that it has been found together with non-tuberculous mycobacteria (NTMs) which could nevertheless play a role in the lesion formation. In this work, we describe a set of features of C. pseudotuberculosis P54B96, together with the details of the complete genome sequence and annotation. The genome comprises of 2.34 Mbp long, single circular genome with 2,084 protein-coding genes, 12 rRNA, 49 tRNA and 62 pseudogenes and a G+C content of 52.19%. The analysis of the genome sequence provides means to better understanding the molecular and genetic basis of virulence of this bacterium, enabling a detailed investigation of its pathogenesis.Entities:
Keywords: Antelope; Gram-positive pathogen; Ion Torrent; caseous lymphadenitis/cheesy gland disease; genome sequencing; liver lesion; s: biovar ovis
Year: 2012 PMID: 23408795 PMCID: PMC3569390 DOI: 10.4056/sigs.3066455
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree of strain P54B96 representing its position relative to type strains in along with some other type strains of CMNR group. The tree was inferred from 3,537 aligned characters of the rpoB gene sequence using maximum likelihood method and then checked for its agreement with the current classification Table 1. The branch lengths represent the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1,000 bootstrap replicates, indicated when Larger than 60%. Calculations to determine the phylogenetic distances were done by the software MEGA v5 [30].
Classification and general features of strain P54B96 according to the MIGS recommendations [31].
| | | | |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain P54B96 | TAS [ | ||
| Gram stain | Positive | TAS [ | |
| Cell shape | pleomorphic forms | TAS [ | |
| Motility | non-motile | TAS [ | |
| Sporulation | non-sporulating | TAS [ | |
| Temperature range | mesophilic | TAS [ | |
| Optimum temperature | 37°C | TAS [ | |
| Salinity | not reported | NAS | |
| MIGS-22 | Oxygen requirement | aerobic and facultatively anaerobic | TAS [ |
| Carbon source | glucose, fructose, maltose, mannose, | TAS [ | |
| Energy source | Chemoorganotroph | TAS [ | |
| MIGS-6 | Habitat | Host | TAS [ |
| MIGS-15 | Biotic relationship | intracellular facultative pathogen | TAS [ |
| MIGS-14 | Pathogenicity | sheep, goats, horses and cattle, | TAS [ |
| Biosafety level | 2 | TAS [ | |
| Isolation | liver, lung, mediastinal lymph node lesions of antelope | TAS [ | |
| MIGS-4 | Geographic location | Mpumalanga province, South Africa | TAS [ |
| MIGS-5 | Sample collection time | 2009 | TAS [ |
| MIGS-4.1 | Latitude | not reported | |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i. e. a direct report exists in the literature); NAS: Non-traceable Author Statement (i. e. not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [41]. If the evidence code is IDA, then the property was directly observed for a living isolate by one of the authors or an expert mentioned in the acknowledgements.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Fragments (mean size 112 bp) |
| MIGS-29 | Sequencing platforms | Semiconductor Ion Torrent PGM |
| MIGS-31.2 | Sequencing coverage | 35-fold |
| MIGS-30 | Assemblers | CLC Genome Workbench 4.7.2, Velvet |
| MIGS-32 | Gene calling method | Glimmer v3.02 |
| INSDC ID | CP003385 (chromosome) | |
| GenBank Date of Release | April 05, 2012 | |
| GOLD ID | Gc02176 | |
| NCBI project ID | 77871 | |
| Database: IMG-GEBA | 2512564058 | |
| MIGS-13 | Source material identifier | BHI broth, P54B96 |
| Project relevance | Animal Pathogen, Medical |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 2,337,657 | 100.00% |
| DNA coding region (bp) | 2,005,391 | 85.79% |
| DNA G+C content (bp) | 1,219,912 | 52.19% |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genes | 2,145 | 100.00% |
| RNA genes | 61 | 2.76% |
| rRNA operons | 4 | |
| Protein-coding genes | 2,084 | 97.16% |
| Pseudo genes | 62 | 2.81% |
| Genes with function prediction | 1,511 | 68.46% |
| Genes in paralog clusters | 425 | 19.26% |
| Genes assigned to COGs | 1,552 | 70.32% |
| Genes assigned Pfam domains | 1,596 | 72.32% |
| Genes with signal peptides | 651 | 29.50% |
| Genes with transmembrane helices | 584 | 26.46% |
| CRISPR repeats | 0 |
Figure 2Graphical circular map of the genome. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 140 | 6.72 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.1 | RNA processing and modification |
| K | 121 | 5.8 | Transcription |
| L | 88 | 4.2 | Replication, recombination and repair |
| B | 0 | 0.0 | Chromatin structure and dynamics |
| D | 21 | 1.0 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 25 | 1.2 | Defense mechanisms |
| T | 54 | 2.6 | Signal transduction mechanisms |
| M | 87 | 4.2 | Cell wall/membrane biogenesis |
| N | 1 | 0.1 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 27 | 1.3 | Intracellular trafficking and secretion |
| O | 77 | 3.7 | Posttranslational modification, protein turnover, chaperones |
| C | 90 | 4.3 | Energy production and conversion |
| G | 113 | 5.4 | Carbohydrate transport and metabolism |
| E | 177 | 8.5 | Amino acid transport and metabolism |
| F | 73 | 3.5 | Nucleotide transport and metabolism |
| H | 102 | 4.9 | Coenzyme transport and metabolism |
| I | 57 | 2.7 | Lipid transport and metabolism |
| P | 122 | 5.9 | Inorganic ion transport and metabolism |
| Q | 26 | 1.3 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 169 | 8.1 | General function prediction only |
| S | 136 | 6.5 | Function unknown |
| - | 655 | 31.4 | Not in COGs |
Figure 3Schematic cellular overview of all pathways of the P54B96 metabolism. Nodes represent metabolites, with shape indicating class of metabolite. Lines represent reactions.
Metabolic Network Statistics
| | |
|---|---|
| Total genes | 2,145 |
| Enzymes | 500 |
| Enzymatic reactions | 764 |
| Metabolic pathways | 152 |
| Metabolites | 622 |