| Literature DB >> 27066196 |
Sintia Almeida1, Sandeep Tiwari1, Diego Mariano1, Flávia Souza1, Syed Babar Jamal1, Nilson Coimbra2, Roberto Tadeu Raittz2, Fernanda Alves Dorella3, Alex Fiorine de Carvalho3, Felipe Luiz Pereira3, Siomar de Castro Soares3, Carlos Augusto Gomes Leal3, Debmalya Barh4, Preetam Ghosh5, Henrique Figueiredo3, Lília Ferreira Moura-Costa6, Ricardo Wagner Portela6, Roberto Meyer6, Artur Silva7, Vasco Azevedo1.
Abstract
Corynebacterium pseudotuberculosis strain VD57 (Cp_VD57), a highly virulent, nonmotile, non-sporulating, and a mesophilic bacterium, was isolated from a goat's granulomatous lesion in the municipality of Juazeiro, Bahia State, Brazil. Here, we describe a set of features of the strain, together with the details of its complete genome sequence and annotation. The genome comprises of a 2.5 Mbp long, single circular genome with 2,101 protein-coding genes, 12 rRNA, 49 tRNA and 47 pseudogenes and a G + C content of 52.85 %. Genetic variation was detected in Cp_VD57 using C. pseudotuberculosis strain 1002 as reference, wherein small genomic insertions and deletions were identified. The comparative analysis of the genome sequence provides means to better understand the host pathogen interactions of this strain and can also help us to understand the molecular and genetic basis of virulence of this bacterium.Entities:
Keywords: Biovar ovis; Caseous lymphadenitis; Corynebacterium pseudotuberculosis; Genome sequencing; Goat; Gram-positive pathogen; Ion Torrent PGM
Year: 2016 PMID: 27066196 PMCID: PMC4826502 DOI: 10.1186/s40793-016-0149-7
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Phylogenetic tree of C. pseudotuberculosis strain VD57 representing its position relative to type strains in Corynebacteriaceae along with some other type strains of CMNR group. The tree was inferred from 3,537 aligned characters of the rpoB gene sequence using maximum likelihood method and then checked for its agreement with the current classification in Table 1. The branch lengths represent the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1,000 bootstrap replicates, indicated when larger than 60 %. Calculations to determine the phylogenetic distances were done by the software MEGA v6 [40]. The GenBank accession numbers are shown in parentheses
Classification and general features of Corynebacterium pseudotuberculosis strain VD57 according to the MIGS recommendations [19]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Gram stain | Positive | TAS [ | |
| Cell shape |
| TAS [ | |
| Motility | Non-motile | TAS [ | |
| Sporulation | Non-sporulating | TAS [ | |
| Temperature range | Mesophilic | NAS | |
| Optimum temperature | 37 °C | TAS [ | |
| pH range; Optimum | 7.0–7.2 | TAS [ | |
| Carbon source | Glucose | TAS [ | |
| MIGS-6 | Habitat | Host | TAS [ |
| MIGS-6.3 | Salinity | Not reported | |
| MIGS-22 | Oxygen requirement | Aerobic and Obligate Aerobic | TAS [ |
| MIGS-15 | Biotic relationship | Intracellular facultative pathogen | TAS [ |
| MIGS-14 | Pathogenicity | Goat | TAS [ |
| MIGS-4 | Geographic location | Bahia State, Brazil | TAS [ |
| MIGS-5 | Sample collection time | 2005 | [NAS] |
| MIGS-4.1 | Latitude | 9°24’S | [IDA] |
| MIGS-4.2 | Longitude | 40°30’W | [IDA] |
aEvidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence)
Genome sequencing project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Fragments |
| MIGS 29 | Sequencing platforms | Semiconductor Ion Torrent PGM |
| MIGS 31.2 | Fold coverage | 78.22-fold |
| MIGS 30 | Assemblers | MIRA .4.0CLC Genome Workbench 4.7.2 |
| MIGS 32 | Gene calling method | Glimmer v3.02 |
| Locus Tag | CpVD57 | |
| Genbank ID | CP009927 (chromosome) | |
| GenBank Date of Release | January 06, 2015 | |
| BIOPROJECT | PRJNA267107 | |
| MIGS 13 | Source Material Identifier | BHI broth, VD57 |
| Project relevance | Animal Pathogen, Medical |
Genome Statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 2,337,177 | 100.0 |
| DNA coding (bp) | 1,998,286. | 85.5 |
| DNA G + C (bp) | 1,235,198 | 52.9 |
| DNA scaffolds | 1 | |
| Total genesa | 2,148 | 100.0 |
| Protein coding genesa | 2,101 | 97.8 |
| RNA genes | 61 | 2.83 |
| Pseudo genes | 47 | 2.2 |
| Genes in internal clusters | NA | NA |
| Genes with function prediction | 1,578 | 73.5 |
| Genes assigned to COGs | 1,629 | 75.8 |
| Genes with Pfam domains | 1,682 | 80,1 |
| Genes with signal peptides | 158 | 7.36 |
| Genes with transmembrane helices | 605 | 28.8 |
| CRISPR repeats | NA | NA |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Number of genes associated with the general COG functional categories
| Code | Value | % agea | Description |
|---|---|---|---|
| J | 148 | 7.04 | Translation, ribosomal structure and biogenesis |
| A | 2 | 0.09 | RNA processing and modification |
| K | 113 | 5.37 | Transcription |
| L | 104 | 4.95 | Replication, recombination and repair |
| B | 0 | 0.00 | Chromatin structure and dynamics |
| D | 20 | 0.95 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.00 | Nuclear structure |
| V | 31 | 1.47 | Defense mechanisms |
| T | 51 | 2.42 | Signal transduction mechanisms |
| M | 93 | 4.42 | Cell wall/membrane biogenesis |
| N | 5 | 0.23 | Cell motility |
| Z | 1 | 0.04 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 33 | 1.57 | Intracellular trafficking and secretion |
| O | 82 | 3.90 | Posttranslational modification, protein turnover, chaperones |
| C | 100 | 4.75 | Energy production and conversion |
| G | 115 | 5.47 | Carbohydrate transport and metabolism |
| E | 191 | 9.09 | Amino acid transport and metabolism |
| F | 69 | 3.28 | Nucleotide transport and metabolism |
| H | 103 | 4.90 | Coenzyme transport and metabolism |
| I | 62 | 2.95 | Lipid transport and metabolism |
| P | 128 | 6.09 | Inorganic ion transport and metabolism |
| Q | 31 | 1.47 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 193 | 9.18 | General function prediction only |
| S | 141 | 6.71 | Function unknown |
| - | 472 | 22.46 | Not in COGs |
| Totalb | 2288 | 104.42 |
aThe percentage is based on the total number of protein coding genes in the annotated genome
bThe total does not correspond to 1,537 CDSs, because some genes are associated with more than one COG functional categories
Fig. 2Graphical circular map of the genome [41]. From center to the outside: In wine Ovis strains, in blue Equi strains, RNA genes (tRNAs green, rRNAs orange, tRNAs red), GC content in black, GC skew
Metabolic Network Statistics
| Attribute | Value |
|---|---|
| Total genes | 2145 |
| Enzymes | 599 |
| Enzymatic reactions | 1197 |
| Metabolic pathways | 232 |
| Compounds | 912 |
Total number of SNP’s in C. pseudotuberculosis VD57 in comparison to other strains
| Reference | Total SNPs | SNP coding regions | SNP intergenic regions |
|---|---|---|---|
|
| 25,609 | 19,811 | 5,798 |
|
| 25,706 | 21,303 | 4,403 |
|
| 24,352 | 18,085 | 6,267 |
|
| 25,866 | 20,017 | 5,849 |
|
| 24,274 | 18,501 | 5,773 |
|
| 25,905 | 20,911 | 4,994 |
|
| 35 | 28 | 7 |
|
| 952 | 741 | 211 |
|
| 999 | 754 | 245 |
|
| 968 | 762 | 206 |
|
| 471 | 374 | 97 |
|
| 2,404 | 1,869 | 535 |
|
| 1,060 | 804 | 256 |
|
| 956 | 735 | 221 |
|
| 502 | 411 | 91 |
|
| 521 | 394 | 127 |
|
| 516 | 392 | 124 |
|
| 492 | 380 | 112 |