| Literature DB >> 30338024 |
Doglas Parise1, Mariana T D Parise1, Marcus V C Viana1, Adrian V Muñoz-Bucio2, Yazmin A Cortés-Pérez2, Beatriz Arellano-Reynoso2, Efrén Díaz-Aparicio2, Fernanda A Dorella3, Felipe L Pereira3, Alex F Carvalho3, Henrique C P Figueiredo3, Preetam Ghosh4, Debmalya Barh1,5,6, Anne C P Gomide1, Vasco A C Azevedo1.
Abstract
Corynebacterium pseudotuberculosis is a pathogenic bacterium which has been rapidly spreading all over the world, causing economic losses in the agricultural sector and sporadically infecting humans. Six C. pseudotuberculosis strains were isolated from goats, sheep, and horses with distinct abscess locations. For the first time, Mexican genomes of this bacterium were sequenced and studied in silico. All strains were sequenced using Ion Personal Genome Machine sequencer, assembled using Newbler and SPAdes software. The automatic genome annotation was done using the software RAST and in-house scripts for transference, followed by manual curation using Artemis software and BLAST against NCBI and UniProt databases. The six genomes are publicly available in NCBI database. The analysis of nucleotide sequence similarity and the generated phylogenetic tree led to the observation that the Mexican strains are more similar between strains from the same host, but the genetic structure is probably more influenced by transportation of animals between farms than host preference. Also, a putative drug target was predicted and in silico analysis of 46 strains showed two gene clusters capable of differentiating the biovars equi and ovis: Restriction Modification system and CRISPR-Cas cluster.Entities:
Keywords: CRISPR-Cas; Drug target; Genomic sequencing; Phylogenetics; Restriction-modification systems
Year: 2018 PMID: 30338024 PMCID: PMC6180578 DOI: 10.1186/s40793-018-0325-z
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of strains MEX1, MEX9, MEX25, MEX29, MEX30, and MEX31 according to the MIGS recommendations [41]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| strain: | |||
| Gram stain |
| TAS [ | |
| Cell shape |
| TAS [ | |
| Motility |
| TAS [ | |
| Sporulation |
| TAS [ | |
| Temperature range |
| TAS [ | |
| Optimum temperature |
| TAS [ | |
| pH range; Optimum |
| TAS [ | |
| Carbon source |
| TAS [ | |
| MIGS-6 | Habitat |
| TAS [ |
| MIGS-6.3 | Salinity |
| TAS [ |
| MIGS-22 | Oxygen requirement |
| TAS [ |
| MIGS-15 | Biotic relationship |
| TAS [ |
| MIGS-14 | Pathogenicity |
| TAS [ |
| MIGS-4 | Geographic location |
| TAS [ |
| MIGS-5 | Sample collection |
| TAS [ |
| MIGS-4.1 | Latitude |
| IDA |
| MIGS-4.2 | Longitude |
| IDA |
| MIGS-4.4 | Altitude |
| IDA |
aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [75]
Fig. 1Phylogenetic tree of new strains of this work inside the rectangles, with other strains of the group CMNR. The blue rectangles highlight the biovar ovis strains and the red rectangle highlights the biovar equi strains of this work. The numbers near the nodes indicate bootstrap values
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Fragments |
| MIGS 29 | Sequencing platforms | Ion Torrent PGM |
| MIGS 31.2 | Fold coverage | 115× (MEX1); 129× (MEX9); 99× ( |
| MIGS 30 | Assemblers | Newbler, SPAdes. |
| MIGS 32 | Gene calling method | RAST |
| Locus Tag | CpMEX1_ (MEX1); CpMEX9_ (MEX9); AN397_ ( | |
| Genbank ID | ||
| GenBank Date of Release | 2017/01/30 (MEX1); 2016/05/27 (MEX9); 2015/12/23 ( | |
| GOLD ID | - (MEX1); Go0366057 (MEX9); Go0139540 ( | |
| BIOPROJECT | ||
| MIGS 13 | Source Material Identifier | BHI broth |
| Project relevance | Animal Pathogen, Medical |
Genome statistics
| Attribute | MEX1 | MEX9 | MEX25 | MEX29 | MEX30 | MEX31 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Value | % | Value | % | Value | % | Value | % | Value | % | Value | % | |
| Genome size (bp) | 2,337,090 | 100.0 | 2,337,578 | 100.0 | 2,337,529 | 100.0 | 2,337,866 | 100.0 | 2,368,140 | 100.0 | 2,367,880 | 100.0 |
| DNA coding (bp) | 2,012,758 | 86.12 | 2,017,915 | 86.33 | 2008,915 | 85.94 | 2025,972 | 86.66 | 1,966,942 | 83.06 | 2,051,473 | 86.64 |
| DNA G + C (bp) | 1,219,520 | 52.18 | 1,219,842 | 52.18 | 1,219,763 | 52.18 | 1,219,957 | 52.18 | 1,234,064 | 52.11 | 1,233,547 | 52.10 |
| DNA scaffolds | 1 | 100.0 | 1 | 100.0 | 1 | 100.0 | 1 | 100.0 | 1 | 100.0 | 1 | 100.0 |
| Total genes | 2145 | 100.0 | 2146 | 100.0 | 2143 | 100.0 | 2145 | 100.0 | 2188 | 100.0 | 2182 | 100.0 |
| Protein coding genes | 2021 | 94.22 | 2025 | 94.36 | 2016 | 94.07 | 2032 | 94.73 | 2008 | 91.77 | 2058 | 94.32 |
| RNA genes | 64 | 2.98 | 64 | 2.98 | 64 | 2.99 | 64 | 2.98 | 66 | 3.02 | 63 | 2.89 |
| Pseudo genes | 60 | 2.80 | 57 | 2.66 | 63 | 2.94 | 49 | 2.28 | 114 | 5.21 | 61 | 2.80 |
| Genes in internal clusters | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA |
| Genes with function prediction | 1579 | 73.61 | 1576 | 73.44 | 1583 | 73.87 | 1578 | 73.57 | 1605 | 73.36 | 1610 | 73.79 |
| Genes assigned to COGs | 2007 | 93.57 | 2013 | 93.80 | 2009 | 93.75 | 2020 | 94.17 | 1998 | 91.32 | 2046 | 93.77 |
| Genes with Pfam domains | 1679 | 78.28 | 1675 | 78.05 | 1670 | 77.93 | 1690 | 78.79 | 1664 | 76.05 | 1731 | 79.33 |
| Genes with signal peptides | 157 | 7.32 | 151 | 7.04 | 159 | 7.42 | 155 | 7.23 | 144 | 6.58 | 153 | 7.01 |
| Genes with transmembrane helices | 595 | 27.74 | 591 | 27.54 | 585 | 27.30 | 601 | 28.02 | 585 | 26.74 | 596 | 27.31 |
| CRISPR repeats | 0 | 00 0 | 1 | 0.05 | 1 | 0.05 | 1 | 0.05 | 4 | 0.18 | 4 | 0.18 |
Number of genes associated with general COG functional categories
| Code | MEX1 | MEX9 | MEX25 | MEX29 | MEX30 | MEX31 | Description | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Value | %age | Value | %age | Value | %age | Value | %age | Value | %age | Value | %age | ||
| J | 182 | 9.01 | 182 | 8.99 | 181 | 8.98 | 186 | 9.15 | 183 | 9.11 | 189 | 9.18 | Translation, ribosomal structure and biogenesis |
| A | 2 | 0.10 | 2 | 0.10 | 2 | 0.10 | 2 | 0.10 | 2 | 0.10 | 2 | 0.10 | RNA processing and modification |
| K | 138 | 6.83 | 139 | 6.87 | 137 | 6.80 | 138 | 6.79 | 136 | 6.77 | 134 | 6.51 | Transcription |
| L | 105 | 5.20 | 105 | 5.19 | 96 | 4.76 | 102 | 5.02 | 102 | 5.08 | 101 | 4.91 | Replication, recombination and repair |
| B | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.05 | 0 | 0 | Chromatin structure and dynamics |
| D | 44 | 2.18 | 43 | 2.12 | 45 | 2.23 | 45 | 2.22 | 43 | 2.14 | 47 | 2.28 | Cell cycle control, Cell division, chromosome partitioning |
| V | 68 | 3.37 | 67 | 3.31 | 69 | 3.42 | 71 | 3.49 | 75 | 3.74 | 70 | 3.40 | Defense mechanisms |
| T | 99 | 4.90 | 101 | 4.99 | 99 | 4.91 | 103 | 5.07 | 98 | 4.88 | 98 | 4.76 | Signal transduction mechanisms |
| M | 124 | 6.14 | 122 | 6.03 | 119 | 5.90 | 120 | 5.91 | 119 | 5.93 | 117 | 5.69 | Cell wall/membrane biogenesis |
| N | 21 | 1.04 | 22 | 1.09 | 20 | 0.99 | 20 | 0.98 | 13 | 0.65 | 17 | 0.83 | Cell motility |
| U | 32 | 1.58 | 31 | 1.53 | 30 | 1.49 | 31 | 1.53 | 29 | 1.44 | 30 | 1.46 | Intracellular trafficking and secretion |
| O | 128 | 6.33 | 122 | 6.03 | 121 | 6.00 | 126 | 6.20 | 122 | 6.08 | 122 | 5.93 | Posttranslational modification, protein turnover, chaperones |
| C | 125 | 6.19 | 124 | 6.12 | 116 | 5.75 | 124 | 6.10 | 123 | 6.13 | 121 | 5.88 | Energy production and conversion |
| G | 158 | 7.82 | 154 | 7.61 | 151 | 7.49 | 156 | 7.68 | 151 | 7.52 | 161 | 7.82 | Carbohydrate transport and metabolism |
| E | 212 | 10.49 | 213 | 10.52 | 204 | 10.12 | 213 | 10.48 | 219 | 10.91 | 223 | 10.84 | Amino acid transport and metabolism |
| F | 82 | 4.06 | 82 | 4.05 | 79 | 3.92 | 82 | 4.04 | 78 | 3.88 | 81 | 3.94 | Nucleotide transport and metabolism |
| H | 143 | 7.08 | 141 | 6.96 | 135 | 6.70 | 141 | 6.94 | 151 | 7.52 | 152 | 7.39 | Coenzyme transport and metabolism |
| I | 92 | 4.55 | 91 | 4.49 | 90 | 4.46 | 93 | 4.58 | 86 | 4.28 | 87 | 4.23 | Lipid transport and metabolism |
| P | 162 | 8.02 | 157 | 7.75 | 162 | 8.04 | 162 | 7.97 | 166 | 8.27 | 168 | 8.16 | Inorganic ion transport and metabolism |
| Q | 49 | 2.43 | 46 | 2.27 | 46 | 2.28 | 48 | 2.36 | 52 | 2.59 | 50 | 2.43 | Secondary metabolites biosynthesis, transport, and catabolism |
| R | 170 | 8.41 | 164 | 8.10 | 157 | 7.79 | 167 | 8.22 | 167 | 8.32 | 169 | 8.21 | General function prediction only |
| S | 138 | 6.83 | 142 | 7.01 | 127 | 6.30 | 141 | 6.94 | 138 | 6.87 | 140 | 6.80 | Function unknown |
| – | 14 | 0.69 | 12 | 0.59 | 7 | 0.35 | 12 | 0.59 | 10 | 0.50 | 12 | 0.58 | Not in COGs |
The total is based on the total number of protein coding genes in the genome
Fig. 2Circular map of strain MEX1 (biovar ovis) in comparison with the other strains of this study. The cluster of methylation type III, which is only present in biovar ovis strains, is highlighted in blue
Fig. 3Circular map of strain MEX30 (biovar equi) in comparison with the other strains of this study. The cluster of CRISPR-Cas, which is only present in biovar equi strains, is highlighted in blue. The nitrate reductase gene cluster is highlighted by a black rectangle
Fig. 4Alignment generated using Gegenees software showing the percentage similarity among the strains, based on the accessory genome. The blue rectangle highlights the grouping of the biovar ovis. The red rectangle highlights the grouping of the biovar equi