| Literature DB >> 26452736 |
Vartul Sangal1, Jochen Blom2, Iain C Sutcliffe3, Christina von Hunolstein4, Andreas Burkovski5, Paul A Hoskisson6.
Abstract
BACKGROUND: Non-toxigenic Corynebacterium diphtheriae strains are emerging as a major cause of severe pharyngitis and tonsillitis as well as invasive diseases such as endocarditis, septic arthritis, splenic abscesses and osteomyelitis. C. diphtheriae strains have been reported to vary in their ability to adhere and invade different cell lines. To identify the genetic basis of variation in the degrees of pathogenicity, we sequenced the genomes of four strains of C. diphtheriae (ISS 3319, ISS 4060, ISS 4746 and ISS 4749) that are well characterised in terms of their ability to adhere and invade mammalian cells.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26452736 PMCID: PMC4600297 DOI: 10.1186/s12864-015-1980-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Details of the genome assemblies of C. diphtheriae strains ISS 3319, ISS 4060, ISS 4746 and ISS 4749
| Details | ISS 3319 | ISS 4060 | ISS 4746 | ISS 4749 |
|---|---|---|---|---|
| Number of reads | 218,328 | 188,708 | 164,106 | 186,176 |
| Average read length (bp) | 479 | 463 | 454 | 467 |
| Size of assembly (bp) | 2,366,093 | 2,395,664 | 2,379,713 | 2,366,484 |
| Number of contigs | 33 | 47 | 53 | 40 |
| Average coverage | 70x | 58x | 55x | 66x |
| N50 (bp) | 139,564 | 102,641 | 119,602 | 145,623 |
| Average GC content (mol%) | 53.53 | 53.61 | 53.46 | 53.45 |
| Number of features | 2,272 | 2,284 | 2,275 | 2,258 |
| - CDS | 2,168 | 2,198 | 2,185 | 2,168 |
| - rRNA | 3 | 4 | 4 | 2 |
| - tRNA | 49 | 46 | 52 | 49 |
| - ncRNA | 1 | 1 | 1 | 1 |
Fig. 1A Venn diagram showing the numbers of shared and specific genes among C. diphtheriae strains ISS 3319, ISS 4060, ISS 4746 and ISS 4749
Strain-specific gene content based on the comparison of the draft genomes of C. diphtheriae strains ISS 3319, ISS 4060, ISS 4746 and ISS 4749
| Protein annotation | ISS 3319 | ISS 4060 | ISS 4746 | ISS 4749 | ISS 4746 – ISS 4749a |
|---|---|---|---|---|---|
| Hypothetical | 69 | 73 | 22 | 10 | 86 |
| Fimbrial & membrane | 4 | 4 | - | 2 | 12 |
| Transposase | 6 | 13 | 3 | 1 | 14 |
| Othersb | 24 | 31 | 5 | 2 | 37 |
aStrains ISS 4746 and ISS 4749 belong to the same ST, ST32. Hence, the genes shared by them but not with ISS 3319 and ISS 4060 were treated as strain-specific and these are detailed in the column “ISS 4746 – ISS 4749”
bCDS encoding transcription regulators, components of restriction-modification systems or other cellular and metabolic activities were grouped within the ‘Others’ category
Fig. 2Structure and organisation of pilus gene clusters in C. diphtheriae strains ISS 3319, ISS 4060, ISS 4746 and ISS 4749. The common genes between different strains are shown in the same colour and the pseudogenes are shown using broken arrows. The direction of the arrow indicates the orientation of the coding sequence. The schematic is not to scale
Fig. 3The distribution of predicted protein functions (a) in the core genome, (b) in the accessory genome of C. diphtheriae. The distribution of the functional protein categories based on the similarities in the Clusters of Orthologous Genes (COG) database within predicted (c). Transmembrane proteins (outer circle), (d) secreted proteins (outer circle) and (e) cytoplasmic proteins. The proportion of different types of membrane-associated and secreted proteins is shown in the inner circles within panel c and d, respectively. (COG codes - A: RNA processing and modification; B: Chromatin structure and dynamics; C: Energy production and conversion; D: Cell cycle control, cell division, chromosome partitioning; E: Amino acid transport and metabolism; F: Nucleotide transport and metabolism; G: Carbohydrate transport and metabolism; H: Coenzyme transport and metabolism; I: Lipid transport and metabolism; J: Translation, ribosomal structure and biogenesis; K: Transcription; L: Replication, recombination and repair; M: Cell wall/membrane/envelope biogenesis; N: Cell motility; O: Post-translational modification, protein turnover, and chaperones; P: Inorganic ion transport and metabolism; Q: Secondary metabolites biosynthesis, transport, and catabolism; T: Signal transduction mechanisms; U: Intracellular trafficking, secretion, and vesicular transport; V: Defence mechanisms; W: Extracellular structures; Y: Nuclear structure; Z: Cytoskeleton; Not assigned: General function prediction and unknown function)
Fig. 4a A neighbour-joining phylogenetic tree from the variation in the core genome (1625 CDS) of 20 C. diphtheriae strains. The scale bar represents the number of substitutions per site. ST designations of the strains are mapped on the tree in the parentheses. b A maximum-likelihood tree from the binary data of the presence or absence of genes in the accessory genome. The scale bar with a distance of 0.05 represents the difference of 118.2 genes