| Literature DB >> 25887617 |
Bernard J Wolff1, Shatavia S Morrison2, Denise Pesti3, Satishkumar Ranganathan Ganakammal4, Ganesh Srinivasamoorthy4, Shankar Changayil4, M Ryan Weil4, Duncan MacCannell4, Lori Rowe4, Michael Frace4, Branson W Ritchie3, Deborah Dean5, Jonas M Winchell2.
Abstract
Chlamydia psittaci is an obligate intracellular bacterium that can cause significant disease among a broad range of hosts. In humans, this organism may cause psittacosis, a respiratory disease that can spread to involve multiple organs, and in rare untreated cases may be fatal. There are ten known genotypes based on sequencing the major outer-membrane protein gene, ompA, of C. psittaci. Each genotype has overlapping host preferences and virulence characteristics. Recent studies have compared C. psittaci among other members of the Chlamydiaceae family and showed that this species frequently switches hosts and has undergone multiple genomic rearrangements. In this study, we sequenced five genomes of C. psittaci strains representing four genotypes, A, B, D and E. Due to the known association of the type III secretion system (T3SS) and polymorphic outer-membrane proteins (Pmps) with host tropism and virulence potential, we performed a comparative analysis of these elements among these five strains along with a representative genome from each of the remaining six genotypes previously sequenced. We found significant genetic variation in the Pmps and tbl3SS genes that may partially explain differences noted in C. psittaci host infection and disease.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25887617 PMCID: PMC4635502 DOI: 10.1099/mic.0.000097
Source DB: PubMed Journal: Microbiology ISSN: 1350-0872 Impact factor: 2.777
Whole genome sequencing and annotation results
| Strain | |||||
| Characteristic | DD34 | UGA | CP3 | NJ1 | Frances |
| Genotype | A | A | B | D | E |
| GenBank accession no. | |||||
| Total assembled size (bp) | 1 163 748 | 1 164 948 | 1 163 075 | 1 160 660 | 1 162 120 |
| Plasmid size (bp) | 7553 | 7553 | 7553 | 7532 | 7545 |
| Total reads | 13 181 088 | 12 562 928 | 12 695 762 | 7 803 252 | 13 291 438 |
| Total bases | 1 317 642 394 | 1 256 351 619 | 1 268 662 067 | 780 739 971 | 1 327 907 989 |
| Mean read length | 101 | 101 | 101 | 101 | 101 |
| Mean coverage | 576.2 | 548 | 556.1 | 340.4 | 581.1 |
| Number of contigs | 4 | 6 | 4 | 3 | 2 |
| N50 | 778 174 | 779 321 | 778 149 | 775 399 | 1 159 687 |
| Genes predicted | 1057 | 1059 | 1054 | 1057 | 1044 |
| tRNAs predicted | 38 | 38 | 38 | 38 | 38 |
| DNA G+C content (mol%) | 39.03 | 39.02 | 39.01 | 38.95 | 39.03 |
Fig. 1.brig analysis. A brig comparing the C. psittaci genomes sequenced in this study and C. psittaci genomes sequenced by Read . Genotype 6BC was used as a reference for comparison. The innermost ring represents the base position along the map. The second ring indicates the GC content along the length of the genome. The colour rings correspond to the genomes with each genome indicated by a unique colour as indicated in the figure legend. A change in colour corresponds to a decrease in pairwise identity compared to the reference genome. Four regions with sequence divergence are noted in red circles with gene identities listed in orange boxes.
Fig. 2.Whole genome SNP tree. A whole genome SNP tree rooted to C. abortus strain S26/3 reconstructed with all available genome sequences in the NCBI database along with the five genomes sequenced for this study. The strain name is followed by the genotype for each branch of the tree (Strain name–Genotype). Strain names colored in blue were sequenced for this study, names colored in pink were obtained from NCBI, and the genome colored in red was the root genome.
T3SS genes
| Strain: | DD34 | UGA | CP3 | GR9 | NJ1 | Frances | VS225 | RTH | WS/RT/E30 | M56 | WC | ||
| Genotype: | A | A | B | C | D | E | F | G | E/B | M56 | WC | ||
| 6BC locus tag (CP002549) | Name | Pairwise identity (%) | |||||||||||
|
| |||||||||||||
| CPSIT_0074 | Hypothetical protein | 100 | 100 | 100 | >95 | 100 | 100 | 100 | 88 | >95 | >95 | 100 | |
| CPSIT_0245 | Cabohydrate isomerase | 100 | 100 | 100 | >95 | 100 | 100 | 100 | 95 | >95 | >95 | 100 | |
| CPSIT_0313 | Polymorphic membrane protein, G family | 100 | 100 | >95 | 100 | 100 | 100 | 100 | 89 | 100 | 100 | 100 | |
| CPSIT_0357 | Hypothetical protein | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 89 | >95 | 100 | 100 | |
| CPSIT_0397 | Hypothetical protein | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 96 | 100 | 100 | 100 | |
| CPSIT_0421 | Hypothetical protein | 100 | 100 | 10 | 100 | 100 | 100 | 100 | 88 | 100 | >95 | 100 | |
| CPSIT_0429 | Hypothetical protein | 100 | 100 | 100 | 100 | >95 | 100 | 100 | – | 100 | 100 | 100 | |
| CPSIT_0431 | Putative membrane protein | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 93 | 100 | 100 | 100 | |
| CPSIT_0490 | Hypothetical serine rich protein | 100 | 100 | 100 | 100 | 100 | >99 | 100 | 94 | 100 | 100 | 100 | |
| CPSIT_0594 | IncA | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 77 | 100 | >95 | 100 | |
| CPSIT_0602 | Hypothetical protein | 100 | 100 | 100 | >95 | 100 | 100 | 100 | 91 | >95 | 100 | 100 | |
| CPSIT_0656 | Putative integral membrane protein | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 90 | 100 | 100 | 100 | |
| CPSIT_0749 | Hypothetical protein | 100 | 100 | 100 | >80 | 100 | 100 | >80 | 78 | >80 | 100 | 100 | |
| CPSIT_0767 | 3-phosphoshikimate-1-carboxyvinyltransferase | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 95 | 100 | 100 | 100 | |
| CPSIT_0785 | Hypothetical serine rich protein | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 93 | 100 | 100 | 100 | |
| CPSIT_0828 | DNA recombination protein | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 95 | 100 | 100 | 100 | |
| CPSIT_0844 | IncA family protein | 100 | 100 | >95 | – | 90 | >95 | 100 | – | – | >77 | 100 | |
| CPSIT_0930 | tRNA (Uracil-5-)-methyltransferase | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 95 | 100 | 100 | 100 | |
| CPSIT_0933 | Putative membrane protein | >90 | >90 | >90 | 100 | 100 | >95 | 100 | – | 100 | >95 | 100 | |
| CPSIT_0997 | Putative inner membrane protein | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 95 | 100 | 100 | 100 | |
| CPSIT_1054 | 5-formyltetrahydrofolate-cyclo-ligase | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 92 | 100 | 100 | 100 | |
|
| |||||||||||||
| CPSIT_0192 | tarp | 100 | 100 | >95 | >95 | 100 | >95 | >95 | 92 | >95 | >92 | >95 | |
| CPSIT_0220 | Cyclodiphosphate synthase | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 95 | 100 | 100 | 100 | |
| CPSIT_0296 | Hypothetical protein | 100 | 100 | 100 | 100 | 100 | 100 | 100 | – | 100 | >95 | 100 | |
| CPSIT_0314 | Polymorphic membrane protein, G family | 100 | 100 | >95 | >95 | 100 | >95 | >95 | 89 | >95 | >95 | >95 | |
| CPSIT_0422 | Hypothetical protein | 100 | 100 | 100 | 100 | >95 | 100 | 100 | 84 | 100 | >95 | 100 | |
| CPSIT_0461 | Hypothetical protein | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 91 | 100 | 99 | 100 | |
| CPSIT_0463 | Putative inner-membrane protein | 100 | 100 | >98 | 99 | 96 | >99 | 99 | – | 99 | >92 | – | |
| CPSIT_0532 | IncB | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 83 | 100 | 100 | 100 | |
| CPSIT_0555 | Putative inner-membrane protein | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 76 | 100 | 100 | 100 | |
| CPSIT_0580 | Putative inner-membrane protein | 100 | 100 | 100 | 100 | >95 | 100 | 100 | 86 | 100 | 100 | 100 | |
| CPSIT_0689 | Hypothetical protein | 100 | 100 | 100 | 100 | 100 | 100 | 100 | – | 100 | 100 | 100 | |
| CPSIT_0757 | Dihydrodipicolinate reductase | 100 | 100 | >95 | >95 | >95 | >95 | 100 | 100 | >95 | >95 | >95 | |
| CPSIT_0760 | Hypothetical membrane protein | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | |
| CPSIT_0846 | IncA family protein | 100 | 100 | >75 | – | >93 | – | >80 | – | – | >85 | 100 | |
| CPSIT_0856 | Membrane protein | 100 | 100 | 100 | >95 | 100 | 100 | 100 | 94 | >95 | 100 | 100 | |
| CPSIT_0962 | Flagellar biosynthesis/type III secretory pathway | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 97 | 100 | 100 | 100 | |
| CPSIT_0974 | Trigger factor | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 96 | 100 | 100 | 100 | |
| CPSIT_1042 | Deoxyribonucleotide triphosphate pyrophosphatase | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 95 | 100 | 100 | 100 | |
Locus tags highlighted in yellow are absent in one or more strains, and those highlighted in blue are discussed in the manuscript. –, The gene was not present.
Fig. 3.Heatmap analysis of tbl3SS genes. The nucleotide pairwise identities for (a) the apparatus genes and (b) the effector genes are shown for the strains in this study compared with 6BC tbl3SS reference genes. Pairwise identities of ≥ 75 % are shown. Pairwise identifies are coded with the following colour schema: red: 0–59 %, orange: 60 − 69 %, yellow: 70–79 %, green: 80–89 % and blue: 90–100 %.
Polymorphic outer-membrane protein genes
| PMP family | 6 BC locus tag (CP002549) | Strain: | DD34 | UGA | CP3 | GR9 | NJ1 | Frances | VS225 | RTH | WS/RT/E30 | M56 | WC |
| Genotype: | A | A | B | C | D | E | F | G | E/B | M56 | WC | ||
| Pairwise identity (%) | |||||||||||||
| A | CPSIT_0232 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 95 | 100 | >95 | 100 | |
| B/C | CPSIT_0231 | 100 | 100 | 100 | >95 | >95 | 100 | 100 | 89 | >95 | >90 | 100 | |
| D | CPSIT_0856 | 100 | 100 | 100 | >95 | 100 | 100 | 100 | 94 | >95 | 100 | 100 | |
| E/F | CPSIT_0297 | 100 | 100 | 100 | 100 | >95 | 100 | 100 | – | 100 | – | 100 | |
| CPSIT_0298 | 100 | 100 | >95 | >95 | >95 | >95 | >95 | – | >95 | – | 100 | ||
| G | CPSIT_0302 | 100 | 100 | >95 | 100 | >75 | >95 | >95 | 86 | 100 | >84 | >95 | |
| CPSIT_0304 | 100 | >95 | >99 | 100 | 100 | >99 | 100 | 84 | 100 | >95 | 100 | ||
| CPSIT_0305 | >95 | >95 | >94 | >95 | 81 | >95 | >95 | 78 | >95 | >79 | >95 | ||
| CPSIT_0306 | 100 | 100 | >95 | 100 | 100 | 100 | >95 | 93 | >95 | 100 | 100 | ||
| CPSIT_0307 | 100 | 100 | >95 | 100 | 100 | 100 | 100 | 93 | 100 | 100 | 100 | ||
| CPSIT_0309 | >75 | >96 | – | >85 | – | >95 | >95 | 91 | – | >85 | >95 | ||
| CPSIT_0310 | – | – | 80 | – | >76 | – | – | – | – | >79 | >95 | ||
| CPSIT_0311 | – | – | >87 | – | – | – | – | – | – | 75 | >95 | ||
| CPSIT_0312 | >76 | >83 | >76 | >95 | – | >84 | >95 | 89 | – | >90 | >85 | ||
| CPSIT_0313 | 100 | 100 | >95 | 100 | 100 | 100 | 100 | 89 | 100 | 100 | 100 | ||
| CPSIT_0314 | 100 | 100 | >95 | >95 | >95 | >95 | >95 | 89 | >95 | >95 | >95 | ||
| CPSIT_0666 | – | – | >77 | 100 | – | – | >95 | – | >95 | – | – | ||
| CPSIT_0667 | >78 | >78 | – | – | 84 | >95 | >95 | – | – | >85 | >95 | ||
| CPSIT_0668 | 75 | >81 | – | >95 | – | 80 | >95 | 86 | >75 | >90 | >85 | ||
| H | CPSIT_0301 | 100 | 100 | >95 | >95 | 100 | >95 | >95 | 88 | >95 | >95 | 100 | |
| Other | CPSIT_0057 | 100 | 100 | 100 | – | – | 100 | – | 95 | 100 | 100 | – | |
| CPSIT_0207 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 97 | 100 | >95 | 100 | ||
| CPSIT_0300 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 81 | 10 | >95 | 100 | ||
| CPSIT_0329 | 100 | 100 | >95 | >95 | >95 | >95 | >95 | 95 | >95 | >95 | >95 | ||
| CPSIT_0330 | 100 | 100 | >95 | >95 | 100 | >95 | 100 | 92 | 100 | 100 | 100 | ||
| CPSIT_0345 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 96 | 100 | 100 | 100 | ||
| CPSIT_0523 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 95 | 100 | 100 | 100 | ||
| CPSIT_0967 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 97 | 100 | 100 | 100 | ||
| CPSIT_1035 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 95 | 100 | 100 | 100 | ||
“–” indicates the gene was not present.
Fig. 4.Heatmap analysis of Pmp genes. The nucleotide pairwise identities for all 30 C. psittaci Pmp genes are shown for the strains in this study compared with 6BC Pmp reference genes. Pairwise identities of ≥ 75 % are shown. Pairwise identifies are coded with the following colour schema: red: 0–59 %, orange: 60 − 69 %, yellow: 70–79 %, green: 80–89 % and blue: 90–100 %.
Fig. 5.Phylogenetic tree of Pmp genes. Phylogenetic relationships computed using the maximum-likelihood from the conserved Pmp genes found in the C. psittaci genomes included in this study. Protein sequences with a pairwise identity ≥ 75 % to the C. psittaci 6BC genome were included. Bootstrapping percentage values are shown at branch points, and the bar represents the residue substitution per site. The strain name is followed by the genotype for each branch of the tree (Strain name–Genotype).