| Literature DB >> 22111657 |
Shivakumara Siddaramappa1, Jean F Challacombe, Alison J Duncan, Allison F Gillaspy, Matthew Carson, Jenny Gipson, Joshua Orvis, Jeremy Zaitshik, Gentry Barnes, David Bruce, Olga Chertkov, J Chris Detter, Cliff S Han, Roxanne Tapia, Linda S Thompson, David W Dyer, Thomas J Inzana.
Abstract
BACKGROUND: Pneumonia and myocarditis are the most commonly reported diseases due to Histophilus somni, an opportunistic pathogen of the reproductive and respiratory tracts of cattle. Thus far only a few genes involved in metabolic and virulence functions have been identified and characterized in H. somni using traditional methods. Analyses of the genome sequences of several Pasteurellaceae species have provided insights into their biology and evolution. In view of the economic and ecological importance of H. somni, the genome sequence of pneumonia strain 2336 has been determined and compared to that of commensal strain 129Pt and other members of the Pasteurellaceae.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22111657 PMCID: PMC3339403 DOI: 10.1186/1471-2164-12-570
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Characteristics of the genomes of H.somni strains
| Genome | Strain 2336 | Strain 129Pt |
|---|---|---|
| Chromosome size | 2263857 bp | 2007700 bp |
| Number of protein coding genes | 1980 | 1792 |
| Overall coding density | 88.56% | 89.6% |
| *Number of subsystems | 247 | 240 |
| G+C content | 37.38% | 37.19% |
| 5S ribosomal RNA genes | 6 | 6 |
| 16S ribosomal RNA genes | 5 | 5 |
| 23S ribosomal RNA genes | 5 | 5 |
| Number of tRNA genes | 49 | 49 |
| Plasmids | None | pHS129 (5178 bp) [ |
| Prophage regions | 4 | 1 |
| Genomic islands | 3 | 6 |
| GenBank accession number | [GenBank: | [GenBank: |
*Subsystems predicted by RAST server [27].
Figure 1Alignment of the chromosomes of strains 2336 (top) and 129Pt (bottom) using MAUVE 2. Identically colored boxes, known as locally collinear blocks (LCBs), depict homologous regions in the two chromosomes. The edges of LCBs indicate chromosome rearrangements due to recombination, insertions, and/or inversions. Sequences of strain 129Pt inverted in relation to those of strain 2336 are shown as blocks below the horizontal line. The vertical lines connect regions of homology among the two chromosomes. Numbers above the maps indicate nucleotide positions within the respective chromosomes.
Figure 2Synteny plot of the chromosomes of strains 2336 and 129Pt generated by NUCmer. Regions of identity between the two chromosomes are shown based on pair-wise alignments. The strain 2336 sequence is represented on the X-axis, and the strain 129Pt sequence is represented on the Y-axis. Numbers indicate nucleotide positions within the respective chromosomes. Plus strand matches are slanted from the bottom left to the upper right corner and are shown in red. Minus strand matches are slanted from the upper left to the lower right and are shown in blue. The number of dots/lines shown in the plot is the same as the number of exact matches found by NUCmer.
Characteristics of the prophage regions and genomic islands of H.somni strains 2336 and 129Pt
| Strain | Annotation | Chromosomal location | Size (bp) | GC % | Number of genes |
|---|---|---|---|---|---|
| 2336 | PR I | 243787 bp to 255490 bp | 11704 | 40.23 | 13 |
| PR II | 759194 bp to 783660 bp | 24467 | 43.95 | 25 | |
| PR III | 1001039 bp to 1049028 bp | 47990 | 39.63 | 55 | |
| PR IV | 1301649 bp to 1329985 bp | 28337 | 39.85 | 39 | |
| GI I | 1877815 bp to 1902856 bp | 25042 | 37.00 | 21 | |
| GI II | 1963256 bp to 1992078 bp | 28823 | 42.81 | 31 | |
| GI III | 2124957 bp to 2143800 bp | 18844 | 36.00 | 18 | |
| 129Pt | GI I | 464154 bp to 472854 bp | 8701 | 37.60 | 13 |
| GI II | 567352 bp to 574742 bp | 7391 | 33.82 | 11 | |
| GI III | 1524138 bp to 1529684 bp | 5547 | 35.41 | 7 | |
| PR I | 1569138 bp to 1610766 bp | 41629 | 39.37 | 49 | |
| GI IV | 1955573 bp to 1961650 bp | 6078 | 37.86 | 8 | |
| GI V | 1962205 bp to 1966694 bp | 4490 | 31.27 | 7 | |
| GI VI | 1969857 bp to 1974596 bp | 4740 | 38.19 | 9 |
Figure 3(A) BLASTN of prophage region III of strain 2336 (47,990 bp) against the chromosome of strain 129Pt and (B) BLASTN of prophage region I of strain 129Pt (41,629 bp) against the chromosome of strain 2336. The numbers below each of the maps refer to the nucleotide positions in the respective chromosomes. The matches are color-coded according to the BLASTN alignment scores (Red < 40, Blue = 40-50, Green = 50-80, Pink = 80-200, and Black ≥ 200; a gray line indicates the absence of a significant match).
Figure 4A matrix view (or heatmap) of homologs of the proteins encoded by the predicted ORFs from prophage regions I-IV of strain 2336 (A-D, respectively) and prophage region I of strain 129Pt (E) within the Prophinder database. The columns on the top represent the locus tags of the putative coding sequences within the respective prophage regions of the two strains of H. somni. The rows on the left represent 13 different bacteriophages within the Profinder database that contain two or more related coding sequences. The matches are color-coded according to the BLASTP E-values (a gradient from red to blue represents E-values from 10e-200 to 0.01 and a gray box indicates the absence of a significant match).
Putative glycosyltransferase genes of H.somni strains
| Strain 2336 locus tag, protein,1(SSR, if present)n | Strain 129Pt locus tag, protein,1(SSR, if present)n | SWISS-PROT accession number | Closest | Closest GT homolog (Outside |
|---|---|---|---|---|
| [GenBank: | [GenBank: | [Swiss-Prot: | None | |
| [GenBank: | None | [Swiss-Prot: | ||
| 3[GenBank: | [GenBank: | [Swiss-Prot | ||
| 3[GenBank: | [GenBank: | [Swiss-Prot | ||
| 4[GenBank: | None | [Swiss-Prot: | None | |
| 5[GenBank: | 5[GenBank: | [Swiss-Prot: | ||
| [GenBank: | None | [Swiss-Prot: | ||
| 6[GenBank: | [GenBank: | [Swiss-Prot: | ||
| 7[GenBank: | None | [Swiss-Prot: | ||
| 8[GenBank: | [GenBank: | [Swiss-Prot: | ||
| 9[GenBank: | [GenBank: | [Swiss-Prot: | ||
| [GenBank: | None | [Swiss-Prot: | ||
| [GenBank: | [GenBank: | [Swiss-Prot: | ||
| 10[GenBank: | None | [Swiss-Prot: | ||
| [GenBank: | [GenBank: | [Swiss-Prot: | [GenBank: | |
| None | [GenBank: | [Swiss-Prot: | ||
| None | [GenBank: | [Swiss-Prot: |
1Simple sequence repeats identified within the predicted ORF.
2Identity at the predicted protein level as compared to strain 2336 or strain 129Pt full-length or partial homolog.
3HSM_0398 and 0399 are predicted to encode lipopolysaccharide heptosyltransferases II and I, respectively.
4HSM_0856 contains 28 copies of CAGT 67 bp upstream of the predicted start codon.
5HS_0636 (lob2D), encoding a putative glycosyl transferase [13], is truncated in strain 2336 (HSM_0975a).
6HSM_0976 (lob2C), encoding a putative glycosyltransferase-like protein, has an 117 bp homolog in strain 129Pt (HS_0636a).
7HSM_0977 (lob2B) has been predicted to encode an N-acetylglucosamine transferase [5].
8HSM_0978 (lob2A), encoding a putative glycosyl transferase, had 66% identity to HSM_0977 (lob2B) at the predicted protein level.
9HSM_0979/HS_0638 (lob1) has been predicted to encode a galactosyl transferase [7].
10HSM_1794 contains 4 copies of CAAGCCTACAAGCCTACAAGCCTA 262 bp upstream of the predicted start codon.
Figure 5Comparison of the chromosomal regions of strains 2336 and 129Pt that encode putative filamentous hemagglutinins and/or subtilisin. The numbers on either side of the maps refer to the nucleotide positions in the respective chromosomes. Although the loci are of different sizes, each map within a locus is drawn to scale. Bold arrows represent genes of interest. Arrows with identical slashes inside them represent orthologous genes among the two strains. Unmarked white arrows represent strain-specific genes, genes found at a different location in the other chromosome, or genes encoding hypothetical proteins. Within each locus, the top portion represents strain 2336 and the bottom portion represents strain 129Pt. Locus I. 1. Hypothetical protein ([GenBank:HSM_0264]), 2. Methylglyoxal synthase, 3. Hypothetical protein ([GenBank:HSM_0266]), 4. Hemolysin activation/secretion protein (fhaC), 5, 6, 7, 8. Filamentous hemagglutinin outer membrane protein (fhaB), t. tRNA, 9. Cystathionine gamma-synthase, 10. Thioredoxin, 11. Chromosomal replication initiator DnaA, 12. Uracil-xanthine permease, 13. Uracil phosphoribosyltransferase, 14. Glutamine amidotransferase class-II, 15. Glutamate dehydrogenase, a,b,c. Chromosome/plasmid partition locus. Locus II. 1. DNA-binding transcriptional activator GutM, 2. Sorbitol-6-phosphate dehydrogenase, 3. PTS system glucitol/sorbitol-specific IIA component, 4. PTS system, glucitol/sorbitol-specific, IIBC subunit, 5. PTS system, glucitol/sorbitol-specific, IIC subunit, A. 153 bp sequence found only in strains 129Pt and 2336, B. 500 bp sequence with homology to type III restriction-modification genes, C. 173 bp sequence found only in strain 129Pt, 6-fhaC, 7. fhaB, T. Transposon-related sequence, 8. UbiH/UbiF/VisC/COQ6 family ubiquinone biosynthesis hydroxylase, 9. Polyprenyl-6-methoxyphenol 4-hydroxylase, 10. Peptidase M24, 11. YecA family protein, 12. Cell division protein ZapA. Locus III. 1. Biotin synthase, 2. Thiamine ABC transporter, ATP-binding protein, 3. Thiamine ABC transporter, inner membrane subunit, 4. Thiamine ABC transporter, periplasmic binding protein, 5. fhaB, 6. fhaC, 7. Na+ antiporter NhaC, 8. Predicted primosomal replication protein N, 9. ECF subfamily RNA polymerase sigma-24 factor, 10. Oxidoreductase molybdopterin binding, 11. Ferric reductase domain-containing protein, 12. Permease, t. tRNA, 13. Ribosomal RNA large subunit methyltransferase L. Locus IV. 1. O-succinylbenzoate synthase, 2. Naphthoate synthase, 3. Phosphopyruvate hydratase, 4. Type I restriction enzyme, specificity subunit, 5. HsdR family type I site-specific deoxyribonuclease, 6. Abi family protein, 7. Restriction-modification system DNA specificity subunit, 8. Filamentation induced by cAMP protein Fic, 9. Type I restriction-modification system M subunit, 10. Phage integrase family protein, 11. Restriction-modification system DNA specificity subunit, 12. Acyltransferase, 13. Transposase, 14. Transposase, 15, 16, 17, 18, 19, 20. fhaB, 21. fhaC, 22. Hypothetical protein, 23. Ubiquinone/menaquinone biosynthesis methyltransferase, 24. Hypothetical protein, 25. Probable ubiquinone biosynthesis protein ubiB, 26, 27, 28. Twin-arginine translocation proteins, 29. Delta-aminolevulinic acid dehydratase, 30. Phosphoribosylglycinamide formyltransferase, 31. S-ribosylhomocysteine lyase (luxS), 32. Transketolase, 33. CTP synthase, 34. Acetyltransferase, a. Restriction-modification system DNA specificity subunit, b. Phage integrase. Locus V. 1. LysR family transcriptional regulator, 2. Phage repressor, 3. Transposase, 4. Phage transposase, 5. Transposase, 6. DNA methylase, 7. Transposase, 8. Transposase, 9. AAA ATPase, 10. Peptidase S8 and S53 subtilisin kexin sedolisin, 11. Permease DsdX, 12. D-serine dehydratase, 13. DNA topoisomerase III, t. tRNA.
Figure 6ClustalW-BOXSHADE multiple sequence alignment of N-terminal fragments of FhaB homologs from strain 2336 and comparison to HpmA of . GenBank accession numbers of protein sequences are as follows: [GenBank:HSM_1651] (fhaB) filamentous hemagglutinin outer membrane protein [GenBank:ACA31420]; [GenBank:HSM_0268], (fhaB) filamentous hemagglutinin outer membrane protein [GenBank:ACA31896]; [GenBank:PMI2057] (hpmA) hemolysin [GenBank:CAR44120]; [GenBank:BP1879] (fhaB) filamentous hemagglutinin/adhesin, [GenBank:CAE42162]; [GenBank:HSM_1489] (fhaB) cysteine protease domain-containing protein [GenBank:ACA31239]; and [GenBank:HSM_1090] (fhaB) filamentous hemagglutinin outer membrane protein [GenBank:ACA30807]. Homologous regions are box-shaded black (identical amino acid residues) and gray (conserved amino acid substitutions). Arrows point to arginine, asparagine, glutamic acid, methionine, and serine residues that are shown to be involved in FHA secretion. Regions that correspond to the secondary structural components of FHA and HpmA are marked with a plus sign. S1 (NXXL), S2 (NXXG), and S3 (CXXC) indicate motifs that may be involved in stabilization of the helical structure of FHA and/or HpmA. Numbers at the end of each sequence denote amino acid positions.
Figure 7Comparison of 8 members of the NCBI protein cluster CLSK923564. Ba; Burkholderia ambifaria AMMD [GenBank:YP_773860], Pn; Polaromonas naphthalenivorans CJ2 [GenBank:YP_982671], Ve; Verminephrobacter eiseniae EF01-2 [GenBank:YP_998470], Xo; Xanthomonas oryzae pv. oryzae PXO99A [GenBank:YP_001913386], Pf; P. fluorescens Pf0-1 [GenBank:YP_351424], Ps; P. syringae pv. phaseolicola 1448A [GenBank:YP_272486], Ec; E. coli O127:H6 str. E2348/69 [GenBank:YP_002329331], Hs; H. somni strain 2336 [GenBank:YP_001785203]. Gene features are shown on the left side: Start and stop codons of ORFs encoding ATPase and sutbilase are marked with plus and asterisk signs, respectively. The length of each ORF is measured from the first base after the start codon to the last base before the stop codon. The 4 bp gene overlap is box-shaded light gray. (B) denotes genes found within prophage-like sequences and (T) denotes genes associated with transposons. ClustalW-BOXSHADE multiple sequence alignment of subtilase motifs containing the catalytic triad is shown on the right side: Homologous regions are box-shaded black (identical amino acid residues) and gray (conserved amino acid substitutions). Arrows point to the conserved residues Asp-His-Ser of subtilases of the D-H-S family.