| Literature DB >> 30383795 |
Tracy L Nicholson1, Brian W Brunelle1, Darrell O Bayles1, David P Alt1, Sarah M Shore1.
Abstract
Haemophilus parasuis is a respiratory pathogen of swine and the etiological agent of Glässer's disease. H. parasuis isolates can exhibit different virulence capabilities ranging from lethal systemic disease to subclinical carriage. To identify genomic differences between phenotypically distinct strains, we obtained the closed whole-genome sequence annotation and genome-wide methylation patterns for the highly virulent Nagasaki strain and for the non-virulent D74 strain. Evaluation of the virulence-associated genes contained within the genomes of D74 and Nagasaki led to the discovery of a large number of toxin-antitoxin (TA) systems within both genomes. Five predicted hemolysins were identified as unique to Nagasaki and seven putative contact-dependent growth inhibition toxin proteins were identified only in strain D74. Assessment of all potential vtaA genes revealed thirteen present in the Nagasaki genome and three in the D74 genome. Subsequent evaluation of the predicted protein structure revealed that none of the D74 VtaA proteins contain a collagen triple helix repeat domain. Additionally, the predicted protein sequence for two D74 VtaA proteins is substantially longer than any predicted Nagasaki VtaA proteins. Fifteen methylation sequence motifs were identified in D74 and fourteen methylation sequence motifs were identified in Nagasaki using SMRT sequencing analysis. Only one of the methylation sequence motifs was observed in both strains indicative of the diversity between D74 and Nagasaki. Subsequent analysis also revealed diversity in the restriction-modification systems harbored by D74 and Nagasaki. The collective information reported in this study will aid in the development of vaccines and intervention strategies to decrease the prevalence and disease burden caused by H. parasuis.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30383795 PMCID: PMC6211672 DOI: 10.1371/journal.pone.0205700
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
General features of the genomes of H. parasuis strains D74 and Nagasaki.
| D74 | Nagasaki | |
|---|---|---|
| Sequence | 9 | 5 |
| 2,467,568 | 2,348,962 | |
| G + C Content (%) | 39.7% | 40.0% |
| 2,252 | 2,268 | |
| Pseudogenes | 95 | 95 |
| 2,157 | 2,173 | |
| rRNA (16S-23S-5S) | 6-6-7 | 6-6-8 |
| 57 | 60 | |
| Phage Regions | 6 | 9 |
| Plasmid | 1 | 0 |
aEncoding either an incomplete predicted protein sequence, a frameshift, or an internal stop codon.
Phage regions identified in H. parasuis strains D74 and Nagasaki.
| Strain | Region # | Start | End | Length | Classification |
|---|---|---|---|---|---|
| Nagasaki | 1 | 75,146 | 95,579 | 20,434 | Incomplete |
| 2 | 578,665 | 613,918 | 35.254 | Intact | |
| 3 | 930,933 | 982,262 | 51.330 | Intact | |
| 4 | 978,209 | 1,022,173 | 43,965 | Intact | |
| 5 | 1,146,622 | 1,187,091 | 40,470 | Intact | |
| 6 | 1,837,359 | 1,870,441 | 33,083 | Intact | |
| 7 | 1,890,250 | 1,901,023 | 10,774 | Incomplete | |
| 8 | 2,053,720 | 2,085,576 | 31,857 | Intact | |
| 9 | 2,113,966 | 2,153,632 | 39,667 | Intact | |
| D74 | 1 | 1,021,542 | 1,030,153 | 8,612 | Incomplete |
| 2 | 1,179,090 | 1,197,266 | 18,177 | Incomplete | |
| 3 | 1,186,455 | 1,216,786 | 30,332 | Incomplete | |
| 4 | 1,303,971 | 1,339,245 | 35,275 | Intact | |
| 5 | 1,390,803 | 1,407,651 | 16,840 | Incomplete | |
| 6 | 1,868,288 | 1,889,812 | 21,525 | Incomplete |
aBasepair chromosomal location.
bLength of region depicted in base pairs.
cClassification determined by PHASTER analysis (http://phaster.ca/) [48].
Fig 1Map of plasmid pD74.
Arrows indicate annotated CDSs; blue arrows represent CDSs with predicted functions based on sequence homology, pink arrows represent predicted CDSs of unknown function. The MFS transporter gene, labeled with a blue-grey arrow, is predicted to be a pseudogene. Dark grey box indicates region containing 4 copies of a 22-bp tandem repeat sequence identified using the Tandem Repeats Finder tool [41]. Arcs inside map indicate sequence with similarity to other sources; green arc represents sequence similar to H. parasuis plasmid pHS-Rec [49], purple arc represents sequence similar to a portion of the Pasteurella trehalosi plasmid pCCK13698 [50], and yellow arc represents sequence similar to a region from the genome sequence of Gallibacterium anantis strain UMN179 [51].
Fig 2Comparison of the genome sequences of H. parasuis strains D74 and Nagasaki.
(A) Distribution of protein-encoding genes in H. parasuis strains Nagasaki and D74. Venn diagram demonstrating the unique 2,173 protein-encoding genes in Nagasaki (blue), the 2,157 protein-encoding genes D74 (yellow), and the shared coding sequences defined as bi-directional best hits (center circle, green) via 1-to-1 reciprocal BLASTP comparison using RAST [38] (excluding frame-shifted genes). (B) Comparison of the linear organization of the H. parasuis strains Nagasaki and D74 chromosomes. Mauve [40] was used to compare the chromosomes of Nagasaki and D74. Locally collinear blocks (LCBs) representing regions of sequence that align in each genome are illustrated as colored rectangles connected by lines. Nagasaki was used as the reference sequence. In D74, LCBs placed above the center line are in the same orientation as in Nagasaki and LCBs placed below the center line are in the reverse orientation relative to Nagasaki. Blank sections are regions that did not align and are likely to contain unique or strain-specific sequence. Many of the larger blank regions not in LCBs contain loci predicted to encode products of phage origin.
Predicted virulence-associated genes identified in H. parasuis D74.
| Group | D74 locus_tag | D74 Name | Product/ Function | Hit | Nagasaki locus_tag | % Identity |
|---|---|---|---|---|---|---|
| Adhesin | A2U20_02695 | Type IV pilus biogenesis/stability protein PilW | bi | A2U21_08925 | 100 | |
| A2U20_03215 | Type II secretory pathway, pseudopilin PulG | bi | A2U21_08415 | 97 | ||
| A2U20_03220 | Type II secretory pathway, component PulJ | bi | A2U21_08410 | 96 | ||
| A2U20_03425 | Type IV pilus biogenesis protein PilM | bi | A2U21_08215 | 98 | ||
| A2U20_03430 | Type IV pilus biogenesis protein PilN | bi | A2U21_08210 | 98 | ||
| A2U20_03445 | Type IV pilus biogenesis PilQ | bi | A2U21_08195 | 94 | ||
| A2U20_03490 | Outer membrane protein P5 | bi | A2U21_08150 | 92 | ||
| A2U20_03710 | Outer membrane protein precursor P1 | bi | A2U21_07950 | 90 | ||
| A2U20_03770 | pilT domain-containing protein | uni | A2U21_06460 | 56 | ||
| A2U20_04380 | Outer membrane protein P2 precursor | bi | A2U21_07315 | 85 | ||
| A2U20_04655 | Surface antigen (D15), outer membrane protein | bi | A2U21_07040 | 99 | ||
| A2U20_04855 | Outer membrane protein A | bi | A2U21_06820 | 98 | ||
| A2U20_06825 | Filamentous hemagglutinin transporter protein FhaC | uni | A2U21_11135 | 52 | ||
| A2U20_07210 | Type IV pilin PilA | bi | A2U21_10860 | 87 | ||
| A2U20_07215 | Type IV fimbrial assembly ATPase PilB | bi | A2U21_10855 | 98 | ||
| A2U20_07220 | Type IV fimbrial assembly protein PilC | bi | A2U21_10850 | 99 | ||
| A2U20_07225 | Tfp pilus assembly pathway, fimbrial leader peptidase | bi | A2U21_10845 | 94 | ||
| A2U20_08340 | Type V secretory pathway, adhesin AidA | bi | A2U21_04065 | 72 | ||
| A2U20_08910 | Adhesin BmaC autotransporter | uni | A2U21_04065 | 51 | ||
| A2U20_08920 | Putative F17-like fimbrial usher | uni | A2U21_03995 | 98 | ||
| A2U20_08925 | Putative F17-like fimbrial usher | bi | A2U21_03995 | 99 | ||
| Hemolysin | A2U20_02515 | Serralysin C | uni | A2U21_00470 | 58 | |
| A2U20_06885 | Hemagglutinin/hemolysin-related protein | - | ||||
| A2U20_07380 | Hemolysin secretion protein D | - | ||||
| A2U20_08300 | Hemolysin transporter protein ShlB | bi | A2U21_11135 | 53 | ||
| A2U20_08375 | Osmotically-inducible protein OsmY; putative hemolysin | bi | A2U21_04100 | 100 | ||
| A2U20_09890 | Hemolysin regulation protein AhpA | bi | A2U21_02620 | 99 | ||
| A2U20_10965 | Serralysin B, hemolysin-type calcium-binding region | uni | A2U21_00470 | 56 | ||
| A2U20_10970 | Serralysin B, hemolysin-type calcium-binding region | bi | A2U21_00470 | 69 | ||
| Secretion | A2U20_01865 | Putative secretory immunoglobulin A-binding protein | bi | A2U21_08055 | 51 | |
| A2U20_09675 | Putative periplasmic/secreted protein | bi | A2U21_02375 | 93 | ||
| Toxin | A2U20_00255 | EbgC protein_Toxin-antitoxin biofilm protein TabA | bi | A2U21_00620 | 98 | |
| A2U20_01495 | Antitoxin HigA_mRNA interferase antitoxin | uni | A2U21_00995 | 63 | ||
| A2U20_01500 | Addiction module toxin HicA | uni | A2U21_01000 | 59 | ||
| A2U20_01850 | VapC toxin family PIN domain ribonuclease | bi | A2U21_10000 | 78 | ||
| A2U20_02115 | Serine/threonine-protein kinase toxin HipA | bi | A2U21_09500 | 95 | ||
| A2U20_02705 | Antitoxin HigA_mRNA interferase antitoxin | bi | A2U21_08915 | 93 | ||
| A2U20_02710 | mRNA interferase toxin HigB | bi | A2U21_08910 | 96 | ||
| A2U20_02805 | Putative toxin and drug export protein A | bi | A2U21_08785 | 97 | ||
| A2U20_02840 | Addiction module toxin RelE | bi | A2U21_03035 | 59 | ||
| A2U20_03250 | Programmed cell death toxin MazF | uni | A2U21_06190 | 52 | ||
| A2U20_03635 | Aaddiction module toxin HicA | bi | A2U21_02950 | 94 | ||
| A2U20_04570 | Antitoxin HigA_mRNA interferase antitoxin | uni | A2U21_00995 | 63 | ||
| A2U20_04795 | Antitoxin/toxin system zeta toxin | uni | A2U21_03130 | 53 | ||
| A2U20_04825 | Cytolethal distending toxin subunit CdtC | bi | A2U21_06570 | 99 | ||
| A2U20_04830 | Cytolethal distending toxin subunit CdtB | uni | A2U21_06565 | 95 | ||
| A2U20_04835 | Cytolethal distending toxin subunit CdtA | bi | A2U21_06560 | 99 | ||
| A2U20_04880 | mRNA interferase toxin HigB | uni | A2U21_05325 | 55 | ||
| A2U20_04885 | Antitoxin HigA_mRNA interferase antitoxin | uni | A2U21_08915 | 56 | ||
| A2U20_05030 | Antitoxin HigA_mRNA interferase antitoxin | bi | A2U21_06130 | 98 | ||
| A2U20_05035 | mRNA interferase toxin HigB | bi | A2U21_06135 | 99 | ||
| A2U20_05120 | Cytolethal distending toxin subunit CdtC | uni | A2U21_06570 | 98 | ||
| A2U20_05125 | Cytolethal distending toxin subunit CdtB | bi | A2U21_06565 | 96 | ||
| A2U20_05130 | Cytolethal distending toxin subunit CdtA | uni | A2U21_06560 | 99 | ||
| A2U20_05385 | Antitoxin HigA_mRNA interferase antitoxin | bi | A2U21_06310 | 99 | ||
| A2U20_05390 | mRNA interferase toxin HigB | bi | A2U21_06305 | 100 | ||
| A2U20_05455 | mRNA interferase toxin HigB | bi | A2U21_06250 | 99 | ||
| A2U20_05460 | Antitoxin HigA_mRNA interferase antitoxin | bi | A2U21_06245 | 100 | ||
| A2U20_05480 | Contact-dependent growth inhibition (CDI) toxin | - | ||||
| A2U20_05555 | Antitoxin ChpS | bi | A2U21_06185 | 99 | ||
| A2U20_05585 | Toxin-antitoxin biofilm protein TabA | bi | A2U21_06150 | 100 | ||
| A2U20_05690 | Addiction module toxin HicA | bi | A2U21_04705 | 62 | ||
| A2U20_06620 | Antitoxin HigA_mRNA interferase antitoxin | bi | A2U21_05330 | 93 | ||
| A2U20_06625 | mRNA interferase toxin HigB | bi | A2U21_05325 | 99 | ||
| A2U20_06830 | Contact-dependent growth inhibition (CDI) toxin | - | ||||
| A2U20_06850 | Contact-dependent growth inhibition (CDI) toxin | - | ||||
| A2U20_06865 | Contact-dependent growth inhibition (CDI) toxin | - | ||||
| A2U20_06875 | Contact-dependent growth inhibition (CDI) toxin | - | ||||
| A2U20_07065 | Antitoxin/toxin system zeta toxin | bi | A2U21_03130 | 75 | ||
| A2U20_07375 | Leukotoxin export ATP-binding protein LtxB | uni | A2U21_07020 | 52 | ||
| A2U20_08285 | Contact-dependent growth inhibition (CDI) toxin | - | ||||
| A2U20_08295 | Contact-dependent growth inhibition (CDI) toxin | - | ||||
| A2U20_08515 | Persistence and stress-resistance antitoxin PasI | bi | A2U21_04275 | 98 | ||
| A2U20_10085 | Antitoxin HicB | bi | A2U21_00970 | 100 | ||
| A2U20_10110 | Antitoxin HigA_mRNA interferase antitoxin | bi | A2U21_00995 | 98 | ||
| A2U20_10115 | mRNA interferase toxin HigB | bi | A2U21_01000 | 99 | ||
| A2U20_10565 | Serine/threonine-protein kinase toxin HipA | bi | A2U21_00565 | 99 | ||
| Other | A2U20_00515 | Sialidase | bi | A2U21_11540 | 99 | |
| A2U20_01700 | Putative extracellular serine protease | bi | A2U21_09655 | 75 | ||
| A2U20_01715 | Putative extracellular serine protease | bi | A2U21_09640 | 78 | ||
| A2U20_03355 | Putative VacJ lipoprotein | bi | A2U21_08285 | 97 | ||
| A2U20_05150 | Regulator of disulfide bond formation | bi | A2U21_06540 | 99 | ||
| A2U20_05155 | Invasion protein expression up-regulator SirB | bi | A2U21_06535 | 100 | ||
| A2U20_07890 | Superoxide dismutase | bi | A2U21_03770 | 100 | ||
| A2U20_08325 | Superoxide dismutase | bi | A2U21_04050 | 92 |
a The Hit column contains a '-' (no hit), 'uni' or 'bi' RAST server results from a one-to-one BLASTP comparison of the protein coding sequences in the Nagasaki genome using the D74 genome as the reference. “bi” represents a bidirectional best hit in which the reverse hit from the Nagasaki comparison genome to the D74 reference genome was also the best hit. “uni” indicates a uni-directional hit in which the reverse hit from the comparison genome to the reference genome was not also the best hit. “-”indicates no hit or match was found.
bGlobal pairwise nucleotide percent sequence identity.
Predicted virulence-associated genes identified in H. parasuis Nagasaki.
| Group | Nagasaki locus_tag | Nagasaki Name | Product/ Function | Hit | D74 locus_tag | % Identity |
|---|---|---|---|---|---|---|
| Adhesin | A2U21_02720 | Type V secretory pathway, adhesin AidA | - | |||
| A2U21_03995 | Putative F17-like fimbrial usher | bi | A2U20_08925 | 99 | ||
| A2U21_04065 | Type V secretory pathway, adhesin AidA | bi | A2U20_08340 | 72 | ||
| A2U21_06820 | Outer membrane protein A precursor | bi | A2U20_04855 | 98 | ||
| A2U21_07040 | Surface antigen (D15), outer membrane protein | bi | A2U20_04655 | 99 | ||
| A2U21_07315 | Outer membrane protein P2 precursor | bi | A2U20_04380 | 85 | ||
| A2U21_07950 | Outer membrane protein precursor P1 | bi | A2U20_03710 | 90 | ||
| A2U21_08150 | putative outer membrane protein P5 | bi | A2U20_03490 | 92 | ||
| A2U21_08195 | Type IV pilus biogenesis protein PilQ | bi | A2U20_03445 | 94 | ||
| A2U21_08210 | Type IV pilus biogenesis protein PilN | bi | A2U20_03430 | 98 | ||
| A2U21_08215 | Type IV pilus biogenesis protein PilM | bi | A2U20_03425 | 98 | ||
| A2U21_08410 | Type II secretory pathway, component PulJ | bi | A2U20_03220 | 96 | ||
| A2U21_08415 | Type II secretory pathway, pseudopilin PulG | bi | A2U20_03215 | 97 | ||
| A2U21_08925 | Type IV pilus biogenesis/stability protein PilW | bi | A2U20_02695 | 100 | ||
| A2U21_10845 | Tfp pilus assembly pathway, fimbrial leader peptidase | bi | A2U20_07225 | 94 | ||
| A2U21_10850 | Type IV fimbrial assembly protein PilC | bi | A2U20_07220 | 99 | ||
| A2U21_10855 | Type IV fimbrial assembly ATPase PilB | bi | A2U20_07215 | 98 | ||
| A2U21_10860 | Type IV pilin PilA | bi | A2U20_07210 | 87 | ||
| Hemolysin | A2U21_00465 | Putative hemagglutinin/hemolysin-related protein | - | |||
| A2U21_00470 | Serralysin B, hemolysin-type calcium-binding region | bi | A2U20_10970 | 69 | ||
| A2U21_02620 | Hemolysin regulation protein AhpA | bi | A2U20_09890 | 99 | ||
| A2U21_04100 | Osmotically-inducible protein OsmY_putative hemolysin | bi | A2U20_08375 | 100 | ||
| A2U21_11130 | Hemolysin transporter protein ShlB | - | ||||
| A2U21_11135 | Hemolysin transporter protein ShlB | bi | A2U20_08300 | 53 | ||
| A2U21_11140 | putative hemolysin | - | ||||
| A2U21_11145 | Putative hemolysin | - | ||||
| A2U21_11150 | Hemolysin | - | ||||
| A2U21_02375 | Periplasmic/secreted protein | bi | A2U20_09675 | 93 | ||
| A2U21_08050 | Putative secretory immunoglobulin A-binding protein | - | ||||
| A2U21_08055 | Putative secretory immunoglobulin A-binding protein | bi | A2U20_01865 | 51 | ||
| Toxin | A2U21_00565 | Serine/threonine-protein kinase toxin HipA | bi | A2U20_10565 | 99 | |
| A2U21_00620 | EbgC protein_Toxin-antitoxin biofilm protein TabA | bi | A2U20_00255 | 98 | ||
| A2U21_00970 | Antitoxin HicB | bi | A2U20_10085 | 100 | ||
| A2U21_00995 | Antitoxin HigA_mRNA interferase antitoxin | bi | A2U20_10110 | 98 | ||
| A2U21_01000 | mRNA interferase toxin HigB | bi | A2U20_10115 | 99 | ||
| A2U21_02950 | Addiction module toxin HicA | bi | A2U20_03635 | 94 | ||
| A2U21_03035 | Addiction module toxin RelE | bi | A2U20_02840 | 59 | ||
| A2U21_03130 | Antitoxin/toxin system zeta toxin | bi | A2U20_07065 | 75 | ||
| A2U21_03940 | Toxin-antitoxin system, antitoxin component | - | ||||
| A2U21_04275 | Persistence and stress-resistance antitoxin PasI | bi | A2U20_08515 | 98 | ||
| A2U21_04650 | Antitoxin HigA_mRNA interferase antitoxin | uni | A2U20_01495 | 54 | ||
| A2U21_04705 | Addiction module toxin HicA | bi | A2U20_05690 | 62 | ||
| A2U21_04735 | Addiction module toxin HicA | uni | A2U20_03635 | 94 | ||
| A2U21_04850 | mRNA interferase toxin RelE | - | ||||
| A2U21_05325 | mRNA interferase toxin HigB | bi | A2U20_06625 | 99 | ||
| A2U21_05330 | Antitoxin HigA_mRNA interferase antitoxin | bi | A2U20_06620 | 93 | ||
| A2U21_05725 | Addiction module toxin RelE | - | ||||
| A2U21_05930 | Antitoxin HicB | - | ||||
| A2U21_06130 | Antitoxin HigA_mRNA interferase antitoxin | bi | A2U20_05030 | 98 | ||
| A2U21_06135 | mRNA interferase toxin HigB | bi | A2U20_05035 | 99 | ||
| A2U21_06150 | Putative Toxin-antitoxin biofilm protein TabA | bi | A2U20_05585 | 100 | ||
| A2U21_06185 | Antitoxin | bi | A2U20_05555 | 99 | ||
| A2U21_06190 | Programmed cell death toxin PemK | bi | A2U20_05550 | 97 | ||
| A2U21_06245 | Antitoxin HigA_mRNA interferase antitoxin | bi | A2U20_05460 | 100 | ||
| A2U21_06250 | mRNA interferase toxin HigB | bi | A2U20_05455 | 99 | ||
| A2U21_06305 | mRNA interferase toxin HigB | bi | A2U20_05390 | 100 | ||
| A2U21_06310 | Antitoxin HigA_mRNA interferase antitoxin | bi | A2U20_05385 | 99 | ||
| A2U21_06560 | Toxin | bi | A2U20_04835 | 99 | ||
| A2U21_06565 | Cytolethal distending toxin subunit CdtB | bi | A2U20_05125 | 96 | ||
| A2U21_06570 | Toxin | bi | A2U20_04825 | 99 | ||
| A2U21_06840 | Toxin | uni | A2U20_04835 | 99 | ||
| A2U21_06845 | Cytolethal distending toxin subunit CdtB | uni | A2U20_04830 | 94 | ||
| A2U21_06850 | Toxin | uni | A2U20_04825 | 94 | ||
| A2U21_08015 | Addiction module toxin RelE | - | ||||
| A2U21_08020 | RelB/StbD replicon stabilization protein; antitoxin to RelE/StbE | - | ||||
| A2U21_08785 | Putative memebrane toxin/ drug export protein A | bi | A2U20_02805 | 97 | ||
| A2U21_08910 | mRNA interferase toxin HigB | bi | A2U20_02710 | 96 | ||
| A2U21_08915 | Antitoxin HigA_mRNA interferase antitoxin | bi | A2U20_02705 | 93 | ||
| A2U21_09500 | Toxin HipA | bi | A2U20_02115 | 95 | ||
| A2U21_09900 | Virulence-associated protein D; endoribonuclease | - | ||||
| A2U21_10000 | VapC toxin family PIN domain ribonuclease | bi | A2U20_01850 | 78 | ||
| Other | A2U21_03770 | Superoxide dismutase | bi | A2U20_07890 | 100 | |
| A2U21_04050 | Superoxide dismutase | bi | A2U20_08325 | 92 | ||
| A2U21_06535 | Invasion protein expression up-regulator SirB | bi | A2U20_05155 | 100 | ||
| A2U21_06540 | Regulator of disulfide bond formation | bi | A2U20_05150 | 99 | ||
| A2U21_08285 | putative VacJ lipoprotein | bi | A2U20_03355 | 97 | ||
| A2U21_09640 | Putative serine protease | bi | A2U20_01715 | 78 | ||
| A2U21_09655 | Putative serine protease | bi | A2U20_01700 | 75 | ||
| A2U21_11540 | Sialidase | bi | A2U20_00515 | 99 |
a The Hit column contains a '-' (no hit), 'uni' or 'bi' RAST server results from a one-to-one BLASTP comparison of the protein coding sequences in the Nagasaki genome using the D74 genome as the reference. “bi” represents a bidirectional best hit in which the reverse hit from the Nagasaki comparison genome to the D74 reference genome was also the best hit. “uni” indicates a uni-directional hit in which the reverse hit from the comparison genome to the reference genome was not also the best hit. “-”indicates no hit or match was found.
bGlobal pairwise nucleotide percent sequence identity.
Fig 3cdiA genes encoding putative contact-dependent growth inhibition proteins identified in H. parasuis D74.
(A) Organization of cdi loci in D74. The three genomic regions containing putative cdiA genes are depicted. Gene function was assigned based on the results of BLASTX searches. Each arrow represents a gene within the locus; direction of the arrow indicates orientation within the closed genome sequence. Dark blue arrows represent the cdiA genes, red arrows represent cdiB homologues, light blue arrows represent potential “orphan” cdiA genes, yellow arrows represent putative cdiI candidates, and grey arrows represent genes predicted to have functions relating to horizontal gene transmission, black arrows represent genes whose predicted function is unrelated to contact-dependent inhibition. Genomic regions are not shown to scale. (B) Domain architecture of the predicted CdiA proteins. Domain content of the CdiA proteins was determined using a pfam database search. Grey boxes represent ESPR signal peptide domains (PF13018), green boxes represent Haemagg_act domains (PF05860), light blue boxes represent Fil_haemagg domains (PF05594), orange boxes represent Fil_haemagg_2 domains (PF13332), red boxes represent PT-VENN domains (PF04829), and purple boxes represent EndoU_bacteria domains (PF14436).
vtaA genes identified in H. parasuis Nagasaki.
| Nagasaki | Nagasaki | Name assigned by Pina et al. [ | Group assigned by Pina et al. [ | Hit | D74 | D74 Name | % Identity |
|---|---|---|---|---|---|---|---|
| A2U21_00055 | vtaA_N1 | vtaA1 | 1 | - | 0 | ||
| A2U21_00360 | vtaA_N2 | vtaA10 | 2 | uni | A2U20_04585 | vtaA_D1 | 49.73 |
| A2U21_00905 | vtaA_N3 | vtaA4 | 1 | - | 0 | ||
| A2U21_03315 | vtaA_N4 | vtaA8 | 1 | bi | A2U20_05995 | vtaA_D3 | 36.84 |
| A2U21_04400 | vtaA_N5 | vtaA9 | 1 | uni | A2U20_04585 | vtaA_D1 | 30.74 |
| A2U21_05305 | vtaA_N6 | vtaA6 | 1 | - | 0 | ||
| A2U21_05400 | vtaA_N7 | vtaA2 | 1 | - | 0 | ||
| A2U21_05640 | vtaA_N8 | vtaA11 | 2 | uni | A2U20_04585 | vtaA_D1 | 50.94 |
| A2U21_06175 | vtaA_N9 | vtaA3 | 1 | - | 0 | ||
| A2U21_06350 | vtaA_N10 | vtaA12 | 1 | bi | A2U20_04585 | vtaA_D1 | 94.09 |
| A2U21_07110 | vtaA_N11 | vtaA13 | 1 | uni | A2U20_04585 | vtaA_D1 | 59.87 |
| A2U21_08125 | vtaA_N12 | vtaA5 | 3 | - | 0 | ||
| A2U21_09355 | vtaA_N13 | vtaA7 | 3 | - | 0 |
a The Hit column contains a '-' (no hit), 'uni' or 'bi' RAST server results from a one-to-one BLASTP comparison of the protein coding sequences in the D74 genome using the Nagasaki genome as the reference. “bi” represents a bidirectional best hit in which the reverse hit from the D74 comparison genome to the Nagasaki reference genome was also the best hit. “uni” represents uni-directional indicates a hit in which the reverse hit from the comparison genome to the reference genome was not also the best hit. “-”indicates no hit or match was found.
bLocal percent sequence identity.
c18 amino acid insertion compared to the Pina et al. [15] sequence.
d Single amino acid difference at position 1191.
Fig 4Domain architecture of the predicted VtaA proteins in H. parasuis strains D74 and Nagasaki.
(A) Domain architecture of the predicted VtaA proteins from Nagasaki. Assigned gene name designations are shown at left. Schematic depictions of the 13 Nagasaki VtaA proteins is shown. The domains were identified by a pfam database search. ESPR signal peptides (PF13018) are shown in purple, YadA head domains (PF05658) in red, YadA stalk domains (PF05662) in blue, YadA anchor domains (PF03895) in green, collagen domains (PF1391) in orange, TAA-Trp-ring domains (PF15401) in yellow. (B) Domain architecture of the predicted VtaA proteins from D74. Assigned gene name designations are shown at left. Schematic depictions of the pfam domains, colored as in Panel A, for the 3 VtaA proteins from D74.
Methylation motifs detected in H. parasuis D74.
| Motif | Modification Type | # Detected | # in Genome | % Detected | Mean Modification QV | Mean Motif Coverage | Partner Motif |
|---|---|---|---|---|---|---|---|
| m6A | 767 | 772 | 99.4% | 535.6 | 376.2 | ||
| m6A | 817 | 825 | 99.0% | 436.6 | 380.3 | ||
| m6A | 813 | 825 | 98.5% | 421.3 | 381.9 | ||
| m6A | 623 | 630 | 98.9% | 442.4 | 384.0 | ||
| m6A | 15,534 | 15,718 | 98.8% | 410.7 | 381.5 | ||
| m6A | 217 | 220 | 98.6% | 420.1 | 378.9 | ||
| m6A | 55 | 57 | 96.5% | 397.1 | 348.8 | ||
| m6A | 47 | 55 | 85.5% | 346.8 | 365.8 | ||
| m6A | 316 | 659 | 48.0% | 99.1 | 382.2 | ||
| m6A | 207 | 447 | 46.3% | 131.9 | 391.1 | ||
| m6A | 662 | 2,632 | 25.2% | 108.6 | 376.7 | ||
| modified_base | 1,575 | 8,346 | 18.9% | 52.3 | 356.8 | ||
| modified_base | 179,073 | 1,197,696 | 15.0% | 52.7 | 367.5 | ||
| modified_base | 9,920 | 78,834 | 12.6% | 48.6 | 366.4 | ||
| m6A | 15,854 | 200,853 | 7.9% | 112.6 | 371.8 |
aBold underlined bases indicate methylated base in motif sequence.
bBase modification not identified or recognized by software.
Methylation motifs detected in H. parasuis Nagasaki.
| Motif | Modification | # | # in Genome | % Detected | Mean Modification QV | Mean Motif Coverage | Partner Motif |
|---|---|---|---|---|---|---|---|
| m6A | 1,322 | 1,324 | 99.8% | 719.7 | 571.6 | ||
| m6A | 1,089 | 1,092 | 99.7% | 676.2 | 592.7 | ||
| m6A | 1,088 | 1,092 | 99.6% | 700.0 | 622.7 | ||
| m6A | 217 | 218 | 99.5% | 641.8 | 610.1 | ||
| m6A | 215 | 218 | 98.6% | 671.3 | 582.9 | ||
| m6A | 14,603 | 14,778 | 98.8% | 602.6 | 617.8 | ||
| m6A | 454 | 460 | 98.7% | 688.5 | 590.4 | ||
| m6A | 303 | 1,013 | 29.9% | 130.6 | 608.1 | ||
| m6A | 721 | 2,881 | 25.0% | 107.0 | 582.3 | ||
| modified_base | 181,179 | 1,123,132 | 16.1% | 58.7 | 579.9 | ||
| modified_base | 9,439 | 61,834 | 15.3% | 54.8 | 582.8 | ||
| modified_base | 7,871 | 64,478 | 12.2% | 52.9 | 587.1 | ||
| m6A | 11,510 | 134,477 | 8.6% | 178.5 | 591.9 | ||
| m6A | 1,133 | 24,217 | 4.7% | 140.2 | 629.3 |
aBold underlined bases indicate methylated base in motif sequence.
bBase modification not identified or recognized by software.
Putative H. parasuis D74 restriction modification systems.
| Type | Gene | D74 locus_tag | D74 Name | Predicted Recognition Sequence | REBASE Name |
|---|---|---|---|---|---|
| I | M | A2U20_05240 | hsdM | M.Hpa74ORFFP | |
| I | S | A2U20_05245 | hsdS | S.Hpa74ORFFP | |
| I | R | A2U20_05260 | hsdR2 | Hpa74ORFFP | |
| I | R | A2U20_10200 | hsdR3 | Hpa74ORFIP | |
| I | S | A2U20_10215 | hsdS2 | S1.Hpa74ORFIP | |
| I | S | A2U20_10220 | hypothetical protein CDS | S2.Hpa74ORFIP | |
| I | M | A2U20_10230 | hsdM2 | M.Hpa74ORFIP | |
| I | R | A2U20_10460 | hsdR4 | Hpa74ORFJP | |
| I | S | A2U20_11315 | hsdS3 | S.Hpa74III | |
| I | R | A2U20_11320 | hsdR5 | Hpa74III | |
| I | M | A2U20_11330 | hsdM3 | M.Hpa74III | |
| II | M | A2U20_00375 | hindIIIM | M.Hpa74I | |
| II | R | A2U20_00380 | hindIIIR | Hpa74IP | |
| II | RM | A2U20_00575 | bcgIA | Hpa74ORFBP | |
| II | S | A2U20_00580 | bcgIB | S.Hpa74ORFBP | |
| II | S | A2U20_00800 | bcgIB | S1.Hpa74ORFDP | |
| II | S | A2U20_00805 | bcgIB2 | S2.Hpa74ORFDP | |
| II | M | A2U20_03405 | dam | Hpa74II | |
| II | RM | A2U20_07465 | hypothetical protein CDS | Hpa74ORFGP | |
| II | R | A2U20_07700 | restriction endonuclease CDS | Hpa74ORFHP | |
| II | M | A2U20_07705 | haeIIIM | M.Hpa74ORFHP | |
| II | M | A2U20_10010 | yhdJ | M.Hpa74ORFOP | |
| II | M | A2U20_10475 | modification methylase CDS | M.Hpa74ORFLP | |
| II | R | A2U20_10480 | HNH endonuclease CDS | Hpa74ORFLP | |
| II | S | A2U20_10485 | bcgIB3 | S.Hpa74ORFMP | |
| II | RM | A2U20_10490 | bcgIA2 | Hpa74ORFMP |
aSystems were designated Type I, II, or III based on REBASE analyses.
bGene designations of methylase (M), restriction (R), fused restriction-modification (RM), or specificity (S), along with predicted recognition sequence and REBASE name, were determined using REBASE analysis.
CPredicted pseudogene.
Putative H. parasuis Nagasaki restriction modification systems.
| Type | Gene | Nagasaki locus_tag | Nagasaki Name | Predicted Recognition Sequence | REBASE Name |
|---|---|---|---|---|---|
| I | M | A2U21_00095 | hsdM | M.HpaNNII | |
| I | R | A2U21_00110 | hsdR | HpaNNIIP | |
| I | S | A2U21_00115 | hsdS | S.HpaNNII | |
| I | R | A2U21_01085 | hsdR2 | HpaNNORFEP | |
| I | S | A2U21_01100 | hsdS2 | S1.HpaNNORFEP | |
| I | S | A2U21_01105 | hypothetical protein CDS | S2.HpaNNORFEP | |
| I | M | A2U21_01115 | type I restriction-modification system subunit M CDS | M.HpaNNORFEP | |
| I | R | A2U21_04210 | hsdR4 | HpaNNORFIP | |
| I | S | A2U21_04220 | hsdS3 | S1.HpaNNORFEP | |
| I | S | A2U21_04225 | hsdS4 | S2.HpaNNORFEP | |
| I | M | A2U21_04230 | hsdM2 | M.HpaNNORFEP | |
| I | R | A2U21_06440 | hsdR5 | HpaNNORFJP | |
| I | S | A2U21_06445 | hsdS5 | S.HpaNNORFJP | |
| I | M | A2U21_06450 | hsdM3 | M.HpaNNORFJP | |
| II | S | A2U21_00855 | bcgIB | S1.HpaNNORFDP | |
| II | S | A2U21_00860 | bcgIB2 | S2.HpaNNORFDP | |
| II | RM | A2U21_00865 | bcgIA | HpaNNORFDP | |
| II | M | A2U21_03345 | hypothetical protein CDS | M.HpaNNORFGP | |
| II | M | A2U21_03850 | hhalM | M.HpaNNORFHP | |
| II | R | A2U21_03855 | type II RM endonuclease | HpaNNORFHP | |
| II | M | A2U21_06885 | hpaIIM | M.HpaNNORFAP | |
| II | M | A2U21_08235 | dam | M.HpaNNI | |
| II | M | A2U21_09040 | bspRIM | M.HpaNNORFLP | |
| II | R | A2U21_09045 | hypothetical protein CDS | HpaNNORFLP | |
| II | M | A2U21_09985 | restriction endonuclease subunit M CDS | M1.HpaNNORFMP | |
| II | M | A2U21_09990 | restriction endonuclease CDS | M2.HpaNNORFMP | |
| II | RM | A2U21_09995 | restriction endonuclease CDS | HpaNNORFMP | |
| II | M | A2U21_10160 | hypothetical protein CDS | M.HpaNNORFNP | |
| II | M | A2U21_10520 | hypothetical protein CDS | M.HpaNNORFOP | |
| III | M | A2U21_01380 | restriction endonuclease subunit M CDS | M1.HpaNNORFFP | |
| III | M | A2U21_01385 | bamHIM | M2.HpaNNORFFP | |
| III | R | A2U21_01390 | type III restriction endonuclease subunit R CDS | HpaNNORFFP | |
| III | M | A2U21_11010 | site-specific DNA-methyltransferase CDS | M.HpaNNORFPP | |
| III | R | A2U21_11015 | restriction endonuclease CDS | HpaNNORFPP |
aSystems were designated Type I, II, or III based on REBASE analyses.
bGene designations of methylase (M), restriction (R), fused restriction-modification (RM), or specificity (S), along with predicted recognition sequence and REBASE name, were determined using REBASE analysis.