| Literature DB >> 22720110 |
Lorenzo Giacani1, Sujay Chattopadhyay, Arturo Centurion-Lara, Brendan M Jeffrey, Hoavan T Le, Barbara J Molini, Sheila A Lukehart, Evgeni V Sokurenko, Daniel D Rockey.
Abstract
In the rabbit model of syphilis, infection phenotypes associated with the Nichols and Chicago strains of Treponema pallidum (T. pallidum), though similar, are not identical. Between these strains, significant differences are found in expression of, and antibody responses to some candidate virulence factors, suggesting the existence of functional genetic differences between isolates. The Chicago strain genome was therefore sequenced and compared to the Nichols genome, available since 1998. Initial comparative analysis suggested the presence of 44 single nucleotide polymorphisms (SNPs), 103 small (≤3 nucleotides) indels, and 1 large (1204 bp) insertion in the Chicago genome with respect to the Nichols genome. To confirm the above findings, Sanger sequencing was performed on most loci carrying differences using DNA from Chicago and the Nichols strain used in the original T. pallidum genome project. A majority of the previously identified differences were found to be due to errors in the published Nichols genome, while the accuracy of the Chicago genome was confirmed. However, 20 SNPs were confirmed between the two genomes, and 16 (80.0%) were found in coding regions, with all being of non-synonymous nature, strongly indicating action of positive selection. Sequencing of 16 genomic loci harboring SNPs in 12 additional T. pallidum strains, (SS14, Bal 3, Bal 7, Bal 9, Sea 81-3, Sea 81-8, Sea 86-1, Sea 87-1, Mexico A, UW231B, UW236B, and UW249C), was used to identify "Chicago-" or "Nichols -specific" differences. All but one of the 16 SNPs were "Nichols-specific", with Chicago having identical sequences at these positions to almost all of the additional strains examined. These mutations could reflect differential adaptation of the Nichols strain to the rabbit host or pathoadaptive mutations acquired during human infection. Our findings indicate that SNPs among T. pallidum strains emerge under positive selection and, therefore, are likely to be functional in nature.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22720110 PMCID: PMC3373638 DOI: 10.1371/journal.pntd.0001698
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
T. pallidum strains used in this study.
| Strain name | Source | Location | Year of isolation | Provided by |
| Chicago | Primary chancre | Chicago | 1951 | Paul Hardy and Ellen Nell (Johns Hopkins University, Baltimore, MD) |
| Nichols (Seattle) | Cerebrospinal fluid | Washington DC | 1912 | James N. Miller (University of California, Los Angeles) |
| Nichols (Farmington) | Cerebrospinal fluid | Washington DC | 1912 | Justin D. Radolf (University of Connecticut Health Center) |
| Nichols (Dallas) | Cerebrospinal fluid | Washington DC | 1912 | Michael V. Norgard (University of Texas, Southwestern Medical Center) |
| Nichols (UCLA) | Cerebrospinal fluid | Washington DC | 1912 | David Blanco (University of California, Los Angeles) |
| Nichols (Houston) | Cerebrospinal fluid | Washington DC | 1912 | Steven J. Norris (University of Texas, Health Sciences Center, Houston) |
| Mexico A | Primary chancre | Mexico | 1953 | Paul Hardy and Ellen Nell |
| Yobs | Lymph node | Atlanta | 1965 | Paul Hardy and Ellen Nell |
| Sea81-4 | Primary chancre | Seattle | 1980 | Isolated by Sheila A. Lukehart (University of Washington, Seattle WA) |
| Sea81-3 | Cerebrospinal fluid | Seattle | 1981 | Isolated by Sheila A. Lukehart (University of Washington, Seattle, WA) |
| Sea81-8 | Cerebrospinal fluid | Seattle | 1981 | Isolated by Sheila A. Lukehart (University of Washington, Seattle, WA) |
| Sea86-1 | Cerebrospinal fluid | Seattle | 1986 | Isolated by Sheila A. Lukehart (University of Washington, Seattle WA) |
| Sea87-1 | Cerebrospinal fluid | Seattle | 1987 | Isolated by Sheila A. Lukehart (University of Washington, Seattle, WA) |
| Bal-3 | Blood | Baltimore | Not known | Paul Hardy and Ellen Nell |
| Bal-7 | Cerebrospinal fluid | Baltimore | 1976 | Paul Hardy and Ellen Nell |
| Bal-9 | Liver | Baltimore | Not known | Paul Hardy and Ellen Nell |
| Street Strain 14 (SS14) | Skin | Atlanta | 1977 | Sexually Transmitted Disease Laboratory Program, Centers for Disease Control, Atlanta (GA) |
| UW104B | Blood | Seattle | 2002 | Christina Marra (University of Washington, Seattle, WA) |
| UW231B | Blood | Seattle | 2004 | Christina Marra (University of Washington, Seattle, WA) |
| UW236B | Blood | Seattle | 2004 | Christina Marra (University of Washington, Seattle, WA) |
| UW249C | Cerebrospinal fluid | Seattle | 2004 | Christina Marra (University of Washington, Seattle, WA) |
Nichols isolates currently propagated at these locations.
Mutations within ORFs and intergenic regions (IGR).
| Locus Tagin Chicago | Locus Tagin Nichols | Product | Strand | Amino acid change and position in the ORF Nichols→Chicago | Nucleotide change Nichols→Chicago and genomic position | Length in Chicago | Length in Nichols |
| TPChic0051 | TP0051 | Peptide chain release factor 1, PrfA | F | S (104)→P (104) | T (59887)→C (59897) | 351 | 351 |
| TPChic0076 | TP0076 | Sugar ABC transporter, permease protein | F | L (198)→V (198) | C (83983)→G (83994) | 276 | 273 |
| TPChic0265 | TP0265 | Branched-chain amino acid transport system II carrier protein, BrnQ | R | P (230)→L (230) | G (277587)→A (278806) | 455 | 455 |
| TPChic0299 | TP0300 | Ribose/galactose ABC transporter, ATP-binding protein | F | D (320)→G (389) | A (313326)→G (314550) | 585 | 74 |
| TPChic0430 | TP0430 | V-type ATP synthase subunit K | F | F (70)→L (70) | T (458815)→C (460043) | 140 | 140 |
| TPChic0433 | TP0433 | Hypothetical protein | F | V (232)→A (232) | T (461227)→C (462456) | 256 | 256 |
| TPChic0434 | TP0434 | Hypothetical protein | F | L (31)→V (17) | C (461335)→G (462564) | 213 | 227 |
| TPChic0443 | TP0443 | Conserved hypothetical protein | F | T (120)→A (103) | A (469810)→G (471039) | 267 | 284 |
| TPChic0488 | TP0488 | Methyl-accepting chemotaxis protein | F | V (295)→A (422) | T (522138)→C (523372) | 718 | 845 |
| TPChic0584 | TP0584 | Conserved hypothetical protein | F | A (314)→T (314) | G (634785)→A (636027) | 469 | 469 |
| TPChic0621 | TP0621 | TprJ protein | R | F (572)→V (567) | A (673514)→C (674758) | 755 | 758 |
| TPChic0746 | TP0746 | Pyruvate, phosphate dikinase | R | E (501)→G (501) | T (810274)→C (811525) | 901 | 901 |
| TPChic0748 | TP0748 | Cytoplasmic filament protein A | R | C (31)→Y (31) | C (814853)→A (816100) | 678 | 678 |
| TPChic0790 | TP0790 | Antibiotic transport protein, putative | R | C (168)→R (139) | A (857221)→G (858469) | 859 | 888 |
| TPChic0924 | TP0924 | Tex protein | F | Y (529)→STOP | C (1004582)→A (1005844) | 545 | 788 |
| TPChic0978 | TP0978 | LspA signal peptidase II | F | F (195)→L (184) | T (1062062)→G (1063328) | 186 | 197 |
[6].
F indicates the forward or plus (+) strand; R indicates the reverse or minus (−) strand.
Diversity in size reflects differences between the annotation of the Chicago genome and the annotation of the Nichols genome [6].
SNP analysis in T. pallidum strains.
| Locus Tag in Chicago | Locus Tag in Nichols | Product/Functional Category | Amino acid change and position in the ORF Nichols→Chicago | Chicago, Bal 9, Sea 86-1, Sea 81-3, Mexico A, Sea 81-8, Bal 3, UW231B, UW236B, UW249C, SS14, Sea 87-1 | Bal 7 | Nichols |
| TPChic0051 | TP0051 | Peptide chain release factor 1 (PrfA)/Transcriptional regulator- DNA binding protein | S (104)→P (104) | C | C | N |
| TPChic0076 | TP0076 | Sugar ABC transporter, permease protein/Membrane-Transport | L (198)→V (198) | C | C | N |
| TPChic0265 | TP0265 | Amino acid ABC transporter, permease protein (BfnQ)/Membrane-Transport | P (230)→L (230) | C | C | N |
| TPChic0299 | TP0300 | Ribose, galactose ABC transporter, ATP-binding protein/ATP binding- Transport | D (320)→G (389) | C | C | N |
| TPChic0430 | TP0430 | V-type ATPase, subunit K (AtpK-1)/Membrane-Transport | F (70)→L (70) | C | C | N |
| TPChic0433 | TP0433 | Hypothetical protein/Uncharacterized | V (232)→A (232) | C | C | N |
| TPChic0434 | TP0434 | Hypothetical protein/Uncharacterized | L (31)→V (17) | C | C | N |
| TPChic0443 | TP0443 | Conserved hypothetical protein/Uncharacterized | T (120)→A (103) | C | C | N |
| TPChic0488 | TP0488 | Methyl-accepting chemotaxis protein (Mcp2-1)/Membrane-Signal transducer | V (295)→A (422) | C | C | N |
| TPChic0584 | TP0584 | Conserved hypothetical protein/Uncharacterized | A (314)→T (314) | C | C | N |
| TPChic0621 | TP0621 | Tpr protein J/Nucleotide binding | F (572)→V (567) | C | N | N |
| TPChic0746 | TP0746 | Pyruvate, phosphate dikinase/ATP binding | E (501)→G (501) | N | N | N |
| TPChic0748 | TP0748 | Cytoplasmic filament protein A (CfpA)/Cytoplasmic protein | C (31)→Y (31) | C | C | N |
| TPChic0790 | TP0790 | Antibiotic transport protein, putative/Membrane-Transport | C (168)→R (139) | C | C | N |
| TPChic0924 | TP0924 | Toxin expression protein (Tex)/RNA binding-Hydrolase activity | Y (529)→STOP | C | C | N |
| TPChic0978 | TP0978 | Signal peptidase II (LspA)/Membrane-Proteolysis | F (195)→L (184) | C | C | N |
[6].
C = identical to Chicago.
N = identical to Nichols.
Differences within ORFs and intergenic regions (IGR) not confirmed by DT-sequencing in the Nichols strain.
| Locus Tag (Chicago) | Locus tag | Product | Strand | Nucleotide difference(s) | ORF coordinates in Chicago | Length in Chicago (aa) | Length in Nichols | Confirmed by DT-sequencing in Chicago |
| TPChic0065 | TP0065 | Hypothetical protein | F | G insertion | 71726:72313 | 195 | 195 | Yes |
| TPChic0126 | TP0126 | Hypothetical protein | R | C insertion | 147627:148310 | 227 | 291 | No |
| TPChic0127 | TP0127 | Hypothetical protein | F | GG deletion | 149834:150193 | 119 | 229 | No |
| TPChic0132 | TP0132 | Hypothetical protein | R | G insertion | 154158:154352 | 64 | 69 | No |
| TPChic0135 | TP0135 | Hypothetical protein | R | C insertion | 156755:157243 | 162 | 313 | Yes |
| TPChic0177 | TP0177 | Hypothetical protein | R | G deletion | 195519:196931 | 470 | 436 | No |
| TPChic0208 | TP0208 | Preprotein translocase subunit, SecY | F | G insertion | 214955:216274 | 439 | 450 | No |
| TPChic0217 | TP0217 | Chaperone protein, DnaJ | F | G insertion | 221992:223242 | 416 | 324 | No |
| TPChic0248 | TP0248 | Hypothetical protein | F | C deletion | 262114:262587 | 157 | 134 | No |
| TPChic0263 | TP0236 | Hypothetical protein | F | G insertion | 275427:277034 | 535 | 594 | Yes |
| TPChic0279 | TP0279 | Bifunctional cytidylate kinase/ribosomal protein S1 | F | G deletion | 295567:298056 | 829 | 863 | Yes |
| TPChic0329 | TP0329 | Serine hydroxymethyl transferase, GlyA | F | G deletion | 351513:353054 | 5135 | 574 | Yes |
| TPChic0348 | TP0348 | Hypothetical protein | R | G insertion | 374026:375150 | 374 | 469 | Yes |
| TPChic0376 | TP0376 | Hypothetical protein | R | C insertion | 402077:402973 | 298 | 301 | Yes |
| TPChic0377 | TP0377 | Flagellar basal body-associated protein, FliL | F | G deletion | 403076:403555 | 513 | 574 | Yes |
| TPChic0397 | TP0397 | Flagellar basal-body rod protein, FlgC | F | G deletion | 423486:423944 | 152 | 129 | Yes |
| TPChic0407 | TP0407 | Hypothetical protein | F | C deletion | 432163:432843 | 224 | 226 | No |
| TPChic0415 | TP0415 | Hypothetical protein | F | G insertion | 443468:443950 | 160 | 177 | Yes |
| TPChic0424 | TP0424 | Hypothetical protein | F | G deletion | 453081:453695 | 204 | 232 | Yes |
| TPChic0425 | TP0425 | Hypothetical protein | F | GG insertion | 453695:454240 | 181 | 86 | No |
| TPChic0477 | TP0477 | Hypothetical protein | R | C insertion | 507750:508454 | 234 | 241 | No |
| TPChic0479 | TP0479 | Hypothetical protein | R | C insertion | 510135:510692 | 185 | 224 | Yes |
| TPChic0486 | TP0486 | Hypothetical protein | R | T deletion | 518618:520213 | 581 | 484 | Yes |
| TPChic0491 | TP0491 | Hypothetical protein | F | G insertion | 526253:527308 | 351 | 344 | No |
| TPChic0520 | TP0520 | Hypothetical protein | R | C insertion | 562151:562975 | 274 | 151 | Yes |
| TPChic0535 | TP0535 | Conserved hypothetical protein | R | C insertion | 579388:579687 | 99 | 70 | No |
| TPChic0536 | TP0536 | Hypothetical protein | R | 2 insertions (C, C) | 579729:579968 | 79 | 133 | No |
| TPChic0555 | TP0555 | Serine/threonine sodium symporter | R | G insertion | 602163:603344 | 393 | 396 | Yes |
| TPChic0594 | TP0594 | Hypothetical protein | R | C deletion | 647488:648165 | 225 | 202 | Yes |
| TPChic0597 | TP0597 | Hypothetical protein | R | 4 insertions (G, A, A, A) | 650054:652117 | 687 | 605 | No |
| TPChic0618 | TP0618 | Hypothetical protein | R | C deletion | 671051:671440 | 129 | 119 | No |
| TPChic0651 | TP0651 | Hypothetical protein | R | 2 insertions (G, C) | 714371:716938 | 803 | 855 | Yes |
| TPChic0731 | TP0731 | Hypothetical protein | F | G insertion | 797978:798598 | 206 | 236 | Yes |
| TPChic0762 | TP0762 | Conserved hypothetical protein | F | C insertion | 826405:827505 | 366 | 393 | Yes |
| TPChic0830 | TP0830 | Hypothetical protein | F | C insertion | 897880:898665 | 261 | 280 | No |
| TPChic0856 | TP0856 | Hypothetical protein | F | C insertion | 934375:935559 | 394 | 325 | No |
| TPChic0993 | TP0993 | Hypothetical protein | R | G insertion | 1078502:1079527 | 341 | 318 | No |
| TPChic0995 | TP0995 | Cyclic nucleotide binding protein | F | G insertion | 1080774:1082108 | 444 | 428 | No |
| TPChic0976 | TP0976 | Conserved hypothetical protein | F | G deletion | 1060425:1061900 | 491 | 459 | No |
| TPChic1031 | TP1031 | TprL protein | F | 2 insertions (C, G) | 1125339:1127168 | 609 | 514 | No |
[6].
F indicates the forward or plus (+) strand; R indicates the reverse or minus (−) strand. fliG1, TPChic0126, and tprJ are located in the minus strand.
The Chicago ORF to the left encompasses these Nichols ORFs.
Figure 1TPChic0899 sequence alignment in T. pallidum isolates.
The single artifactual G deletion in the Nichols strain results in a frameshift and puts in frame the TGA triplet resulting in a premature stop codon and two ORFs (TP0899 and TP0900) was not confirmed in either the several Nichols lineages (Seattle, Dallas, Houston, UCLA, and Farmington) and in the eleven other non-Nichols T. pallidum isolates examined. Amino acid position is indicated according to the Chicago strain genome annotation. Nichols (Gen) and SS14 (Gen) refers to the genome sequences already available for these strains in GenBank (accession numbers are NC_000919 and CP000805.1, respectively).
Figure 2tex gene sequence alignment in T. pallidum isolates.
The C→A transversion that generates a premature TAA (stop) codon in the TPChic0924 gene was found in eleven other non-Nichols T. pallidum isolates. No truncating mutation was found in the tex gene from the Nichols isolate currently propagated in our laboratory (Nichols Seattle), and in the Nichols strain used in the orginal T. pallidum genome project (Houston) [6], or in Nichols strains obtained from other laboratories (Dallas, UCLA, and Farmington). Amino acid position is assigned according to the published Nichols strain genome annotation [6]. Nichols (Gen) refers to the genome sequences already available for this strain in GenBank (accession number is NC_000919).