| Literature DB >> 27344187 |
Jun Sun1, Zhefeng Meng2, Kaiqi Wu3, Biao Liu3, Sufang Zhang4, Yudan Liu4, Yuezhu Wang5, Huajun Zheng5, Jian Huang5,6, Pingyu Zhou1,4.
Abstract
Syphilis is a systemic sexually transmitted disease caused by Treponema pallidum ssp. pallidum (TPA). The origin and genetic background of Chinese TPA strains remain unclear. We identified a total of 329 single-nucleotide variants (SNVs) in eight Chinese TPA strains using next-generation sequencing. All of the TPA strains were clustered into three lineages, and Chinese TPA strains were grouped in Lineage 2 based on phylogenetic analysis. The phylogeographical data showed that TPA strains originated earlier than did T. pallidum ssp. pertenue (TPE) and T. pallidum ssp. endemicum (TPN) strains and that Chinese TPA strains might be derived from recombination between Lineage 1 and Lineage 3. Moreover, we found through a homology modeling analysis that a nonsynonymous substitution (I415F) in the PBP3 protein might affect the structural flexibility of PBP3 and the binding constant for substrates based on its possible association with penicillin resistance in T. pallidum. Our findings provide new insight into the molecular foundation of the evolutionary origin of TPA and support the development of novel diagnostic/therapeutic technology for syphilis.Entities:
Keywords: NGS; Pathology Section; SNVs; Treponema pallidum; evolution; syphilis
Mesh:
Substances:
Year: 2016 PMID: 27344187 PMCID: PMC5189996 DOI: 10.18632/oncotarget.10154
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Background information and sequencing statistics for genome of T. pallidum
| Strain | Origin Place | Year of isolation | Raw data | Clean data | Ratio | unique reads | unique ratio(%) | Coverage (%) | Mean depth | SNV number | Count of treponemes |
|---|---|---|---|---|---|---|---|---|---|---|---|
| SHC-0 | Shanghai | 2014 | 40416294 | 37469842 | 92.71% | 33483604 | 89.36% | 99.99% | 1899.73 | 286 | 1×107 |
| SHD-R | Shanghai | 2014 | 28323906 | 26245436 | 92.66% | 21129166 | 80.51% | 99.99% | 2253.37 | 282 | 1×107 |
| SHE-V | Shanghai | 2014 | 26352892 | 24296460 | 92.20% | 23352990 | 96.12% | 99.99% | 413.53 | 277 | 1×107 |
| SHG-I2 | Shanghai | 2014 | 23141660 | 20869470 | 90.18% | 20312002 | 97.33% | 99.99% | 163.98 | 279 | 1×107 |
| B3 | Shanghai | 2015 | 26527340 | 26340014 | 99.29% | 26046390 | 98.89% | 99.99% | 55.28 | 243 | 1×106 |
| C3 | Shanghai | 2015 | 26746096 | 26581754 | 99.39% | 25883586 | 97.37% | 99.99% | 487.29 | 268 | 7×107 |
| K3 | Shanxi | 2015 | 25504866 | 25293142 | 99.17% | 24091738 | 95.25% | 99.99% | 796.43 | 284 | 1×107 |
| Q3 | Anhui | 2015 | 24819110 | 24649998 | 99.32% | 20028148 | 81.25% | 99.99% | 2047.53 | 281 | 1×107 |
Figure 1Flowchart for analysis of the data derived from next-generation sequencing
Figure 2Characteristics of SNVs in different TPA lineages
A. Neighbor-joining phylogeny analysis of 697 SNVs defining TPA lineages. B. Venn diagram showing the number of SNVs in different TPA lineages. C. Percentage of mutation spectra of SNVs in different TPA lineages. D. Percentage of nonsynonymous and synonymous SNVs in different lineages.
Number of SNVs in the hree TPA lineages
| Lineage 1 | Lineage 2 | Lineage 3 | |
|---|---|---|---|
| tprC | 11 | ||
| tprD | 20 | ||
| tprI | 4 | ||
| tprE | 3 | ||
| tpr G | 1 | ||
| tpr K | 4 | 22 | |
| tpr L | 6 | ||
| TPANIC_0136 | 18 | 9 | |
| TPANIC_0548 | 13 | ||
| Hypothetical protein | 1 | 1 | 8 |
| 5 | 44 | ||
| Penicillin-binding protein | 4 | ||
| Methyl-accepting chemotaxis protein (mcp2-1) | 24 | ||
| others | 7 | 8 | 11 |
| 30 | 44 | 150 |
The SNVs of penicillin regulatory Proteins in all Lineages
| Gene | Annotation | SNV | Mutation | Mutation type | Lineage | Strain | SIFT Score |
|---|---|---|---|---|---|---|---|
| penicillin-binding protein (pbp-1) | C537571T | P564L | nonsynonymous | 2,3 | Strain C-0, D-R, E-V, G-I2, B3, C3, K3, Q3 and SS14 | 0.2 | |
| 4-hydroxy-3-methylbut-2-enyl diphosphate reductase/penicillin tolerance protein (lytB) | T592290C | / | synonymous | 2,3 | Strain C-0, D-R, E-V, G-I2, B3, C3, K3, Q3, SS14 and Mexico A | / | |
| T592614C | / | synonymous | 2,3 | Strain C-0, D-R, E-V, G-I2, B3, C3, K3, Q3, SS14 and Mexico A | / | ||
| penicillin-binding protein (pbp-2) | G773571A | A506V | nonsynonymous | 2 | Strain C-0, D-R, E-V, G-I2, B3, C3, K3 and Q3 | 0.03 | |
| penicillin-binding protein (pbp-3) | G824922A | A366T | nonsynonymous | 2 | Strain C-0, and B3 | 0.14 | |
| A825069T | I415F | nonsynonymous | 2 | Strain C-0, D-R, E-V, G-I2, B3, C3 and Q3 | 0.03 | ||
| C825071G | I415M | nonsynonymous | 2 | Strain K3 | 0.17 |
functional SNVs predicted using SIFT (SIFT score ≤0.05 as the deleterious effect).
Figure 3Predication of SNV effects on PBP3 and PBP2 proteins by homology model analysis
A. Ribbon representation of the PBP3 protein as predicted by the Phyre2 Server. Red arrows show the structural alteration caused by the SNVs compared to the wild-type model. B. Ribbon representation of the PBP2 protein as predicted by the Phyre2 Server. Red arrows show the structural alteration caused by the SNVs compared to the wild-type model.
Figure 4Phylogenetic analysis based on the tpr genes and SNVs of T. pallidum
. A. Phylogenetic tree showing clusters of Subfamily I of tpr genes. B. Phylogenetic tree showing clusters of Subfamily II of tpr genes. C. Phylogenetic tree showing clusters of tpr genes based on genomic SNVs after the removal of SNVs from the tprD and tprG genes. The tree was constructed using MrBayes 3.1. Significant linkages using Bayesian phylogenetic inference analysis were considered as probabilities of 100%.
Figure 5Maximum clade credibility trees of tpr family gene sequences and modified SNVs of T. pallidum
Phylogeographical trees were constructed using BEAST V1.6.2. The tree branches are colored according to their respective cluster or branch. The tMRCA of each cluster is labeled at the tree nodes. A. Phylogeographical analysis showing the clusters based on the tprC and tprD gene sequences among treponemal strains. B. Phylogeographical analysis showing clusters based on the genomic SNVs of tpr family genes after removing the SNVs of tprD and tprG genes. tMRCA stands for time to the most recent common ancestor.