| Literature DB >> 30130365 |
Linda Grillová1, Lorenzo Giacani2, Lenka Mikalová1, Michal Strouhal1, Radim Strnadel3, Christina Marra2, Arturo Centurion-Lara2, Lucy Poveda4, Giancarlo Russo4, Darina Čejková5, Vladimír Vašků6, Jan Oppelt7,8, David Šmajs1.
Abstract
Treponema pallidum subsp. pallidum (TPA) is the infectious agent of syphilis, a disease that infects more than 5 million people annually. Since TPA is an uncultivable bacterium, most of the information on TPA genetics comes from genome sequencing and molecular typing studies. This study presents the first complete TPA genome (without sequencing gaps) of clinical isolate (UZ1974), which was obtained directly from clinical material, without multiplication in rabbits. Whole genome sequencing was performed using a newly developed Anti-Treponemal Antibody Enrichment technique combined with previously reported Pooled Segment Genome Sequencing. We identified the UW074B genome, isolated from a sample previously propagated in rabbits, to be the closest relative of the UZ1974 genome and calculated the TPA mutation rate as 2.8 x 10(-10) per site per generation.Entities:
Mesh:
Year: 2018 PMID: 30130365 PMCID: PMC6103504 DOI: 10.1371/journal.pone.0202619
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Workflow of ATAE coupled with PSGS.
Dark-field microscopy, MLST, and determination of TPA DNA copies were performed on the UZ1974 clinical sample taken from a syphilis positive patient. TPA cells were concentrated in the sample using polyclonal antibodies conjugated with biotin, which were bound to streptavidin covered beads. Prior to NGS, whole genome amplification (WGA) was carried out using multiple displacement amplification using phi 29 polymerase; WGA products were purified using QIAEX II beads. The number of TPA DNA copies was monitored using the nested PCR protocol for polA detection [20]. Using the BWA MEM algorithm, the whole set of obtained reads from NGS (Illumina NextSeq 500) was mapped to the human genome reference (hg38), removed, and the rest of the reads were mapped to the TPA reference genome (GenBank Acc. No. CP004011.1).
NGS statistics for the UZ1974 genome obtained using ATAES.
| NGS parameter | UZ1974 |
|---|---|
| Total number of obtained reads after quality control | 154 million |
| Reads mapped to the TPA SS14 genome reference (GenBank Acc. No. CP004011.1) | 198,765 |
| Average coverage depth | 24.76x |
| Median coverage depth | 9.55x |
| Broad coverage of TPA reference genome | 96.73% |
| Mean length of reads; range (bp) | 142.54; 70–150 |
aThe broad coverage calculated from positions with coverage ≥ 3.
Identified SNVs between UZ1974 and SS14 genomes.
| SNV | ORF (Gene) | Gene product | Functional category | Syn/Nonsyn |
|---|---|---|---|---|
| A94901C | TPASS_20085 | PTS family fructose porter component IIA | Transport | Syn |
| G135108C | TPASS_20117 | Tpr protein C | Virulence | Nonsyn (P534A) |
| C174177T | TPASS_20151 | putative NADH dehydrogenase (ubiquinone), subunit RnfD | General metabolism | Nonsyn (V260I) |
| T333559C | IGR | NA | NA | NA |
| G342703A | TPASS_20324 | putative outer membrane protein | Unknown | Nonsyn (A540T) |
| T364888C | TPASS_20341 | UDP-N-acetylmuramate—L-alanine ligase | General metabolism | Nonsyn (L64P) |
| A522907G | TPASS_20488 | methyl-accepting chemotaxis protein | Cell processes | Nonsyn (D195G) |
| C556154T | TPASS_0515 | putative outer membrane protein | Unknown | Nonsyn (R456C) |
| G593294A | TPASS_20548 | putative outer membrane protein | Unknown | Nonsyn (G53R) |
| G593298A | TPASS_20548 | putative outer membrane protein | Unknown | Nonsyn (G53E) |
| A593912G | TPASS_20548 | putative outer membrane protein | Unknown | Nonsyn (K145E) |
| C674219T | TPASS_20620 | Tpr protein I | Virulence | Nonsyn (H46R) |
| A674227C | TPASS_20620 | Tpr protein I | Virulence | Nonsyn (V48G) |
| T674233C | TPASS_20620 | Tpr protein I | Virulence | Nonsyn (E51K) |
| T760092C | TPASS_20691 | segregation and condensation protein ScpA | Cell processes | Nonsyn (K30R) |
| C772846T | TPASS_20705 | bifunctional membrane carboxypeptidase/penicillin-binding protein | General metabolism | Nonsyn (M625V) |
| T773095C | TPASS_20705 | bifunctional membrane carboxypeptidase/penicillin-binding protein | General metabolism | Nonsyn (G708S) |
| T861444C | TPASS_20793 | hypothetical protein | Unknown | Nonsyn (L311R) |
aCoordinates according to the SS14 reference genome (GenBank Acc. No. CP004011.1).
bSynonymous/nonsynonymous amino acid replacement.
cIGR—Intergenic region.
dNA—Not applicable.