| Literature DB >> 29971217 |
Luciane A Santos1,2, Haritha Adhikarla1, Xiting Yan3, Zheng Wang3, Derrick E Fouts4, Joseph M Vinetz5, Luiz C J Alcantara2, Rudy A Hartskeerl6, Marga G A Goris6, Mathieu Picardeau7, Mitermayer G Reis2, Jeffrey P Townsend3, Hongyu Zhao3, Albert I Ko1,2, Elsio A Wunder1,2.
Abstract
Leptospirosis is a worldwide zoonosis, responsible for more than 1 million cases and 60,000 deaths every year. Among the 13 pathogenic species of the genus Leptospira, serovars belonging to L. interrogans serogroup Icterohaemorrhagiae are considered to be the most virulent strains, and responsible for majority of the reported severe cases. Serovars Copenhageni and Icterohaemorrhagiae are major representatives of this serogroup and despite their public health relevance, little is known regarding the genetic differences between these two serovars. In this study, we analyzed the genome sequences of 67 isolates belonging to L. interrogans serovars Copenhageni and Icterohaemorrhagiae to investigate the influence of spatial and temporal variations on DNA sequence diversity. Out of the 1072 SNPs identified, 276 were in non-coding regions and 796 in coding regions. Indel analyses identified 258 indels, out of which 191 were found in coding regions and 67 in non-coding regions. Our phylogenetic analyses based on SNP dataset revealed that both serovars are closely related but showed distinct spatial clustering. However, likelihood ratio test of the indel data statistically confirmed the presence of a frameshift mutation within a homopolymeric tract of lic12008 gene (related to LPS biosynthesis) in all the L. interrogans serovar Icterohaemorrhagiae strains but not in the Copenhageni strains. Therefore, this internal indel identified can genetically distinguish L. interrogans serovar Copenhageni from serovar Icterohaemorrhagiae with high discriminatory power. To our knowledge, this is the first study to identify global sequence variations (SNPs and Indels) in L. interrogans serovars Copenhageni and Icterohaemorrhagiae.Entities:
Keywords: Copenhageni; Icterohaemorrhagiae; Indels; Leptospira; SNPs; leptospirosis; phylogeny; whole-genome sequencing
Mesh:
Substances:
Year: 2018 PMID: 29971217 PMCID: PMC6018220 DOI: 10.3389/fcimb.2018.00193
Source DB: PubMed Journal: Front Cell Infect Microbiol ISSN: 2235-2988 Impact factor: 5.293
Figure 1Pipeline process to identify SNPs and Indels. The flowchart represents the pipelines employed for calling SNPs (Stampy and Samtools) and Indels (CLC).
Classification and total number of identified mutations among isolates of L. interrogans serovars Copenhageni and Icterohaemorrhagiae.
| SNPs | Synonymous | 258 | NA | 258 |
| Non-synonymous | 538 | NA | 538 | |
| Total | 796 | 276 | 1,072 | |
| Indels | Insertion | 84 | 41 | 125 |
| Deletion | 107 | 26 | 133 | |
| Total | 191 | 67 | 258 |
NA, Not applied.
Identification of genes showing multiple SNPs and Indels in L. interrogans serovars Copenhageni and Icterohaemorrhagiae.
| SNPs | Hypothetical protein | 17 | 1 | 3498112 | 9,423 | |
| Cytoplasmic protein | 8 | 1 | 590325 | 8,364 | ||
| Indels | Adenylate/guanylate | 6 | 1 | 1085410 | 1,404 | |
| Hypothetical protein | 4 | 1 | 819364 | 432 | ||
| Acriflavin resistance | 4 | 1 | 3673377 | 3,312 |
Total number of variants in corresponding gene.
Chromosome.
Gene position based on the L. interrogans serovar Copenhageni strain Fiocruz L1-130 reference genome (Accession numbers: .
Figure 2Phylogenetic relationship among Leptospira interrogans serovar Copenhageni and Icterohaemorrhagiae strains. Maximum likelihood tree depicting phylogenetic relationship among Leptospira interrogans serovars Copenhageni and Icterohaemorrhagiae from different geographical locations. Phylogeny constructed using the SNPs sites. Asterisk (*) represents clusters with statistical Bootstrap support higher than 70%. The sequences from serovar Icterohaemorrhagiae are underlined. L. interrogans serovar Lai was used as an outgroup for this analysis. The strain sequences are colored by country of isolation following the legend. The clusters numbers are related to country of isolation: 1A−1F (Brazil), 2 (Hawaii, US), 3 (French Polynesia), 4 (Egypt, Netherlands and Denmark), 5 (Japan), 6 (Guadalupe and Slovenia), and 7 (Colombia).
Correlation of isolates based on the geographical clades from phylogenetic analysis.
| 1A | LV239, LV3726, LV2750, LV2766, LV4278, LV4361, LV4497 | Brazil |
| 1B | LV3213, R83, LV4157, R85 | Brazil |
| 1C | LV2948, LV2973 | Brazil |
| 1D | LV4102, LV4498 | Brazil |
| 1E | L1-130, LV3737 | Brazil |
| 1F | LV4211, R62 | Brazil |
| 2 | 2007005490, 2002009669 | Hawaii |
| 3 | 200903008, 200701182 | French Polynesia |
| 4 | 2006006972, 200600682, M20, Wijnberg | Egypt, Denmark, Netherlands |
| 5 | Ictero1, Yamamoto, Shibaura | Japan |
| 6 | 200801909, 201000458, 201100516, 200700457, 200201190 | Guadeloupe, Slovenia |
| 7 | CIDEIM R006, CIDEIM R107, CIDEIM R081, CIDEIM R103 | Colombia |
Details of all the isolates and countries of origin are listed in Table .
Figure 3Genomic organization and domain architecture of gene lic12008. (A) Organization of lic12008 loci in L. interrogans serovars. (B) Domain architecture of LIC12008 protein. (C) Details of amino acid fragment (1-55aa) showing the sequence differences in L. interrogans Copenhageni (I) and L. interrogans Icterohaemorrhagiae (II) serovars.