| Literature DB >> 29310748 |
Roman Thibeaux1, Gregorio Iraola2, Ignacio Ferrés2, Emilie Bierque1, Dominique Girault1, Marie-Estelle Soupé-Gilbert1, Mathieu Picardeau3, Cyrille Goarant1.
Abstract
Despite recent advances in our understanding of the genomics of members of the genus Leptospira, little is known on how virulence has emerged in this heterogeneous bacterial genus as well as on the lifestyle of pathogenic members of the genus Leptospira outside animal hosts. Here, we isolated 12 novel species of the genus Leptospira from tropical soils, significantly increasing the number of known species to 35 and finding evidence of highly unexplored biodiversity in the genus. Extended comparative phylogenomics and pan-genome analyses at the genus level by incorporating 26 novel genomes, revealed that, the traditional leptospiral 'pathogens' cluster, as defined by their phylogenetic position, can be split in two groups with distinct virulence potential and accessory gene patterns. These genomic distinctions are strongly linked to the ability to cause or not severe infections in animal models and humans. Our results not only provide new insights into virulence evolution in the members of the genus Leptospira, but also lay the foundations for refining the classification of the pathogenic species.Entities:
Keywords: Leptospira; ecology; evolution; genomics; virulence
Mesh:
Year: 2018 PMID: 29310748 PMCID: PMC5857368 DOI: 10.1099/mgen.0.000144
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.Phylogenetic position of the novel species. (a) Circos diagram showing the relationships between leptospiral genomes based on overall genomic relatedness indices (OGRIs). The inner violet ribbons connect pairs of genomes if they share >95 % average nucleotide identity (ANI) and average amino acid identity (AAI). Blocks represent each genome coloured as explained below. The outer highlights show the leptospiral clades. (b) Maximum-likelihood phylogeny for the genus Leptospira based on the core genome alignment. The tree is rooted with Leptonema illini DSM 21528T. The three classic leptospiral clades historically associated with differential pathogenicity are highlighted in red (‘pathogens’), yellow (‘intermediates’) and green (‘saprophytes’). Coloured circles at species labels indicate a public genome from a previously described species (grey), a genome sequenced in this study assigned to a previously described species (blue) or a genome sequenced in this study from a novel species (orange).
Fig. 2.Virulence in animal models and accessory genome topology. (a) Virulence of novel species in experimental challenge infections (n=2 for each strain and animal model). Only the pathogenic strains Leptospira interrogans L495 and Leptospira borgpeterseni B3-13S were recovered from hamster cardiac blood or evidenced from mouse urine and kidney. (b) Tanglegram comparing the topology of the core genome phylogeny (left) and the topology obtained by clustering the genomes using Jaccard distance calculated over the accessory gene patterns (right). On the left, genomes are coloured according to the classic phylogenetic classification (only pathogens and intermediates are shown here). On the right, genomes are coloured according to the new classification based on accessory gene patterns.
Fig. 3.Functional analysis of discriminating accessory genes. (a) Venn diagram showing the Bayesian identification of cluster-defining accessory genes from the pan-genome. (b) Clustering analysis based on Jaccard distances calculated from the presence/absence vectors of cluster-defining genes. (c) Barplots showing the percentage of cluster-defining genes assigned to each COG functional category in each cluster. Statistical significance (P<0.001, test of proportions) is indicated with asterisks.
Fig. 4.Protein domains analysis. (a) Scatterplot showing the first and second discriminant functions obtained from the Discriminant Analysis of Principal Components (DAPC), performed with protein domain abundances extracted from the coding sequences of each genome. Groups are coloured according to the new classification: intermediates (grey), low-virulence pathogens (cyan) and virulent pathogens (purple). (b) Heatmap showing the relationships between genomes obtained by calculating the Bray–Curtis distances from abundance patterns of a subset of highly discriminating domains obtained from the DAPC analysis. Redness indicates increasing domain copy number.
Fig. 5.Analysis of paralogous genes. (a) Linear regression showing the correlation between patristic distances calculated from the core genome phylogeny and Bray–Curtis distances calculated from the abundance patterns of paralogous genes. Dots are coloured according to virulence clusters when both genomes in the pair belong to the same cluster, black dots represent pairs of genomes belonging to different groups. (b) Boxplots showing the distribution of paralogous genes in the three virulence clusters. Asterisks indicate P<0.001 (Mann–Witney U test). (c) Clustering analysis using the Bray–Curtis distances calculated from the abundance patterns of paralogous genes. Horizontal bars indicate the number of paralogous genes per genome and are coloured according to virulence clusters.