| Literature DB >> 31375067 |
Helisson Faoro1,2,3, Willian K Oliveira4,5, Vinicius A Weiss6,4, Michelle Z Tadra-Sfeir6, Rodrigo L Cardoso6, Eduardo Balsanelli6, Liziane C C Brusamarello-Santos6, Doumit Camilios-Neto6,7, Leonardo M Cruz6, Roberto T Raittz4, Ana C Q Marques8, John LiPuma9, Cyntia M T Fadel-Picheth8, Emanuel M Souza6, Fabio O Pedrosa10.
Abstract
BACKGROUND: Herbaspirillum seropedicae is an environmental β-proteobacterium that is capable of promoting the growth of economically relevant plants through biological nitrogen fixation and phytohormone production. However, strains of H. seropedicae have been isolated from immunocompromised patients and associated with human infections and deaths. In this work, we sequenced the genomes of two clinical strains of H. seropedicae, AU14040 and AU13965, and compared them with the genomes of strains described as having an environmental origin.Entities:
Keywords: Clinical isolates; Genome comparison; Genomic adaptation; Herbaspirillum seropedicae; LPS substitution
Year: 2019 PMID: 31375067 PMCID: PMC6679464 DOI: 10.1186/s12864-019-5982-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Average Nucleotide Identity (ANI) among species and strains of the genus Herbaspirillum. The ANI was calculated using the pyani program after blastn alignment. Only regions present in all genomes were used in the ANI calculation. The dendrogram directly reflects the degree of identity between genomes. An ANI above 95% between two genomes is an indication that they belong to the same species
Fig. 2Circular comparative view of H. seropedicae strain genomes. Predicted proteins of strains Z67, WT00C, AU14040 and AU13965 were compared to strain SmR1, a predicted reference proteome, using blastp. The inner ring represents the SmR1 genome GC% content. The colored rings represent proteins of other strains that share at least 90% identity with SmR1 proteins. Hallmark regions of the SmR1 genome are presented in the outer ring in green
Fig. 3Gene clusters related to nitrogen fixation and plant association in H. seropedicae. The genomes of strains Z67, WT00C, AU14040 and AU13965 were aligned through blastn and visualized using the ACT tool. (a) The nitrogen fixation gene cluster of the SmR1 strain is located between a gloA gene duplication. This gene set is absent in the AU14040, AU13965 and WT00C genomes, in which only one copy of gloA was found. The upstream region is also absent and flanked by the fabG gene. (b) The type 3 secretion system (T3SS) is also absent in AU14040 and AU13965. In both cases, the genes flanking the absent genomic regions are conserved between strains
Fig. 4Genome sequence and structure comparison between H. seropedicae strains. Dotplot alignment of the six-frame translated genome sequences of strains Z67, WT00C, AU14040 and AU13965 against the SmR1 reference genome using MUMMER. The dotted line with slope equal to 1 represents an undisturbed segment of conservation between the two sequences. The closer a plot is to an imaginary line f(x) = x, the fewer macroscopic differences exist between the two sequences
Fig. 5Development of the core vs pan genome. Analysis of the core and pan genomes was performed using strain SmR1 as a reference. Starting with the reference genome, the core and pan genome sizes were calculated and plotted with each new added genome based on the most frequent SRV score ratio (master SRV). This value is used to filter blastp hits within the database. Only hits with SRV values greater than or equal to the master are included
Fig. 6Comparison of pan genome orthologs. (a) Venn diagram comparing pan genome proteins between clinical and environmental strains. The exclusive sets include proteins that appeared in both or only in one genome. Proteins in the intersection set include proteins that appear in at least one clinical genome and one environmental genome. The master SRV cutoff used for classification was SRV = 33. (b) Functional classification of pan genome exclusive proteins of the clinical and environmental strains. Classification was made according to KEGG categories through the algorithm BlastKOALA
Fig. 7Comparison of the lipopolysaccharide (LPS) biosynthesis clusters of H. seropedicae strains. In all genomes, a tRNAPhe gene at the 3′ end and orthologous genes at the 5′ end flank the clusters. The names in the strain SmR1 cluster correspond to the genomic annotations. Orthologous genes relative to the SmR1 strain are presented in the same colors with the level of identity below. The GC% frame plot of each region is above the genes. The black and gray arrows represent genes exclusive to strains AU1404 and AU13965, respectively. Featured in the red box are the Neu5Ac biosynthesis genes (neuABC) in strain AU14040