| Literature DB >> 35602040 |
Ruimei Geng1, Lirui Cheng1, Changdai Cao2, Zhengwen Liu1, Dan Liu1, Zhiliang Xiao1, Xiuming Wu1, Zhenrui Huang3, Quanfu Feng1, Chenggang Luo1, Zhiqiang Chen1, Zhenchen Zhang3, Caihong Jiang1, Min Ren1, Aiguo Yang1.
Abstract
Ralstonia solanacearum species complex (RSSC) is a diverse group of plant pathogens that attack a wide range of hosts and cause devastating losses worldwide. In this study, we conducted a comprehensive analysis of 131 RSSC strains to detect their genetic diversity, pathogenicity, and evolution dynamics. Average nucleotide identity analysis was performed to explore the genomic relatedness among these strains, and finally obtained an open pangenome with 32,961 gene families. To better understand the diverse evolution and pathogenicity, we also conducted a series of analyses of virulence factors (VFs) and horizontal gene transfer (HGT) in the pangenome and at the single genome level. The distribution of VFs and mobile genetic elements (MGEs) showed significant differences among different groups and strains, which were consistent with the new nomenclatures of the RSSC with three distinct species. Further functional analysis showed that most HGT events conferred from Burkholderiales and played a great role in shaping the genomic plasticity and genetic diversity of RSSC genomes. Our work provides insights into the genetic polymorphism, evolution dynamics, and pathogenetic variety of RSSC and provides strong supports for the new taxonomic classification, as well as abundant resources for studying host specificity and pathogen emergence.Entities:
Keywords: Ralstonia solanacearum; diversity; genetic; pangenome; pathogenicity
Year: 2022 PMID: 35602040 PMCID: PMC9121018 DOI: 10.3389/fmicb.2022.854792
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 6.064
Figure 1Taxonomic classification of Ralstonia solanacearum species complex (RSSC) based on comparison at the whole genome level. Phylogenetic tree based on blast all vs. all of 131 RSSC strains, which could be divided into three species, and the species names were labeled on the front of each branch (numbers I, II, and III were marked in the right). Three stains belong to the subspecies of R. syzygii subsp. indonesiensis were marked with three different colors near their names.
Figure 2Pangenome shape and function of RSSC. (A) The pangenome flower plot showing the core genome and different unique gene for each strain, the different colors correspond with the different groups in the core genome tree. (B) Gene accumulation curves for the pangenome, the cumulative sizes were calculated by selecting strains without replacement in random order 1,000 times. (C) A histogram of the prevalence of different gene families in the pangenome. A total of 30,763 non-redundant gene families identified in 131 genomes are based on their frequency distribution. Three gene categories are clearly distinguished, highlighting the unique genes (genes that only exist in one strain), core genes (gene families are present in more than 120 strains), and accessory genes (gene families that exhibit variable frequencies). (D) Distribution of COG categories in the core, accessory, and unique genes.
Figure 3Comparison of two phylogenetic trees constructed using core genome and whole genome, respectively. (A) Core genome tree was based on 1,129 core genes of 131 strains, and different cores represent three different species, which was linked with that in the whole genome tree (right). (B) Whole genome tree could also be divided into three groups, the colors linked with that in core genome tree (left). Bidirectional arrows in the middle represents the same species in these two trees.
Figure 4The pathogenicity and evolution of RSSC. (A) Distribution of virulence factor (VF), TXSS, horizontal gene transfer (HGT), insertion sequence (IS), and Phages in each strain of RSSC. The different groups were associated with that in the core genome tree (Figure 3A). (B) Distribution of GO functions in the core genome (red) and accessory genome (green). *t test p < 0.05; **t test p < 0.01. (C) Distribution of clusters of orthologous groups of proteins (COG) categories for HGT gene families.