Literature DB >> 34850947

VFDB 2022: a general classification scheme for bacterial virulence factors.

Bo Liu1, Dandan Zheng1, Siyu Zhou1, Lihong Chen1, Jian Yang1.   

Abstract

The virulence factor database (VFDB, http://www.mgc.ac.cn/VFs/) is dedicated to presenting a comprehensive knowledge base and a versatile analysis platform for bacterial virulence factors (VFs). Recent developments in sequencing technologies have led to increasing demands to analyze potential VFs within microbiome data that always consist of many different bacteria. Nevertheless, the current classification of VFs from various pathogens is based on different schemes, which create a chaotic situation and form a barrier for the easy application of the VFDB dataset for future panbacterial metagenomic analyses. Therefore, based on extensive literature mining, we recently proposed a general category of bacterial VFs in the database and reorganized the VFDB dataset accordingly. Thus, all known bacterial VFs from 32 genera of common bacterial pathogens collected in the VFDB are well grouped into 14 basal categories along with over 100 subcategories in a hierarchical architecture. The new coherent and well-defined VFDB dataset will be feasible and applicable for future panbacterial analysis in terms of virulence factors. In addition, we introduced a redesigned JavaScript-independent web interface for the VFDB website to make the database readily accessible to all users with various client settings worldwide.
© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 34850947      PMCID: PMC8728188          DOI: 10.1093/nar/gkab1107

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Bacterial pathogens are usually only approximately one micrometer in size with a small genome of several mega base pairs but nonetheless are complex organisms that continuously threaten public health worldwide (1). In recent years, astonishing progress has been made to understand the abilities of bacterial pathogens to adhere, invade, survive and persist in various niches, and the characteristics that enable them to be so successful with a diverse repertoire of virulence factors (VFs). The virulence factor database (VFDB, http://www.mgc.ac.cn/VFs/) summarizes the current knowledge of bacterial VFs and provides significant insights aimed at exposing possible targets for the development of novel treatment and preventive strategies (2). Boosted by the rapid development of next-generation sequencing (NGS) technologies, newly emerging human/environmental microbiome studies have become a hotspot of molecular microbiology in recent years (3,4). Traditional microbiological studies generally rely on laboratory bacterium isolation and cultivation, while recent NGS-based metagenomic analyses have revealed a large number of unculturable bacteria, the majority of which are yet uncharacterized (5). Thus, the follow-up in-depth mining of the panbacterial microbiome data in terms of pathogenesis requires a well-organized reference category of all established bacterial VFs from various known pathogens (6). However, the commonly used nomenclature of current bacterial VFs usually follows the historical conventions of the field. Although most of the VF nomenclatures are indicative and meaningful, some can be vague or even misleading. For example, Escherichia coli fimbriae K88 and K99 were given K denominations just because they were initially wrongly identified as capsular antigens (7). In addition, the independent naming of homologous VFs in different bacteria also leads to considerable confusion and hampers follow-up panbacterial analyses. The existing general classification schemes such as the Gene Ontology (GO) and the Clusters of Orthologous Genes (COGs) are designed to cover entire biological processes and molecular functions of various living organisms (8,9), so they have very few categories associated with bacterial pathogenesis. The recent established Victors database introduced an ontology of host-pathogen interactions (10), but no VF classification information is available yet. Therefore, a well-defined classification scheme with a unified nomenclature is essential for future efficient data mining of bacterial VFs. To better organize and present bacterial VFs in the database, the VFDB proposed an individual simplified classification scheme for each bacterial genus based on field conventions since inception in 2004 (2). Nevertheless, the current VFDB database covers 32 genera of well-studied bacterial pathogens, and so has 32 different classification schemes (Supplementary Table 1). As traditional microbiologists usually focus on the pathogenesis of one or several related bacteria only, the 32 classification schemes are generally independent of each other, although they share certain similarities (11). The absence of a unified classification scheme for all bacterial VFs from various pathogens poses a barrier to future panbacterial metagenomic analyses in terms of pathogenesis. Therefore, we recently proposed a general VF category in the database and reorganized the VFDB dataset accordingly to make it readily applicable for future panbacterial data mining. In addition, we introduced a redesigned JavaScript-independent web interface in the database to keep it easily accessible to all users with various client settings worldwide.

DATABASE UPDATES

VF category: a general classification scheme for bacterial VFs

Because of the confusion caused by the aforementioned different classification schemes, we recently introduced a unified VF category applicable to various bacterial pathogens into the VFDB database. Since the majority of the current classifications of various bacterial VFs have proven very useful and durable for phylogenetic analyses, we have tried to fully follow the existing conventions based on extensive literature mining. Thus, the newly proposed general classification scheme should be instantly familiar to and readily acceptable by traditional bacteriologists. However, unlike the previous scheme proposed in the 2012 release (12), which contains only four major bacterial VF categories (i.e. adhesion and invasion, secretion system, toxin, and iron acquisition), the newly established general classification scheme was designed to be a comprehensive system capable of covering all known bacterial VFs. The VF category was constructed in a tree-like hierarchical architecture with 14 basal categories and >100 subcategories thus far (Table 1). The current basal categories not only fully cover the four major categories previously proposed but also refine all of them to make each category well defined and independent of each other. Adherence is usually the primary step in bacterial pathogenesis (13). Though the adhesion process is always followed by bacterial invasion into host cells by some intracellular pathogens, it is not necessary for many noninvasive bacteria. Hence, for clarity, the current scheme separates adherence and invasion into two independent categories, which consist of 1885 and 391 known bacterial VFs, respectively, in the current VFDB database (Table 1). Bacterial secretion systems are membrane-anchored nanomachines that enable the bacteria to transport various effector proteins either out of the cell into the surrounding niche or directly into the cytoplasm of eukaryotic/prokaryotic cells. A variety of different bacterial secretion systems (e.g. type III, type IV, type VI and type VII secretion systems) have been identified to play a critical role in bacterial pathogenesis (14). Interestingly, recent studies have revealed that the widespread extracellular contractile injection systems are capable of delivering effectors in a contact-independent manner and highly resemble type VI secretion systems (T6SSs) in the headless phage-like overall structure (15). The original category of secretion systems was therefore renamed effector delivery systems to improve their commonality to accommodate additional related bacterial VFs in the future. Generally, bacterial toxins include both exotoxins and endotoxins (16). However, in most contexts, they specifically refer to only exotoxins, which are proteins synthesized inside bacterial cells and then released to outside medium or target cells to produce virulence. In contrast, endotoxins are membrane-associated lipopolysaccharides (LPS) produced by only gram-negative bacteria that can induce a variety of host immune disorders. Given the distinct mechanisms of exotoxins and endotoxins, the previous category of toxins is now replaced by the more specific term ‘exotoxin’, whereas endotoxins are now grouped into an additional category of immune modulation (Table 1). Besides the aforementioned LPS, pathogenic bacteria have developed various mechanisms to control and modulate the host immune system to benefit their survival, such as anti-phagocytosis, disrupting and depleting the complement system, and interfering with the inflammatory signaling pathway (17). Iron is an essential nutrient for the proliferation and pathogenicity of bacterial pathogens. The well-characterized host defense strategy of iron sequestration highlights the crucial role of iron acquisition systems in bacterial pathogenesis. Nevertheless, recent studies show that many other nutritional requirements or metabolic adaptation strategies, such as other metal ions (e.g. zinc and magnesium), certain carbon, nitrogen, and sulfur sources, can also contribute to bacterial virulence, particularly for intracellular pathogens (18). Thus, we proposed a new category of nutritional/metabolic factors to cover and extend the original category of iron acquisition (Table 1).
Table 1.

The newly proposed VF category and the current statistics of known bacterial VFs from 32 genera in the VFDB database (as of October 2021)

CategorySubcategoryaNumber of VFs
Adherence1885
Fimbrial adhesin608
Non-fimbrial adhesin1273
Invasion391
Effector delivery system1242
Type II secretion system41
Type III secretion system228
Type IV secretion system141
Type V secretion system391
Type VI secretion system171
Type VII secretion system270
Motility189
Flagella-mediated motility91
Intracellular motility98
Exotoxin1101
Membrane-acting toxin893
Intracellularly active toxin188
Exoenzyme522
Hyaluronidase33
Kinase24
Coagulase15
Lipase19
Protease244
Nuclease43
Immune modulation1540
Antiphagocytosis180
Complement evasion/Serum resistance201
Immunoglobulin29
Antigen variation20
Apoptosis80
Inflammatory signaling pathway237
Biofilm297
Biofilm formation222
Quorum sensing75
Nutritional/Metabolic factor1912
Metal uptake900
Metabolic adaptation708
Stress survival492
Post-translational modification308
Antimicrobial activity/Competitive advantage86
Regulation1140
Others427

aFor brevity, only the top level subcategories are listed. Full VF category is available from the VFDB database website (http://www.mgc.ac.cn/VFs/VFcategory.htm).

The newly proposed VF category and the current statistics of known bacterial VFs from 32 genera in the VFDB database (as of October 2021) aFor brevity, only the top level subcategories are listed. Full VF category is available from the VFDB database website (http://www.mgc.ac.cn/VFs/VFcategory.htm). In addition to the aforementioned VF categories associated with the previous scheme, some additional categories were further introduced into the current scheme to represent the entire VF repertoires of pathogenic bacteria (Table 1). Exoenzymes are a group of special enzymes functioning outside of the producing cells to degrade macromolecules to small soluble molecules. In contrast to exotoxins that directly target host cells, some bacterial exoenzymes (e.g. hyaluronidase) contribute to virulence by facilitating the intracellular/intercellular spread of pathogens (19). Bacterial motility is an important capability for a successful pathogen to avoid hostile environments and discover useful resources for survival (20). Many pathogenic bacteria employ surface appendages such as flagella and type IV pili to drive motility. In addition, recent studies revealed that some bacterial pathogens are able to form a biofilm, which largely improves their resistance to antimicrobials and host immunities to contribute to the survival, dispersion and colonization of the bacteria (21). Bacterial defenses against various severe host conditions (e.g. acidic, oxidative and nitrosative stresses) are indispensable for successful colonization and disease pathogenesis (22). Further, some host-derived molecules such as antimicrobial peptides are critical components of the host defense system to protect against microorganisms. As a consequence, pathogenic bacteria have evolved diverse systems (e.g. AcrAB of Klebsiella and MtrCDE of Neisseria) to provide resistance to hostile molecules, so as to provide competitive advantages and contribute to bacterial virulence (23). Post-translational modification of some VFs has been demonstrated to be pivotal to activate virulence. For instance, both O-glycosylation of the flagellin structural protein and N-glycosylation of many membrane-associated proteins in Campylobacter jejuni are essential for bacterial adherence, invasion and colonization (24). Moreover, it has been established that for a successful pathogen the tight and fine regulation of each VF is as important as the possession of the VF itself (25). Table 1 lists the 14 basal categories along with the top level of subcategories for brevity, while a complete list of the current VF category is available from the database website (http://www.mgc.ac.cn/VFs/VFcategory.htm). The full enrollment of all known bacterial VFs into the new classification scheme enables easy visualization of the bacterial genome from the perspective of virulence categories. An example pathogenomic map of Pseudomonas aeruginosa available from the VFDB website is shown in Figure 1A, which is more informative than previous illustration using general classification scheme (Supplementary Figure 1A). Since the database presents a dynamic circular map created by the CGView package (26) for each of the 32 genera, it offers an intuitive overview of the molecular diversity of bacterial VFs from various pathogens.
Figure 1.

Examples of visualization of the newly proposed VF category available from the VFDB website. (A) A circular pathogenomic map of Pseudomonas aeruginosa. Each VF-related gene is denoted by a directional triangle and color-coded by the corresponding VF category (key). Gene name is indicated inside/outside of the circle with a hyperlink to the web page of gene details. (B) A linear illustration of the genomic locus encoding T6SS in P. aeruginosa. Each gene is represented by a horizontal arrow in the direction of its coding strand (to scale) and color-coded by the designated core structural component (key). For comparison, the counterparts of the circular and linear illustrations available from the previous VFDB website without VF category are shown in Supplementary Figure 1.

Examples of visualization of the newly proposed VF category available from the VFDB website. (A) A circular pathogenomic map of Pseudomonas aeruginosa. Each VF-related gene is denoted by a directional triangle and color-coded by the corresponding VF category (key). Gene name is indicated inside/outside of the circle with a hyperlink to the web page of gene details. (B) A linear illustration of the genomic locus encoding T6SS in P. aeruginosa. Each gene is represented by a horizontal arrow in the direction of its coding strand (to scale) and color-coded by the designated core structural component (key). For comparison, the counterparts of the circular and linear illustrations available from the previous VFDB website without VF category are shown in Supplementary Figure 1. Many bacterial VFs comprise multiple structural components that need to be subtly assembled into a complex nanomachine for function, such as secretion systems and flagella. Accurate recognition of each structural component is critical for sequence homology-based identification of such complex bacterial VFs (15,27). Further, one recent alignment-independent machine-learning-based bacterial VF classification attempt also relied on a well-curated dataset of orthologous components to achieve high accuracy (28). Therefore, we further provided a unified nomenclature of the core structural components for these complex bacterial VFs to avoid confusion caused by independent naming in different bacteria. Figure 1B is an example of a linear map of the T6SS from P. aeruginosa that is color-coded by the designated core structural components, including the membrane complex (TssJ, TssM and TssL), baseplate structure spike (PAAR and TssI/VgrG) and wedge (TssE, TssF, TssG and TssK), tube-sheath complex (TssA, TssB, TssC and TssD/Hcp) and disassembly ATPase (TssH/ClpV). Since the unified nomenclature is specialized for these VF-related genes, it provides more valuable information across bacterial species than the previous version categorized by the COGs (Supplementary Figure 1B). In addition, we created a tabular list of all characterized effectors (where available) of each secretion system to highlight the activity, targets or interaction partners, role in infection and related literature of each effector (Figure 2). In contrast to the core structural components that are generally conserved among various pathogens, diverse secreted effectors are the most fascinating constituent of secretion systems that actually enable miscellaneous functions, such as host cell invasion, immune evasion, nutrients acquisition and growth competition. Thus, the detailed information in our table of effectors will benefit further biological interpretation of associated sequence analyses.
Figure 2.

The tabular list of detailed information on known effectors of the type III secretion system from Vibrio cholerae. Related publication(s) of each effector are shown after the table.

The tabular list of detailed information on known effectors of the type III secretion system from Vibrio cholerae. Related publication(s) of each effector are shown after the table.

Redesigned basic web interface

Since the previous release in 2019, an enhanced alternative JavaScript-rich web interface was introduced into the VFDB to provide users with improved experiences through a set of interactive online analysis tools (29). To ensure that the database is easily accessible by all users worldwide, including those who have very limited computer/network settings, VFDB continues to maintain the traditional JavaScript-independent web pages as default. Nevertheless, the basic web pages of the VFDB were originally designed in 2004, which have a fixed page width setting of 800 pixels to make all contents visible in most of the former client monitors. This obsolete style severely limits the effective presentation of page contents for modern computers. Therefore, we redesigned the basic web interface with new styles to use percentage-based relative sizing rather than absolute width in pixels, so as to make the contents of each page automatically fit to the screen size of different users to maximize web page usage of various client settings.

DISCUSSION

One of the cutting-edge areas of current molecular microbiology is studies on microbial communities at the ecosystem level (e.g. microbiome). However, NGS-based panbacterial metagenomic data analyses require high-quality and well-curated reference datasets to produce meaningful results. The VFDB database is dedicated to providing the scientific community with a comprehensive knowledge base as well as a versatile analysis platform for bacterial VFs (2,29). The newly introduced general VF classification scheme will facilitate future mining of panbacterial metagenomic data from the aspect of bacterial pathogenesis, since explicit and detailed labelling of the reference dataset will allow both homology-based and machine-learning-based algorithms to efficiently classify the data and make fine-grained statistics or predictions. Moreover, the general VF category with unified nomenclature is valuable to the identification of potential novel VFs from uncultured bacterial genomes, which may in turn contribute to future discovery of unknown bacterial VFs. However, we would like to stress that the current VF category is a preliminary and tentative scheme rather than a complete solution, since the classification of bacterial VFs is indeed complicated. On the one hand, as mentioned above, the current VF category has tried to be consistent with the existing community conventions of bacterial pathogenesis. However, the original researchers might begin to name and classify VFs based on different criteria, such as morphology, serology or functional characteristics. For instance, colonization factor antigens were named after their functions, whereas bundle-forming pili were designated based on their characteristic appearance. As a consequence, the current proposed VF category is actually based on the combination of structural and functional characteristics rather than a simple clear criterion. On the other hand, bacterial pathogens have evolved a multitude of strategies against prokaryotic competitors and eukaryotic hosts to colonize, invade and overcome the host immune response. Empirical evidence has demonstrated that many bacterial VFs are multifaceted proteins with various functions. For example, listeriolysin O (LLO), which is a major virulence factor of the facultative intracellular pathogen Listeria monocytogenes, is involved in multiple stages of the intracellular lifecycle of the bacterium and displays unique characteristics (30). It has long been known that following bacterial internalization into host cells, LLO disrupts the primary vacuole, enabling the bacterium to replicate into the host cell cytosol. LLO is also required for disruption of the newly formed secondary vacuole to release the bacterium into the cytosol, where it repeats its intracellular lifecycle. In addition, LLO can act as an invasin that stimulates L. monocytogenes internalization and affects the transcriptional activity of infected cells. Therefore, the structure of an ideal classification scheme of bacterial VFs might be net-like rather than tree-like. In the future, the VFDB database will continue to integrate up-to-date knowledge of additional bacterial pathogens with medical importance to fulfill the demands of all users worldwide. Moreover, to circumvent the aforementioned dilemma behind the current classification scheme, an ontology-based computer-interpretable system tailored for bacterial VFs might be necessary to better depict the functional and/or structural features and the possible multiple roles in virulence for each characterized VF. Click here for additional data file.
  29 in total

1.  Circular genome visualization and exploration using CGView.

Authors:  Paul Stothard; David S Wishart
Journal:  Bioinformatics       Date:  2004-10-12       Impact factor: 6.937

Review 2.  Targeted imaging of bacterial infections: advances, hurdles and hopes.

Authors:  Marleen van Oosten; Markus Hahn; Lucia M A Crane; Rick G Pleijhuis; Kevin P Francis; Jan Maarten van Dijl; Gooitzen M van Dam
Journal:  FEMS Microbiol Rev       Date:  2015-06-24       Impact factor: 16.408

Review 3.  Antimicrobial resistance and virulence: a successful or deleterious association in the bacterial world?

Authors:  Alejandro Beceiro; María Tomás; Germán Bou
Journal:  Clin Microbiol Rev       Date:  2013-04       Impact factor: 26.132

4.  Learning transferable deep convolutional neural networks for the classification of bacterial virulence factors.

Authors:  Dandan Zheng; Guansong Pang; Bo Liu; Lihong Chen; Jian Yang
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

5.  COG database update: focus on microbial diversity, model organisms, and widespread pathogens.

Authors:  Michael Y Galperin; Yuri I Wolf; Kira S Makarova; Roberto Vera Alvarez; David Landsman; Eugene V Koonin
Journal:  Nucleic Acids Res       Date:  2020-11-09       Impact factor: 16.971

Review 6.  Nutrition and Bipartite Metabolism of Intracellular Pathogens.

Authors:  Ashley Best; Yousef Abu Kwaik
Journal:  Trends Microbiol       Date:  2019-01-14       Impact factor: 17.079

Review 7.  Stress Responses, Adaptation, and Virulence of Bacterial Pathogens During Host Gastrointestinal Colonization.

Authors:  Annika Flint; James Butcher; Alain Stintzi
Journal:  Microbiol Spectr       Date:  2016-04

8.  A framework for human microbiome research.

Authors: 
Journal:  Nature       Date:  2012-06-13       Impact factor: 49.962

9.  Genome-wide dissection reveals diverse pathogenic roles of bacterial Tc toxins.

Authors:  Nan Song; Lihong Chen; Zhemin Zhou; Xingmei Ren; Bo Liu; Siyu Zhou; Caihong Wang; Yun Wu; Nicholas R Waterfield; Jian Yang; Guowei Yang
Journal:  PLoS Pathog       Date:  2021-02-04       Impact factor: 6.823

Review 10.  Current understanding of the human microbiome.

Authors:  Jack A Gilbert; Martin J Blaser; J Gregory Caporaso; Janet K Jansson; Susan V Lynch; Rob Knight
Journal:  Nat Med       Date:  2018-04-10       Impact factor: 53.440

View more
  21 in total

1.  Genomic analysis of a novel active prophage of Hafnia paralvei.

Authors:  Lingting Pan; Dengfeng Li; Wei Lin; Wencai Liu; Weinan Qin; Lihua Xu; Yigang Tong
Journal:  Arch Virol       Date:  2022-06-25       Impact factor: 2.685

2.  Genomic sequence of the non-pathogen Neisseria sp. strain MA1-1 with antibiotic resistance and virulence factors isolated from a head and neck cancer patient.

Authors:  Eui Tae Kim; Young Suk Kim; Soo-Je Park
Journal:  Arch Microbiol       Date:  2022-09-02       Impact factor: 2.667

3.  The Notable Achievements and the Prospects of Bacterial Pathogen Genomics.

Authors:  Grigorios D Amoutzias; Marios Nikolaidis; Andrew Hesketh
Journal:  Microorganisms       Date:  2022-05-17

4.  Genomic Diversity, Virulence Gene, and Prophage Arrays of Bovine and Human Shiga Toxigenic and Enteropathogenic Escherichia coli Strains Isolated in Hungary.

Authors:  Domonkos Sváb; Linda Falgenhauer; Tünde Mag; Trinad Chakraborty; István Tóth
Journal:  Front Microbiol       Date:  2022-07-05       Impact factor: 6.064

5.  Comparative Genomics Applied to Systematically Assess Pathogenicity Potential in Shiga Toxin-Producing Escherichia coli O145:H28.

Authors:  Michelle Qiu Carter; Nicole Laniohan; Chien-Chi Lo; Patrick S G Chain
Journal:  Microorganisms       Date:  2022-04-21

6.  Case Report: Whole-Genome Sequencing of Serially Collected Haemophilus influenzae From a Patient With Common Variable Immunodeficiency Reveals Within-Host Evolution of Resistance to Trimethoprim-Sulfamethoxazole and Azithromycin After Prolonged Treatment With These Antibiotics.

Authors:  Paul Christoffer Lindemann; Haima Mylvaganam; Oddvar Oppegaard; Inger Lill Anthonisen; Nermin Zecic; Dagfinn Skaare
Journal:  Front Cell Infect Microbiol       Date:  2022-06-01       Impact factor: 6.073

7.  Streptococcus oriscaviae sp. nov. Infection Associated with Guinea Pigs.

Authors:  Jade L L Teng; Yuanchao Ma; Jonathan H K Chen; Ruibang Luo; Chuen-Hing Foo; Tsz Tuen Li; Jordan Y H Fong; Weiming Yao; Samson S Y Wong; Kitty S C Fung; Susanna K P Lau; Patrick C Y Woo
Journal:  Microbiol Spectr       Date:  2022-05-05

8.  The 2022 Nucleic Acids Research database issue and the online molecular biology database collection.

Authors:  Daniel J Rigden; Xosé M Fernández
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

9.  Metagenome Analysis of the Bacterial Characteristics in Invasive Klebsiella Pneumoniae Liver Abscesses.

Authors:  Zhijie Zhang; Hairui Wang; Yawen Guo; Zhaoyu Liu; Zhihui Chang
Journal:  Front Cell Infect Microbiol       Date:  2022-07-15       Impact factor: 6.073

10.  Genetic characterization of hypervirulent Klebsiella pneumoniae responsible for acute death in captive marmosets.

Authors:  Komkiew Pinpimai; Wijit Banlunara; Wendi D Roe; Keren Dittmer; Patrick J Biggs; Rachod Tantilertcharoen; Katriya Chankow; Napawan Bunpapong; Pongthai Boonkam; Nopadon Pirarat
Journal:  Front Vet Sci       Date:  2022-08-04
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.