Literature DB >> 23092926

DFVF: database of fungal virulence factors.

Tao Lu1, Bo Yao, Chi Zhang.   

Abstract

Fungal pathogens cause various diseases for plant and animal hosts. Despite the extensive impact of fungi on human health and life, the threats posed by emerging fungal pathogens are poorly understood. Specifically, there exist few fungal virulence gene databases, which prevent effective bioinformatics studies on fungal pathogens. Therefore, we constructed a comprehensive online database of known fungal virulence factors, which collected 2058 pathogenic genes produced by 228 fungal strains from 85 genera. This database creates a pivotal platform capable of stimulating and facilitating further bench studies on fungal pathogens.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 23092926      PMCID: PMC3478563          DOI: 10.1093/database/bas032

Source DB:  PubMed          Journal:  Database (Oxford)        ISSN: 1758-0463            Impact factor:   3.451


Introduction

Fungal organisms comprise one of the most diverse kingdoms on Earth; the number of fungal species is estimated to be ∼1.5 million (1). Many fungal taxa are pathogens under certain conditions. For instance, ∼80 000 fungal taxa have pathogenicity on 56 000 vascular plant hosts, according to the Fungus–Host Distributions of US Department of Agriculture (http://nt.ars-grin.gov). Fungal pathogens have a broad spectrum of hosts, including plants and animals, and may cause death and disability in humans, yield loss in agricultural crops and even alteration of forest ecosystem dynamics. For example, fungi in the genus Fusarium cause a variety of blights, seedling disease, root rots or wilts on nearly all species of cultivated plants, and they strongly affect the agriculture economy. Virulence factors are the most important proteins in pathogens, such as toxin synthetic enzymes and secreted biodegradation enzymes (2), that permit them to evade the defense mechanisms of the host and, thus, cause diseases (3). Currently, the number of reported fungal virulence factors is limited, and many of the data are only available in the literature texts. Please refer to references (4–8) for reviews on fungal virulence factors. To maximize the value of this type of data, it is essential that a method of data storage and sharing should be implemented for efficient and effective data mining. To this end, we believe that the utilization of a web-based comprehensive database will significantly facilitate such activities. Currently, there exists only one database, PHI-base (9), for fungal pathogens. Although PHI-base collected pathogenic genes for all types of fungal and bacterial pathogens, it contained limited number of fungal virulence factors, and most of its records are about bacterial pathogens. In comparison, advanced databases and prediction tools similar to what we are proposing have been developed for many other bacterial pathogens and were proven useful (10–13). Therefore, we constructed a comprehensive online database of fungal virulence factors for public use to fulfil this important need.

Database generation and its content

The state-of-the-art text-mining technique is used by the PubMed database and the Internet by searching keywords, such as fungal virulence factors, pathogenic genes and so forth. An in-house tool, programed in Python, is used to fetch article titles and abstracts from the PubMed database, and algorithms used by MedTAKMI (14) are implemented locally for entity extraction. Human intelligence is also involved to screen the output of automatic literature mining methods. Virulence factors are protein products of virulence genes, which are helpful for induction and development of disease (7, 15), but this database focuses on genes and their protein products. Some fungi exist in normal human body flora, such as Saccharomyces cerevisiae, and are normally non-pathogenic. However, they could also cause life-threatening infections, which often occur in immunocompromised patients or vulnerable population with weakened immune systems (16, 17). Therefore, virulence factors from this kind of fungi are also included in the database. As a result, 2058 fungal virulence factors are collected, which belong to 85 fungal genera, 228 fungal strains (by NCBI taxonomy ID). These virulence factors come from 593 peer-reviewed journal articles and sequences submitted to GenBank or UniProt databases. Although they are taxonomically different from fungi, oomycetes, as originally being classified among the fungi (18–20), are included this database, and only 79 records come from oomycetes. Of the 2058 proteins, 320 virulence factors are predicted to be secreted by fungi using WoLFPSORT (21) and signalp (22), whereas ∼30 proteins are related to biosynthetic fungal toxin, such as mycotoxin. As a comparison, there are 600 fungal pathogenic genes in the PHI-database (9). The related information of all virulence factors, such as gene symbol, NCBI database IDs, taxonomy and the protein sequence, is collected. In database of fungal virulence factors (DFVF), the fungus strain taxonomy is recorded with its original reference, as this method may help investigators find the references. In addition, the Pfam (23) domain annotation of each virulence factor and their Gene Ontology annotation are also recorded. The phenotypic information, such as disease information and host, is also collected. In DFVF, all plant diseases and host descriptions are from the US Department of Agriculture database records. Table 1 shows the statistics of hosts and their pathogenic fungi as well as virulence factors in DFVF. For example, there are 1308 virulence factors from 71 pathogens, whose hosts are all vertebrata animals, whereas 539 factors from 65 fungal pathogens have hosts of herbal plants.
Table 1

Statistics of virulence factors as their hosts

HostGenusSpeciesFactors
Animal
    Alla225451
    Vertebrata36711308
    Invertebrata5845
Plant
    Alla2390346
    Herb2065539
    Xyloid1767261

aAll means that the hosts of the virulence factors have a broad spectrum and cover many different kinds of hosts.

Statistics of virulence factors as their hosts aAll means that the hosts of the virulence factors have a broad spectrum and cover many different kinds of hosts.

USER interface

Search

The database system provides interactive access to all of the collected data, and users may connect to the database using a web browser. Figure 1 shows a snapshot of the user interface for users to browse or search the database. The ‘Browse’ button allows users to get a list of all records in one table. Variable search options are provided to conveniently locate the genes of interest. If a user knows the UniProt ID, gene name or a fungus name, the gene information search can be directly applied. If a user wants to get all virulence factors related to a certain disease or host, a disease related gene search can also be conducted. The most convenient searching approach is the keyword search in which the database will return all virulence factors for which the keyword is contained in any piece of the information under the factors.
Figure 1

The searching page of the database.

The searching page of the database.

Results

Once a user browses the whole database or searches with a specific option, the database first returns a table of related records, and then displays the UniProt ID, gene information and disease information in three columns, shown in Figure 2. The details of each factor can be displayed by clicking on the link of UniProt ID. The information for each virulence factor includes the basic information, DNA/protein sequences, disease information and annotation. The basic information consists of UniProt ID, gene symbol, taxonomy, ID and links to other databases and description. The PubMed links to all related publications are also presented. If the protein of a virulence factor has one or more Pfam domains, the links to those domains are provided. Descriptions on phenotypic data, including diseases and hosts are shown in the section of ‘Disease information.
Figure 2

The display page of searching result and information of each gene.

The display page of searching result and information of each gene.

Implementation

We adopted the LAMP (Linux, Apache, MySQL, PHP) platform to construct the online database system. The user interface has been designed using the JavaScript application framework. The user interface additionally accepts parameters through a URL for direct searching. This feature facilitates a link to the database from external sites, and it also allows users to bookmark and to cite specific results.

Accessibility

The database is freely available to all users without restriction at http://sysbio.unl.edu/DFVF. All data are downloadable from the same website. In addition to the link to download the whole database, we provide various links to download data in different categories for convenience. The source codes and other detailed information are available on request.

Funding

This work is supported by the University of Nebraska—Lincoln start-up funds (to C.Z.) and the Nebraska Soybean Board Funds (to C.Z.). Funding for open access charge: University of Nebraska - Lincoln start-up funds. Conflict of interest. None declared.
  18 in total

Review 1.  Contribution of proteomics to the study of plant pathogenic fungi.

Authors:  Raquel Gonzalez-Fernandez; Jesus V Jorrin-Novo
Journal:  J Proteome Res       Date:  2011-11-28       Impact factor: 4.466

2.  SignalP 4.0: discriminating signal peptides from transmembrane regions.

Authors:  Thomas Nordahl Petersen; Søren Brunak; Gunnar von Heijne; Henrik Nielsen
Journal:  Nat Methods       Date:  2011-09-29       Impact factor: 28.547

Review 3.  Nep1-like proteins from plant pathogens: recruitment and diversification of the NPP1 domain across taxa.

Authors:  Mark Gijzen; Thorsten Nürnberger
Journal:  Phytochemistry       Date:  2006-01-23       Impact factor: 4.072

Review 4.  Fungal effector proteins.

Authors:  Ioannis Stergiopoulos; Pierre J G M de Wit
Journal:  Annu Rev Phytopathol       Date:  2009       Impact factor: 13.078

Review 5.  Fungal effector proteins: past, present and future.

Authors:  Pierre J G M De Wit; Rahim Mehrabi; Harrold A Van den Burg; Ioannis Stergiopoulos
Journal:  Mol Plant Pathol       Date:  2009-11       Impact factor: 5.663

6.  The Pfam protein families database.

Authors:  Robert D Finn; Jaina Mistry; John Tate; Penny Coggill; Andreas Heger; Joanne E Pollington; O Luke Gavin; Prasad Gunasekaran; Goran Ceric; Kristoffer Forslund; Liisa Holm; Erik L L Sonnhammer; Sean R Eddy; Alex Bateman
Journal:  Nucleic Acids Res       Date:  2009-11-17       Impact factor: 16.971

Review 7.  Toxins of filamentous fungi.

Authors:  Deepak Bhatnagar; Jiujiang Yu; Kenneth C Ehrlich
Journal:  Chem Immunol       Date:  2002

8.  Pseudomonas Genome Database: improved comparative analysis and population genomics capability for Pseudomonas genomes.

Authors:  Geoffrey L Winsor; David K W Lam; Leanne Fleming; Raymond Lo; Matthew D Whiteside; Nancy Y Yu; Robert E W Hancock; Fiona S L Brinkman
Journal:  Nucleic Acids Res       Date:  2010-10-06       Impact factor: 16.971

9.  MyBASE: a database for genome polymorphism and gene function studies of Mycobacterium.

Authors:  Xinxing Zhu; Suhua Chang; Kechi Fang; Sijia Cui; Jun Liu; Zuowei Wu; Xuping Yu; George F Gao; Huanming Yang; Baoli Zhu; Jing Wang
Journal:  BMC Microbiol       Date:  2009-02-20       Impact factor: 3.605

10.  VFDB 2008 release: an enhanced web-based resource for comparative pathogenomics.

Authors:  Jian Yang; Lihong Chen; Lilian Sun; Jun Yu; Qi Jin
Journal:  Nucleic Acids Res       Date:  2007-11-04       Impact factor: 16.971

View more
  16 in total

Review 1.  Computational Prediction of Effector Proteins in Fungi: Opportunities and Challenges.

Authors:  Humira Sonah; Rupesh K Deshmukh; Richard R Bélanger
Journal:  Front Plant Sci       Date:  2016-02-12       Impact factor: 5.753

2.  Identification of horizontally transferred genes in the genus Colletotrichum reveals a steady tempo of bacterial to fungal gene transfer.

Authors:  Vinicio D Armijos Jaramillo; Serenella A Sukno; Michael R Thon
Journal:  BMC Genomics       Date:  2015-01-02       Impact factor: 3.969

3.  The Pathogen-Host Interactions database (PHI-base): additions and future developments.

Authors:  Martin Urban; Rashmi Pant; Arathi Raghunath; Alistair G Irvine; Helder Pedro; Kim E Hammond-Kosack
Journal:  Nucleic Acids Res       Date:  2014-11-20       Impact factor: 16.971

4.  Genomics and Comparative Genomic Analyses Provide Insight into the Taxonomy and Pathogenic Potential of Novel Emmonsia Pathogens.

Authors:  Ying Yang; Qiang Ye; Kang Li; Zongwei Li; Xiaochen Bo; Zhen Li; Yingchun Xu; Shengqi Wang; Peng Wang; Huipeng Chen; Junzhi Wang
Journal:  Front Cell Infect Microbiol       Date:  2017-03-31       Impact factor: 5.293

Review 5.  Pathogenomics and Management of Fusarium Diseases in Plants.

Authors:  Sephra N Rampersad
Journal:  Pathogens       Date:  2020-05-01

6.  Comparative genomic analyses reveal the features for adaptation to nematodes in fungi.

Authors:  Ruizhen Wang; Leiming Dong; Ran He; Qinghua Wang; Yuequ Chen; Liangjian Qu; Yong-An Zhang
Journal:  DNA Res       Date:  2018-01-05       Impact factor: 4.458

7.  Genome Sequencing of Cladobotryum protrusum Provides Insights into the Evolution and Pathogenic Mechanisms of the Cobweb Disease Pathogen on Cultivated Mushroom.

Authors:  Frederick Leo Sossah; Zhenghui Liu; Chentao Yang; Benjamin Azu Okorley; Lei Sun; Yongping Fu; Yu Li
Journal:  Genes (Basel)       Date:  2019-02-08       Impact factor: 4.096

8.  Comparative genome analysis indicates high evolutionary potential of pathogenicity genes in Colletotrichum tanaceti.

Authors:  Ruvini V Lelwala; Pasi K Korhonen; Neil D Young; Jason B Scott; Peter K Ades; Robin B Gasser; Paul W J Taylor
Journal:  PLoS One       Date:  2019-05-31       Impact factor: 3.240

9.  Natural selection on coding and noncoding DNA sequences is associated with virulence genes in a plant pathogenic fungus.

Authors:  Gabriel E Rech; José M Sanz-Martín; Maria Anisimova; Serenella A Sukno; Michael R Thon
Journal:  Genome Biol Evol       Date:  2014-09-04       Impact factor: 3.416

10.  Genome analysis reveals evolutionary mechanisms of adaptation in systemic dimorphic fungi.

Authors:  José F Muñoz; Juan G McEwen; Oliver K Clay; Christina A Cuomo
Journal:  Sci Rep       Date:  2018-03-14       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.