| Literature DB >> 25414340 |
Martin Urban1, Rashmi Pant2, Arathi Raghunath2, Alistair G Irvine3, Helder Pedro4, Kim E Hammond-Kosack5.
Abstract
Rapidly evolving pathogens cause a diverse array of diseases and epidemics that threaten crop yield, food security as well as human, animal and ecosystem health. To combat infection greater comparative knowledge is required on the pathogenic process in multiple species. The Pathogen-Host Interactions database (PHI-base) catalogues experimentally verified pathogenicity, virulence and effector genes from bacterial, fungal and protist pathogens. Mutant phenotypes are associated with gene information. The included pathogens infect a wide range of hosts including humans, animals, plants, insects, fish and other fungi. The current version, PHI-base 3.6, available at http://www.phi-base.org, stores information on 2875 genes, 4102 interactions, 110 host species, 160 pathogenic species (103 plant, 3 fungal and 54 animal infecting species) and 181 diseases drawn from 1243 references. Phenotypic and gene function information has been obtained by manual curation of the peer-reviewed literature. A controlled vocabulary consisting of nine high-level phenotype terms permits comparisons and data analysis across the taxonomic space. PHI-base phenotypes were mapped via their associated gene information to reference genomes available in Ensembl Genomes. Virulence genes and hotspots can be visualized directly in genome browsers. Future plans for PHI-base include development of tools facilitating community-led curation and inclusion of the corresponding host target(s).Entities:
Mesh:
Year: 2014 PMID: 25414340 PMCID: PMC4383963 DOI: 10.1093/nar/gku1165
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Growth of the number of published articles screened by keyword search for PHI-base and the number of phenotypically curated genes. This figure was generated from literature records retrieved at PubMed and Web of Science using the search terms ‘(fung*or yeast) and (gene or factor) and (pathogenicity or virulen* or avirulence gene*)’. Key events in the history of functional gene analysis of pathogen-host interactions include: a, identification of the first avirulence gene (4); b, >1500 genome sequencing projects listed in the GOLD database (5); c, genome-wide functional analysis of pathogenicity genes in the rice blast fungus Magnaporthe oryzae; d, the first host-induced gene silencing (HIGS) study involving an obligate biotrophic species (6); e, genome-wide functional analysis of all transcription factors and protein kinases predicted in the cereal infecting fungus Fusarium graminearum (7,8).
Multispecies databases and websites involving plant, human and/or animal infecting pathogens which contain information complementary to the data in PHI-base
| Name and refa | URL (http://) | Comments |
|---|---|---|
| Broad-Fungal Genome Initiative | Genome browsing and comparative analysis for several plant pathogen division | |
| DFVF ( | Fungal virulence factor database generated using text-mining of the PubMed database and Internet | |
| e-Fungi ( | Rich source of ESTs obtained by Sanger sequencing | |
| Ensembl Genomes ( | Non-vertebrate species genomes portal with links to bacteria, fungi, metazoa, plants and protists | |
| Ensembl Bacteria | Genomes of bacterial and archea | |
| Ensembl Fungi | Genomes of fungal species including fungal pathogens | |
| Ensembl Protists | Genomes of protist species including Phytophthora | |
| Oomycetes Transcriptomics Database ( | Oomycete genomes and transcriptomics | |
| EuPathDB ( | Human pathogens | |
| FRAC | All known chemical target sites used commercially for the control of pathogens | |
| FungiDB ( | Fungal genomics database providing graphical tools for data mining | |
| HPIDB ( | Fifteen human virus pathogens–protein-protein interaction data | |
| JGI-MycoCosm ( | A genome portal for 100s of pathogenic and non-pathogenic fungal species | |
| Pathogen Portal | Emerging or re-emerging pathogens, potential biowarfare or bioterrorism pathogens | |
| PHIDIAS ( | Medical fungal and bacterial pathogens | |
| PhytoPath | PhytoPath–32 Fungi, 14 Protists, 12 bacterial species linked to PHI-base | |
| PLEXdb ( | Transcriptomics data only on plants, pathogens and during interactions | |
| USDA | Description of all the known hosts of fungi which infect plants | |
| VFDB ( | Virulence factors of human and animal bacterial pathogens |
aReference provided where available.
Interactions in PHI-base version 3.6 grouped by either host species or pathogen species
| Host/Entry type | Interactions |
|---|---|
| TOTALa | 4102 |
| PROKARYOTES (55)b | 804 |
| Animal hosts (16)c | 249 (31%) |
| 115 | |
| Plant hosts (29) | 555 (69%) |
| 300 | |
| 161 | |
| 29 | |
| 10 | |
| EUKARYOTES (105) | 3298 |
| Animal hosts (20) | 549 (16.6%) |
| 375 | |
| 238 | |
| 98 | |
| 144 | |
| 136 | |
| 30 | |
| Plant hosts (93) | 2744 (83.2%) |
| 2384 | |
| 1053 | |
| 1042 | |
| 575 | |
| 205 | |
| 93 | |
| 88 | |
| 78 | |
| 48 | |
| 44 | |
| 42 | |
| 261 | |
| 243 | |
| 7 | |
| 86 | |
| 53 | |
| 30 | |
| Others (4) | 13 |
| 10 | |
| 3 | |
| Fungal hosts (3) | 4 |
| Endophyte (1) | 5 |
| 5 |
aOnly highly represented taxon groups are listed. For a complete list of species in the database see Supplementary Table S1.
bThe table is divided into prokaryote and eukaryote host species. The species count number is listed in brackets.
cHost species are further divided into animal and plant host.
dLeft-indented genera and species infect or belong to taxonomic group listed non-indented above. Only main representatives organisms are listed.
eParasitic species are Leishmania infantum, L. mexicana, Toxoplasma gondii, Trypanosoma brucei and T. cruzi.
Definitions for the nine high-level phenotype outcomes used in PHI-base
| High-level phenotype outcomea | Definition |
|---|---|
| Loss of pathogenicity | The transgenic strain fails to cause disease that is observed in the wild type (i.e. qualitative effect). |
| Reduced virulence | The transgenic strain still causes some disease formation but fewer symptoms than the wild-type strain (i.e. a quantitative effect). Synonymous with the term reduced aggressiveness. |
| Unaffected pathogenicity | The transgenic strain which expresses altered levels of a specific gene product(s) causes the same level of disease compared to the wild-type reference strain. |
| Increased virulence (Hypervirulence) | The transgenic strain causes greater incidence or severity of disease than the wild-type strain. |
| Effector (plant avirulence determinant) | Some effector genes are required to cause disease on susceptible hosts but most are not. A plant pathogen-specific term which was previously referred to as a corresponding avirulence |
| Lethal | The transgenic strain is not viable. The gene product is essential for life of the organism. |
| Enhanced antagonism | The transgenic strain shows greater endophytic biomass in the host and/or the formation of visible disease symptoms. |
| Resistant to chemical | The transgenic strainb grows and/or develops normally when exposed to chemistry concentrations that are detrimental to the wild-type strain. |
| Sensitive to chemical | The transgenic strain which expresses either no or reduced levels of a specific gene product(s) or possesses a specific gene mutation(s), has the same abilityc as the wild-type strain to grow and develop when exposed to detrimental chemistry concentrations. |
aCompared to wild-type reference strain (i.e. a direct isogenic strain comparison).
bMolecular studies on natural field isolate population are also considered, once the natural target site has been identified.
cOn rare occasions increased sensitivity to chemistry has been observed.
Number of interactions per phenotypic group in animal and plant hosts
| Entry type | Animal hosta | Plant host |
|---|---|---|
| Loss of pathogenicity | 73 | 404 |
| Reduced virulenceb | 542 | 1056 |
| Increased virulence | 33 | 51 |
| Essential (lethal) | 46 | 74 |
| Unaffected pathogenicityc | 80 | 1144 |
| Effector | 0 | 533 |
| Enhanced antagonism | 0 | 4 |
| Resistance to chemistry | 5 | 30 |
| Sensitive to chemistryd | 1 | 7 |
aAnimal and plant-attacking pathogens are listed with their taxonomy ID and lifestyle in Supplementary Table S1.
bThe three missing entries in this category have other host types.
cOne entry in this category has a fungal host
dOne entry in this category has a fish host.
Figure 2.GO terms assigned to PHI-base accessions in Version 3.6 mapped to a biological process.
Figure 3.Inspection of gene function using the Ensembl genome browser. (A) Displayed is a small chromosomal region in Magnaporthe oryzae showing two genes involved in pathogenicity (as annotated in PHI-base) in their genomic context (viewable in the Ensembl browser, in the transcript display). A colour code indicates the annotated role of each gene, green ‘loss of pathogenicity’ and orange ‘reduced virulence’. (B) By selecting each colour-coded MGG transcript ID, information is revealed on the associated gene deletion study curated in the PHI-base database.
PHI-base uses that have often appeared in the peer-reviewed literature
| Use case | Type of research study | Example reference |
|---|---|---|
| 1 | Annotation and candidate gene selection | |
| Large scale forward genetics screens | ( | |
| Transcriptome studies (RNAseq, microarrays, ESTs) | ( | |
| Full and partial genome annotation, genome mining | ( | |
| 2 | Predictive bioinformatics analyses: Networks, protein-protein interaction mapping | ( |
| 3 | Complementary databases | ( |
| 4 | Review articles | ( |
| 5 | Single gene function studies | |
| Inter-comparisonand inter-comparison of gene mutants within and between species | ( |