Literature DB >> 31733065

PHI-base: the pathogen-host interactions database.

Martin Urban1, Alayne Cuzick1, James Seager1, Valerie Wood2, Kim Rutherford2, Shilpa Yagwakote Venkatesh3, Nishadi De Silva4, Manuel Carbajo Martinez4, Helder Pedro4, Andy D Yates4, Keywan Hassani-Pak5, Kim E Hammond-Kosack1.   

Abstract

The pathogen-host interactions database (PHI-base) is available at www.phi-base.org. PHI-base contains expertly curated molecular and biological information on genes proven to affect the outcome of pathogen-host interactions reported in peer reviewed research articles. PHI-base also curates literature describing specific gene alterations that did not affect the disease interaction phenotype, in order to provide complete datasets for comparative purposes. Viruses are not included, due to their extensive coverage in other databases. In this article, we describe the increased data content of PHI-base, plus new database features and further integration with complementary databases. The release of PHI-base version 4.8 (September 2019) contains 3454 manually curated references, and provides information on 6780 genes from 268 pathogens, tested on 210 hosts in 13,801 interactions. Prokaryotic and eukaryotic pathogens are represented in almost equal numbers. Host species consist of approximately 60% plants (split 50:50 between cereal and non-cereal plants), and 40% other species of medical and/or environmental importance. The information available on pathogen effectors has risen by more than a third, and the entries for pathogens that infect crop species of global importance has dramatically increased in this release. We also briefly describe the future direction of the PHI-base project, and some existing problems with the PHI-base curation process.
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 31733065      PMCID: PMC7145647          DOI: 10.1093/nar/gkz904

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Infectious diseases have a profound influence on every aspect of society. Diseases are a major concern to plant, animal, human, and ecosystem health. Globally infectious diseases threaten food security, human community structures, and the biodiversity of natural ecosystems (1–3). The increasing effects of climate change, human migration, and the globalisation of the trading of fresh goods have resulted in a rise in the incidence and severity of existing disease problems, as well as the emergence of a cohort of novel pathogen species and zoonoses (4). In addition, the (re)acquisition of resistance to anti-infective chemistries—coupled with a rise in legislation banning or restricting existing chemistries—means the burden of microbial infections is of ever growing concern to human, animal and plant welfare (5,6). In the United Kingdom alone, the total economic burden from infectious diseases is estimated at £30 billion annually, and accounts for 7% of all deaths (7). Infectious diseases are a consequence of complex and dynamic interactions between pathogen virulence factors, and host cell recognition and response systems (8–10). It is increasingly clear that studying these interactions across the tree of life is a fertile ground for uncovering crucial biological principles that control the interaction outcome. In addition, in the post-genomics era—with the ever-decreasing costs for whole genome sequencing, genome assembly, and gene prediction—there is intense scientific and commercial interest in comparative pathogen genomics, as well as whole genome protein–protein interaction predictions and comparisons to identify functionally homologous genes, and to pinpoint species-unique genes and pathways. This increased understanding of the dynamics of a wide range of interactions contributes to the two predominant approaches available for combating infectious disease: namely, stimulating the host immune system to prevent infections, and minimizing the use of chemicals to eliminate infectious agents (11–13). The pathogen–host interactions database (PHI-base) was established in 2005 and is freely available at www.phi-base.org. PHI-base contains expertly curated molecular and biological information on genes proven to affect the phenotypic outcome of pathogen–host interactions (14,15). All PHI-base entries are supported by strong experimental evidence from a peer reviewed publication. PHI-base catalogues experimentally verified pathogenicity, virulence, and effector genes from fungal, protist, and bacterial pathogens which infect plant, human, animal, and insect hosts. Genes tested but found not to affect the interaction outcome are also expertly curated. In PHI-base, the term ‘interaction’ is used to describe the observable function of one gene, on one host, on one tissue type (14). Nine high-level phenotypic outcome terms have been developed to permit the comparison of interactions across the entire tree of life (16). These terms are ‘loss of pathogenicity’, ‘reduced virulence’, ‘increased virulence’, ‘unaffected pathogenicity’, ‘effector’, ‘lethal’, ‘increased virulence (hypervirulence)’, ‘resistance to chemical’ and ‘sensitivity to chemical’. These high-level phenotypic outcome terms—although not yet supported by a formal controlled vocabulary—are particularly useful for bioinformaticians and biologists unfamiliar with the nuances of multiple pathogen–host interactions, but who wish to include pathogens with different lifestyles and host ranges in their comparative analyses. In addition, a PHIB-BLAST tool has been introduced to permit simple or advanced BLAST queries arising from functional genomics, transcriptomics, and proteomics experimentation. In 2017, PHI-base joined the UK node of ELIXIR’s ‘Data for Life’ project as a gold-standard ‘agricultural omics data’ provider (17). PHI-base follows the FAIR data principles in order to make data findable, accessible, interoperable, and reusable (18). PHI-base also reuses data provided by external resources, including PubMed, NCBI taxonomy, UniProtKB, and the Gene Ontology (GO). A number of complementary multi-species databases on pathogens exist that also provide gene function annotation (recently reviewed by (14,19,20)). PHI-base is unique in describing a broad range of plant and animal pathogen–host interactions using the same controlled vocabulary consistently across >250 species. In this article, we report on a major increase in PHI-base gene content, new database features, integration with complementary databases, and our immediate plans using new funding.

RESULTS AND DISCUSSION

Biological data

Version 4.8 of PHI-base (released in September 2019 and described in this article), contains data on 6780 genes, 13801 interactions, 268 pathogens, 210 hosts and 3454 references. This version includes 71% more interactions, each annotated with a phenotype, compared to PHI-base version 4.2 described in (14). Bacteria and fungal pathogens represent the majority of the interaction data, with a near 50:50 split of entries; whilst protists, nematodes and insects represent 3.6% of the species (Table 1). The fungal pathogen interactions are dominated by the Ascomycetes, which covers 88.5% of annotated fungal interactions (5929 interactions, 100 species); this is followed by the Basidiomycetes, which only cover 11.4% of annotated interactions (762 interactions, 11 species). In total, 5755 phenotype interactions describing experimental data on 2320 genes from 1235 newly curated publications are included up to March 2019.
Table 1.

Summary of pathogen groups, interactions and phenotypes within PHI-base version 4.8

Phenotype/pathogenBacteriumFungusProtistNematodeInsectTotals
Number of pathogens1311121762268
Interactions in total66086696463241013801
Loss of pathogenicity204696710908
Reduced virulence30542960961306123
Unaffected pathogenicity1375220260003637
Effector (plant avirulence determinant)15114682639102261
Increased virulence (hypervirulence)4331732810635
Lethal18156900183
Chemical target: resistance to chemical72900036
Chemical target: sensitivity to chemical6800014
Enhanced antagonism040004
Summary of pathogen groups, interactions and phenotypes within PHI-base version 4.8 The number of pathogenic species in PHI-base was capped at 268 and includes a small number of newly emerging pathogens under intense investigation. Plant infecting pathogens—namely bacteria, fungi, protists, nematodes and insects—represent 60% of the species in PHI-base (Table 2). Amongst these, there is an almost equal split between cereal and non-cereal infecting species. Woody tree infecting species provide 1004 interaction entries (7.3% of plant pathogen interactions). Amongst the 32 human and animal infecting pathogens, an increasing number are now being tested on non-vertebrate species: for example, various insects, nematodes and crustaceans. These non-vertebrate pathogen interactions now account for 23% of database entries (Table 2).
Table 2.

Summary of the number of host species and interactions within PHI-base version 4.8

PhenotypePlantVertebrateInsectNematodeOthers
Host species1313224122
Interactions in total8248443969625881
Loss of pathogenicity6502331591
Reduced virulence2885265538613760
Unaffected pathogenicity232610041939816
Effector (plant avirulence determinant)20012332313
Increased virulence (hypervirulence)24430077131
Lethal10180200
Chemical target: resistance to chemical273000
Chemical target: sensitivity to chemical131000
Enhanced antagonism40000
Summary of the number of host species and interactions within PHI-base version 4.8 As in previous versions of PHI-base, the highest number of pathogen–host interactions tested in molecular genetic studies and reported in the literature are from the filamentous fungal pathogens Fusarium graminearum and Magnaporthe oryzae, which cause various diseases on staple crops, such as wheat, rice and maize (Table 3). The most highly represented plant-infecting bacteria are Ralstonia solanacearum, a pathogen of potato and other Solanaceae crops; and Xanthomonas oryzae, a pathogen of rice. For the animal kingdom, the most frequently studied pathogens include the human pathogens Salmonella enterica, Candida albicans and Pseudomonas aeruginosa (Table 3). Amongst the top 30 species present in PHI-base, phenotypic interaction information—from single, double and occasionally multiple gene deletions—is provided for each species: from a minimum of 32 genes to a maximum of 1340 genes. However, for the cereal infecting fungus Pyrenophora tritici-repentis, only five genes have been explored over 142 interactions. Overall, the 30 top species in PHI-base consist of 12 fungi, 1 protist and 17 bacteria, and together these covers 71% of total interactions and 88% of total genes.
Table 3.

Top species and interactions within PHI-base version 4.8

PathogenInteractionsGenesLoss of pathogenicityReduced virulenceIncreased virulenceEffectorUnaffected pathogenicityLethalNo. of tested host species
Fusarium graminearum 1571134036516809179413
Magnaporthe oryzae 1273738279501108439817
Ralstonia solanacearum 66613216430597919
Salmonella enterica 664412838145108122011
Xanthomonas oryzae 512224396243068303
Erwinia amylovora 45013534165551518105
Candida albicans 4483434830511080412
Pseudomonas aeruginosa* 44022019218344165016
Botrytis cinerea 36814724205104123026
Ustilago maydis 3602644818781710003
Aspergillus fumigatus 3092073012814093424
Cryptococcus neoformans 3052034418417050108
Pseudomonas syringae 293170153919138113
Escherichia coli 2641691167151169113
Staphylococcus aureus 2121391213722238110
Fusarium oxysporum* 209131249082760017
Xanthamonas campestris 1801101195403928
Klebsiella pneumoniae 173134472409304
Streptococcus pneumoniae 1521062110403065
Mycobacterium tuberculosis 1501123643614604
Candida glabrata 14843089605213
Verticillium dahliae 14560146492434016
Listeria monocytogenes 14269210217318010
Pyrenophora tritici-repentis 1425031138003
Enterococcus faecalis 13232182304605
Hyaloperonospora arabidopsidis 12770013123005
Streptococcus pyogenes 121680671903327
Xanthomonas citri 11933618388406
Vibrio cholerae 11771183203106
Beauveria bassiana 108844708224011
TOTALS 98355971675429640917452976166

*Pathogen species able to infect both plant and animal hosts.

Top species and interactions within PHI-base version 4.8 *Pathogen species able to infect both plant and animal hosts. Since 2015, there has been an emphasis on increasing the curation of pathogen gene modifications that result in a hypervirulence phenotype on the host. This has steadily risen from 112 genes (version 3.8) to 233 genes (tested in 324 interactions) (version 4.2), to 475 genes (tested in 635 interactions) (ver. 4.8). Hypervirulence phenotype interactions now account for 4.6% of all database entries and are particularly prevalent amongst bacterial pathogen entries (Table 1). This increasing number of hypervirulent interactions indicates that many additional aspects of the negative regulation of key pathogenicity processes—occurring during infection and colonization of both plant and animal hosts—have been identified. This gene set continues to warrant close monitoring in pathogen populations when attempting to explore, and then mitigate, the emergence and spread of hypervirulent pathogens associated with severe disease outbreaks (21). A second major curation effort for PHI-base has been to increase coverage of pathogen effectors (14). An effector is an entity derived from a pathogenic or non-pathogenic species, that either activates or suppresses host defences or other host responses. Interactions involving effectors have risen by 35%: from 1668 (version 4.2) to 2261 (version 4.8). This category now represents 16% of the dataset, with data derived from 83 species, mostly plant pathogens (Table 4). In total, 67% of the effector entries (1511 interactions) are from bacterial species; there is also a considerable number of entries from five obligate fungal rust or powdery mildew species, and one obligate protist species (Hyaloperonospora arabidopsidis). Based on data curated in PHI-base, the experimental method of choice for studying effector function is evaluating transient expression in a host or non-host species: transient expression tests account for 573 interactions across 28 pathogen species.
Table 4.

Summary of the pathogenic species providing the most information on effectors

Pathogen - 83 speciesInteractions
Bacteria - 40 species 1511
Ralstonia solanacearum 597
Xanthamonas oryzae 306
Pseudomonas syringae 191
Salmonella enterica 122
Xanthamonas citri 88
Fungus - 25 species 471
Pyrenophora tritici-repentis 138
Magnaporthe oryzae 84
Passalora fulva 57
Fusarium oxysporum 27
Ustilago maydis 17
Obligate fungal biotrophs - 5 species 65
Melampsora species33
Puccinia species27
Blumeria species5
Protist / 10 species 263
Hyaloperonospora arabidopsidis 123
Phytophthora sojae 51
Phytophthora capsici 38
Phytophthora infestans 29
Nematodes and insects - 3 species 10
Summary of the pathogenic species providing the most information on effectors In 2015, nine high level phenotypic terms were introduced to the curation process, to permit researchers to explore the database across a wide range of taxonomically diverse species which exhibit varied pathogenic lifestyles (16). The phenotype term ‘reduced virulence’ is the most highly represented and applies to 44% of database entries. The second most frequent term is ‘unaffected pathogenicity’, at 26%. The majority of the ‘unaffected pathogenicity’ phenotypes have been reported for plant pathogens (64%), however an increasing number (1004) are from animal pathogens (compared to 80 interactions in version 3.6, and 280 interactions in version 4.2). This change appears to have arisen primarily because, within an individual publication, the number of host species tested, or the number of pathogen genes tested has increased; also, comparative results may be included from single, double, and multiple-gene deletion mutants. The number of articles reporting entirely negative data remains small. These negative outcomes are usually presumed (by the respective authors) to indicate that the gene product does not have a functional role in the pathogenic process under investigation, or that gene redundancy exists. The high-level phenotypes for all interactions are summarized in Table 1 (for pathogen species) and Table 2 (for host species). A total of 183 PHI-base entries have been assigned the ‘lethal’ phenotype, consisting of 7 plant-infecting pathogens, 12 animal-infecting pathogens and 1 insect-infecting pathogen. The majority of lethal phenotype annotations are for fungal species, in particular Fusarium graminearum (94 entries), for which genome-wide single gene replacement studies have been completed for all predicted transcription associated proteins (22), the predicted protein kinases (the kinome) (23), protein phosphatases (the phosphatome) (24), and—most recently—the predicted plasma membrane spanning G-protein coupled receptors (25,26). In these large-scale experiments, no transformants were recovered in repeat experiments, whilst transformants were recovered for many other genes. Thus, the authors considered that the gene's function was ‘essential for life’. The human pathogen Aspergillus fumigatus has also contributed a disproportionately high number of lethal phenotype entries, with 42 of the 207 genes tested (20%) falling into this category where a targeted screen for essential genes has been initiated (27). However, amongst the 30 species with the most interactions in PHI-base (Table 3), 17 species have no ‘lethal’ category entries, whilst a further 8 species only provide 1 or 2 lethal entries. An increasing number of interactions involving human and animal pathogens are now being tested in non-vertebrate species (Table 2). In these bioassays, a wide range of insect larvae are used, including: Galleria mellonella (greater wax moth), Plutella xylostella (diamondback moth), and Bombyx mori (domestic silkworm); as well as adult insects, specifically Drosophila melanogaster (fruit fly). Other studies have used the nematode Caenorhabditis elegans (roundworm), the slime mold Dictyostelium discoideum, the free-living amoeba Acanthamoeba castellanii, or various crustaceans: such as shrimp species from the genus Artemia and Penaeus; and bivalve species, such as oysters from the genus Crassostrea. The increasing adoption of the 3Rs principle (replacement, reduction, and refinement) in place of animal models is the main contributing factor to the rising number of non-vertebrate entries (28). With increasing concerns over global food security, researchers in the international community are being encouraged to investigate host plant-pathogen interactions in crop species, rather than just model pathosystems (29). In addition, the availability of the published completed reference genome for hexaploid bread wheat (Triticum aestivum) from the International Wheat Genome Sequencing Consortium (RefSeq v1.0) (https://www.wheatgenome.org/) (30) is increasing the pace of discovery for many wheat infecting species. Table 5 shows the interaction entries involving major food and feed crops: namely wheat, rice, maize, barley, tomato, potato, and Brassica. Together, these seven host plant species provide 37% of the data in PHI-base (5096 interactions) and involve 79 pathogenic species (60% of plant pathogen species in PHI-base). In contrast, the three model species Arabidopsis thaliana, Nicotiana benthamiana, and Nicotiana tabacum provide only 5% of the data (688 interactions). The high number of 48 pathogenic species tested using N. benthamiana and N. tabacum is predominantly due to the availability of Agrobacterium-mediated transient expression assays to test the function of effector proteins.
Table 5.

Crop plant and model plant species contributions to PHI-base version 4.8

Host plantInteraction entriesNo of pathogen speciesLoss of pathogenicityReduced virulenceIncreased virulenceEffectorUnaffected pathogenicity
Crop species
Wheat1790187151315149923
Rice1371917246433366324
Maize66113663811918176
Barley463995163233169
Tomato59030561909195139
Potato1121515982024
Brassica 10912165531520
Model species
Arabidopsis359287971319844
Tobacco (N. benthamiana and N.tabacum)329477712120228
TOTALS (8 crop species) 5784102 (different species)491199312311961847
Crop plant and model plant species contributions to PHI-base version 4.8

Mapping PHI-base phenotypes to Ensembl Genomes and FungiDB

PHI-base supplies phenotypic annotation for over 100 crop-plant-infecting microbial pathogens into Ensembl Genomes (31). This contribution was initiated as part of the PhytoPath project (32). Recently, the implementation of an improved mapping pipeline developed by Ensembl has contributed to an increase in the number of genomes with PHI-base annotations by a factor of 8.7 in the total genomes of bacteria, fungi and protists compared to 2017 (De Silva et al., NAR Database issue 2020, submitted). Also, as a result of extrapolating annotations for conserved genes to closely related species, Ensembl have now applied PHI-base annotations to over 14 000 genes in over 1000 genomes. These can provide potential clues for experimental validation in other pathogens. Phenotype annotations are also provided to FungiDB (33). FungiDB release 46 (October 2019) integrates 2633 PHI-base annotations, mapping to 1636 genes for 18 FungiDB hosted genomes. In addition to pathogen–host interaction annotations, several in-vitro phenotypes including growth, sporulation and penetration defects are displayed.

Migration to reference sequence UniProt IDs

PHI-base provides links to UniProt IDs when these accessions exist in UniProt Knowledgebase (34). These links can provide further molecular protein annotation, including GO terms. However, new genomes are sequenced, and existing genomes are re-annotated. This can generate multiple gene IDs and protein IDs for the same gene, causing interoperability issues. We are currently migrating to a system where we consistently use the UniProt identifier from the reference strain as listed by UniProt, rather than IDs from alternative (non-reference) strains. PHI-base has over 15 years of curated literature, and therefore contains ∼11% legacy genes with no link to UniProt; here in most cases GenBank and EMBL records are referenced. For the genes originally curated with Uniprot IDs, ∼10% were in the meantime moved to the UniParc sequence archive. Thus, a challenge exists to frequently review and update PHI-base records, until microbial pathogen proteomes become sufficiently refined and available at UniProtKB. In the meantime, single-species community-based efforts, such as FusariumMutantDB (https://scabusa.org/FgMutantDb) (35), can effectively support PHI-base by providing mapping files for legacy gene IDs in several genome assemblies/strains to reference strain IDs available at UniProtKB. BLAST mapping of PHI-base proteins using Blast2GO software (Vers. 5.2.5) (36) using default parameters against the UniProtKB/TrEMBL (release2019_07) identified 937 sequences without GO associations. These sequences include many fungal species-specific effectors, for which currently GO terms are being created.

PHI-base BLAST tool

PHI-base has a strong focus on providing curated phenotype data, with less emphasis on providing bioinformatics tools. Excellent tools for genome browsing and sequence investigations are provided for example by Ensembl Genomes, FungiDB and other genomic resource providers (33,37). However, since 2017 we have provided an online sequence-to-phenotype BLAST search tool, called PHIB-BLAST. This allows users to map their own sequences to PHI-base accessions and the reported phenotypic outcomes are displayed in the BLAST result header, to give immediate comparisons between species. Additionally, this information is also made available for download in FASTA format, where PHI-base information is embedded in the single-line FASTA header for each protein sequence.

PHI-base usage

All of the publications citing PHI-base use are cited in the ‘about’ section of the database. Currently, 367 articles have cited PHI-base and 60% of these have been published in the past 5 years. New research investigations using PHI-base information cover multiple active fields of research, including gut microbiomes, effector discovery, diagnostic markers for the early ‘in host’ detection of pathogens and finding lethal phenotypes in human pathogens to aid the drug discovery process. For those wishing to query past versions of PHI-base, these have been made available on our ‘data’ repository on GitHub (https://github.com/PHI-base). PHI-base is accessed by users in 130 countries over six continents. Over the past 3 years PHI-base usage has remained relatively stable at between 9000–16 000 searches and >400–600 full downloads per annum.

Outreach to inspire the next generation

To help PHI-base reach a different audience, a STEM (Science, Technology, Engineering and Mathematics) outreach article was recently published highlighting the importance of big data, bioinformatics and plant pathology (https://futurumcareers.com/saving-plants-from-disease). This article was aimed at an audience of 11–19-year olds to inform and enable them to consider career options within these fields. Example case studies were taken from PHI-base and PhytopathDB. Accompanying worksheets (https://futurumcareers.com/Kim_Hammond-Kosack-activity-sheet.pdf) were provided to stimulate discussions and ideas within classrooms and beyond.

Future directions

PHI-Canto and ontologies

As reported previously (14) we have developed the multi-species web-based curation tool PHI-Canto (canto.phi-base.org). PHI-Canto is an implementation of the Canto community curation tool, developed and used by the fission yeast database PomBase (38). In addition to supporting professional biocurators, researchers will be able to directly contribute annotations from their publications to PHI-base. PHI-Canto supports the annotation of GO, phenotypes, modifications and interactions. Curation in PHI-Canto involves specifying a publication (using a PubMed ID), entering experimental pathogen and host genes (using UniProtKB IDs), creating genotypes (by listing alleles), annotating genotypes with one or more PHIPO (pathogen host interactions phenotype ontology) terms, and selecting an experimental evidence code. Pathogen–host interaction phenotypes are connected to the underlying genotypes of both the pathogen and the host (multi-species genotypes). Physical protein interactions—such as those identified in yeast two-hybrid or co-immunoprecipitation experiments—can also be curated which will be particularly useful in recording the direct interacting host targets of pathogen effectors. Recent developments to PHI-Canto include three main improvements. First, the handling of host genes and genotypes. Second, the ability to capture increasingly complex pathogen host interactions, involving incremental changes to either, or both, the pathogen and the host. Third, mechanisms to capture single species phenotypes for the pathogen and the host. PHIPO was recently registered at the Open Biological and Biomedical Ontology (OBO) Foundry (http://www.obofoundry.org/ontology/phipo.html) to promote reuse in the pathogen community. PHI-Canto will enable accurate biocuration of pathogen–host interaction data into PHI-base by the international community. With increased PHI-Canto use this will ensure PHI-base can keep up-to-date with the ever-growing number of publications and newly developed experimental techniques. Researchers interested in trialling PHI-Canto are encouraged to contact us by email (curation@phi-base.org).

Improving strain identification and disease curation

Curating strain and disease names are problematic because a wide range of synonyms exist that are inconsistently used and published in different research communities. We have developed a standardized list of strains of importance to PHI-base and currently continue to revise inconsistencies in legacy data. A list of infectious disease names from PHI-base is currently being standardized to a set of external disease ontologies for animals and plants. The revised nomenclature will be used in future PHI-base releases and within PHI-Canto.

Curation of the fungicide and anti-infective literature

PHI-base is curating publications describing the target sites of some anti-infective chemistries although this has not been a high priority (Table 1). We plan to increase the coverage of fungicide and anti-infective literature over the next 2 years. To support this work, we have curated a list of anti-infective agents, including a description of the anti-infectives’ function, and cross-references to other databases (FRAC codes, CHEBI IDs, and CAS numbers) where available. This inventory is available from our ‘data’ repository on Github (https://github.com/PHI-base). A pilot text mining study with Molecular Connections (PHI-base curation partner), will test bespoke machine learning algorithms using the anti-infectives list to identify additional papers containing potential fungicide and other anti-infective targets for future curation.

Access to PHI-base annotations in graphical displays of biological networks

KnetMiner (https://knetminer.org) is a digital research assistant with a Google-like search interface, predictive graph algorithms and interactive features to visualize biological knowledge networks (39,40). KnetMiner mines millions of relations in a genome-scale knowledge network to identify novel clues about genes, gene networks, and diseases (41,42). KnetMiner can search an integrated database of crop and model organism genomes, curated databases such as PHI-base, gene expression, gene interaction information, ontologies and the scientific literature to produce a ranked answer with evidence codes within seconds. The user can then interactively explore the auto generated knowledge network, hiding noisy or untrustworthy relations. So far knowledge networks containing PHI-base data have been developed for the cereal infecting fungal pathogens Fusarium graminearum and Zymoseptoria tritici. For plant and crop species, available networks include Arabidopsis, wheat and rice. In the future, our plans are to link knowledge networks for PHI-genes from Knetminer directly into PHI-base.

PHI-annotations into additional databases

PHI-base currently provides phenotype data to a variety of resources, including Ensembl Genomes (Protists, Bacteria and Fungi), FungiDB, Knetminer, FusariumMutantDB and GLOBI (43). Future plans include linking out to Ensembl Plants and to the thousands of fungal genomes and hundreds of protist genomes in the MycoCosm database provided by the Joint Genome Institute (JGI) (44). One of the advantages of MycoCosm is that the genomes of pathogenic and non-pathogenic species are displayed and queried via the use of a navigation tree which assists users with minimal knowledge of taxonomic relationships and groups. We will supply Gene Ontology annotations to GO, from where they will be distributed to other resources, including UniProtKB. PHI-base curators are working closely with the manual curation team of UniProtKB/Swiss-Prot to ensure that gene names and strains are consistent between entries, and we will explore mechanisms to share phenotype annotations with UniProtKB.

DATA AVAILABILITY

PHI-base 4: www.phi-base.org PHI-base GitHub page: https://github.com/PHI-base PHIB-BLAST: http://phi-blast.phi-base.org PHI-wiki page: https://en.wikipedia.org/wiki/PHI-base PHI-Canto (multi-species community annotation tool): https://canto.phi-base.org/ Linked resource - Ensembl genomes: http://ensemblgenomes.org Linked resource - FungiDB: https://fungidb.org/fungidb/ Linked resource - Pombase: https://www.pombase.org/ Linked resource - KnetMiner: https://knetminer.org Linked resource - ELIXIR UK: https://elixiruknode.org/
  40 in total

1.  Functional analysis of the Fusarium graminearum phosphatome.

Authors:  Yingzi Yun; Zunyong Liu; Yanni Yin; Jinhua Jiang; Yun Chen; Jin-Rong Xu; Zhonghua Ma
Journal:  New Phytol       Date:  2015-03-10       Impact factor: 10.151

Review 2.  Pivoting the plant immune system from dissection to deployment.

Authors:  Jeffery L Dangl; Diana M Horvath; Brian J Staskawicz
Journal:  Science       Date:  2013-08-16       Impact factor: 47.728

Review 3.  Emerging fungal threats to animal, plant and ecosystem health.

Authors:  Matthew C Fisher; Daniel A Henk; Cheryl J Briggs; John S Brownstein; Lawrence C Madoff; Sarah L McCraw; Sarah J Gurr
Journal:  Nature       Date:  2012-04-11       Impact factor: 49.962

4.  An expanded subfamily of G-protein-coupled receptor genes in Fusarium graminearum required for wheat infection.

Authors:  Cong Jiang; Shulin Cao; Zeyi Wang; Huaijian Xu; Jie Liang; Huiquan Liu; Guanghui Wang; Mingyu Ding; Qinhu Wang; Chen Gong; Chanjing Feng; Chaofeng Hao; Jin-Rong Xu
Journal:  Nat Microbiol       Date:  2019-06-03       Impact factor: 17.745

5.  Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species.

Authors:  Paul Julian Kersey; James E Allen; Alexis Allot; Matthieu Barba; Sanjay Boddu; Bruce J Bolt; Denise Carvalho-Silva; Mikkel Christensen; Paul Davis; Christoph Grabmueller; Navin Kumar; Zicheng Liu; Thomas Maurel; Ben Moore; Mark D McDowall; Uma Maheswari; Guy Naamati; Victoria Newman; Chuang Kee Ong; Michael Paulini; Helder Pedro; Emily Perry; Matthew Russell; Helen Sparrow; Electra Tapanari; Kieron Taylor; Alessandro Vullo; Gareth Williams; Amonida Zadissia; Andrew Olson; Joshua Stein; Sharon Wei; Marcela Tello-Ruiz; Doreen Ware; Aurelien Luciani; Simon Potter; Robert D Finn; Martin Urban; Kim E Hammond-Kosack; Dan M Bolser; Nishadi De Silva; Kevin L Howe; Nicholas Langridge; Gareth Maslen; Daniel Michael Staines; Andrew Yates
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

6.  FungiDB: An Integrated Bioinformatic Resource for Fungi and Oomycetes.

Authors:  Evelina Y Basenko; Jane A Pulman; Achchuthan Shanmugasundram; Omar S Harb; Kathryn Crouch; David Starns; Susanne Warrenfeltz; Cristina Aurrecoechea; Christian J Stoeckert; Jessica C Kissinger; David S Roos; Christiane Hertz-Fowler
Journal:  J Fungi (Basel)       Date:  2018-03-20

7.  UniProt: a worldwide hub of protein knowledge.

Authors: 
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

8.  KnetMaps: a BioJS component to visualize biological knowledge networks.

Authors:  Ajit Singh; Christopher J Rawlings; Keywan Hassani-Pak
Journal:  F1000Res       Date:  2018-10-17

9.  Non-canonical fungal G-protein coupled receptors promote Fusarium head blight on wheat.

Authors:  Tess Dilks; Kirstie Halsey; Rebecca P De Vos; Kim E Hammond-Kosack; Neil Andrew Brown
Journal:  PLoS Pathog       Date:  2019-04-01       Impact factor: 6.823

10.  Strategic focus on 3R principles reveals major reductions in the use of animals in pharmaceutical toxicity testing.

Authors:  Elin Törnqvist; Anita Annas; Britta Granath; Elisabeth Jalkesten; Ian Cotgreave; Mattias Öberg
Journal:  PLoS One       Date:  2014-07-23       Impact factor: 3.240

View more
  67 in total

1.  Affinibrenneria salicis gen. nov. sp. nov. isolated from Salix matsudana bark canker.

Authors:  Dan-Ran Bian; Han Xue; Guang-Ming Wang; Chun-Gen Piao; Yong Li
Journal:  Arch Microbiol       Date:  2021-04-26       Impact factor: 2.552

2.  Computational identification of putative copper-binding proteins in pomegranate bacterial blight pathogen Xanthomonas citri pv. punicae.

Authors:  K Dineshkumar; Ginny Antony
Journal:  Arch Microbiol       Date:  2022-06-04       Impact factor: 2.552

3.  Advenella mandrilli sp. nov., a bacterium isolated from the faeces of Mandrillus sphinx.

Authors:  Qiong Wang; Xiu-Lin Han; Zhi-Qin Fang; Chen-Lu Zhang; Chun Li; Tao Lu
Journal:  Antonie Van Leeuwenhoek       Date:  2022-01-15       Impact factor: 2.271

4.  Genomic Comparisons of Two Armillaria Species with Different Ecological Behaviors and Their Associated Soil Microbial Communities.

Authors:  Jorge R Ibarra Caballero; Bradley M Lalande; John W Hanna; Ned B Klopfenstein; Mee-Sook Kim; Jane E Stewart
Journal:  Microb Ecol       Date:  2022-03-21       Impact factor: 4.552

5.  Aspergillus fumigatus pan-genome analysis identifies genetic variants associated with human infection.

Authors:  Amelia E Barber; Tongta Sae-Ong; Kang Kang; Bastian Seelbinder; Jun Li; Grit Walther; Gianni Panagiotou; Oliver Kurzai
Journal:  Nat Microbiol       Date:  2021-11-24       Impact factor: 17.745

6.  Global diversity and distribution of prophages are lineage-specific within the Ralstonia solanacearum species complex.

Authors:  Samuel T E Greenrod; Martina Stoycheva; John Elphinstone; Ville-Petri Friman
Journal:  BMC Genomics       Date:  2022-10-06       Impact factor: 4.547

7.  A highly contiguous reference genome assembly for Colletotrichum falcatum pathotype Cf08 causing red rot disease in sugarcane.

Authors:  Amaresh Chandra; Dinesh Singh; Deeksha Joshi; Ashwini D Pathak; Ram K Singh; Sanjeev Kumar
Journal:  3 Biotech       Date:  2021-02-27       Impact factor: 2.406

8.  An evolutionary genomic approach reveals both conserved and species-specific genetic elements related to human disease in closely related Aspergillus fungi.

Authors:  Matthew E Mead; Jacob L Steenwyk; Lilian P Silva; Patrícia A de Castro; Nauman Saeed; Falk Hillmann; Gustavo H Goldman; Antonis Rokas
Journal:  Genetics       Date:  2021-06-24       Impact factor: 4.562

9.  Chromosome-level genome assembly and manually-curated proteome of model necrotroph Parastagonospora nodorum Sn15 reveals a genome-wide trove of candidate effector homologs, and redundancy of virulence-related functions within an accessory chromosome.

Authors:  Stefania Bertazzoni; Darcy A B Jones; Huyen T Phan; Kar-Chun Tan; James K Hane
Journal:  BMC Genomics       Date:  2021-05-25       Impact factor: 3.969

10.  Comparative Genomics of Eight Fusarium graminearum Strains with Contrasting Aggressiveness Reveals an Expanded Open Pangenome and Extended Effector Content Signatures.

Authors:  Tarek Alouane; Hélène Rimbert; Jörg Bormann; Gisela A González-Montiel; Sandra Loesgen; Wilhelm Schäfer; Michael Freitag; Thierry Langin; Ludovic Bonhomme
Journal:  Int J Mol Sci       Date:  2021-06-10       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.