| Literature DB >> 17663773 |
Zuoshuang Xiang1, Yuying Tian, Yongqun He.
Abstract
The Pathogen-Host Interaction Data Integration and Analysis System (PHIDIAS) is a web-based database system that serves as a centralized source to search, compare, and analyze integrated genome sequences, conserved domains, and gene expression data related to pathogen-host interactions (PHIs) for pathogen species designated as high priority agents for public health and biological security. In addition, PHIDIAS allows submission, search and analysis of PHI genes and molecular networks curated from peer-reviewed literature. PHIDIAS is publicly available at http://www.phidias.us.Entities:
Mesh:
Year: 2007 PMID: 17663773 PMCID: PMC2323235 DOI: 10.1186/gb-2007-8-7-r150
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Forty-two pathogens included in PHIDIAS
| Pathogens (disease) | CDC/NIAID category | No. of genomes | Phinfo | Pacodom | Phinet | |
| 1 | A/A | 3 | √ | 4,588 | √ | |
| 2 | B/B | 4 | √ | 4,267 | √ | |
| 3 | B/B | 1 | √ | 4,679 | √ | |
| 4 | B/B | 2 | √ | 5,093 | ||
| 5 | /B | 2 | 3,235 | |||
| 6 | A/A | 0 | √ | N/A | √ | |
| 7 | B/B | 1 | 3,770 | |||
| 8 | B/B | 1 | √ | 3,032 | √ | |
| 9 | B/B | 6 | √ | 5,440 | √ | |
| 10 | A/A | 2 | √ | 3,057 | √ | |
| 11 | 5 | 3,374 | ||||
| 12 | 3 | 3,974 | ||||
| 13 | /B | 2 | 3,999 | |||
| 14 | /C | 2 | √ | 3,991 | ||
| 15 | /C | 1 | √ | 2,129 | √ | |
| 16 | /C | 0 | √ | N/A | √ | |
| 17 | B/B | 4 | √ | 5,150 | √ | |
| 18 | B/B | 5 | √ | 5,211 | √ | |
| 19 | B/B | 5 | 5,449 | |||
| 20 | A/A | 5 | √ | 4,828 | √ | |
| 21 | Crimean-Congo hemorrhagic fever virus (tickborne hemorrhagic fever) | C/C | 1 | √ | 4 | √ |
| 22 | Eastern equine encephalitis virus (encephalitis) | B/B | 0 | √ | N/A | √ |
| 23 | Foot-and-mouth disease virus (foot-and-mouth disease) | 7 | √ | 3 | ||
| 24 | Guanarito virus (viral hemorrhagic fever) | A/A | 1 | √ | 0 | √ |
| 25 | Human immunodeficiency virus (AIDS) | 2 | √ | 8 | ||
| 26 | Junin virus (viral hemorrhagic fever) | A/A | 1 | √ | 0 | √ |
| 27 | Lassa virus (viral hemorrhagic fever) | A/A | 1 | √ | 0 | √ |
| 28 | Louping ill virus (encephalomyelitis) | 1 | √ | 6 | √ | |
| 29 | Machupo virus (viral hemorrhagic fever) | A/A | 1 | √ | 0 | √ |
| 30 | Marburg virus (viral hemorrhagic fever) | A/A | 1 | √ | N/A | √ |
| 31 | Measles virus (measles) | 1 | √ | 0 | √ | |
| 32 | Newcastle Disease Virus (Newcastle disease) | 0 | √ | N/A | ||
| 33 | Powassan virus (encephalitis) | 0 | √ | N/A | √ | |
| 34 | Reston ebola virus (viral hemorrhagic fever) | A/A | 1 | √ | 1 | √ |
| 35 | Rift Valley fever virus (Rift Valley fever) | /A | 1 | √ | 3 | √ |
| 36 | Variola virus (smallpox) | A/A | 2 | √ | 129 | |
| 37 | Venezuelan equine encephalitis virus (viral encephalitis) | B/B | 1 | √ | 8 | √ |
| 38 | Yellow fever virus (yellow fever) | /C | 1 | √ | 5 | √ |
| 39 | B/B | 0 | √ | N/A | ||
| 40 | 0 | √ | N/A | |||
| 41 | 0 | √ | N/A | √ | ||
| 42 | 0 | √ | N/A | |||
| Total (42 pathogens) | 77 | 37 | 75,433 | 27 | ||
The program includes 20 bacteria (54 genomes), 18 viruses (23 genomes), and 4 parasites. The database contains 75,433 conserved domains (7,919 unique PSSMs) and PHI network information for 27 pathogens.
Figure 1PHIDIAS data flow. (a) The PHIDIAS system architecture. (b) PhiDB data flow among key elements of different PhiDB database modules. The relationships among these elements are represented by the following signs: *, zero or more; 1, one; and 2...*, two or more. For example, the labeling of a pathway with '1' and '2...*' indicates that one pathway includes two or more interactions.
Public databases and software programs integrated in PHIDIAS
| Resources | Databases and analysis programs | Comments |
| NCBI | RefSeq | Reference sequences |
| Genome | Genome summary | |
| Gene | Gene information | |
| Protein | Protein information | |
| Nucleotide | Nucleotide information | |
| CDD | Conserved domains | |
| COGs | Clusters of orthologous groups | |
| Taxonomy | ||
| PubMed | Biomedical publications | |
| GEO | Gene expression database | |
| EBI and SIB | ArrayExpress | Gene expression database |
| Swissprot | Annotated protein data | |
| TrEMBL | Protein data | |
| InterPro | Protein families, domains and functions | |
| PROSITE | Protein families and domains | |
| VBI | PathInfo | PIML documents via web service |
| MiNet | MiNetML documents via web service | |
| TIGR | CMR | Comprehensive microbial resource |
| TIGRfam | TIGRfam assignments | |
| GO | Gene ontology | |
| KEGG | Pathways | |
| BioCyc | Biological pathways | |
| PFam | Protein domains and families | |
| ProDom | Protein domain families | |
| PDB | Protein database | |
| University of Michigan | BBP | |
| HazARD | Hazards in animal research database | |
| NCBI | BLAST | Blastn, blastp, blastx, tblastn, tblastx, PSI/PHI Blast, Mega Blast, Blast 2 sequences |
| GMOD | GBrowse | Genome browsing and analysis |
| BioPerl | Programming tools | |
| BioPAX | Biological pathway data exchange format |
CMR, TIGR Comprehensive Microbial Resource; GO, Gene Ontology; MeSH, Medical Subject Headings; PDB, Protein Data Bank.
Figure 2Comparison and analyses of sodC genes in the PGBrowser. Thirty two sodC genes are found in 32 genomes from 11 bacteria species (a), including sodC from B. abortus strain 9-941 (b).
Figure 3Integrative pathogen gene information in PHIDIAS.
Figure 4Example of Pacodom applications. (a) Pacodom search of 'phagocytosis'. (b) There are 42 Nramp protein matches from 42 pathogen genomes of 15 microbial species available in Pacodom.
Figure 5PhiDB Topic Search. The PhiDB Topic Search web interface is shown on the left and a comparison of immunoassays for diagnosis of B. melitensis and B. anthracis is shown on the right.
Figure 6Gene search web interface in Phigen.
Figure 7Visualization of an E. coli pathogenesis network in Phinet. A click on each node provides detailed information about a biological object in the bottom frame. When a mouse cursor moves over a node, a brief description of the biological object will appear. An interaction between biological objects is represented by a centered gray ball and arrows between nodes. Once the centered gray ball is clicked, details about the specific interaction appear in the bottom frame. Subcellular locations of biological objects are differentiated by the node border colors. The biological object types (for example, protein or gene) are represented by a combination of the node background colors and shapes. The program also displays different interactions, such as inhibition (solid T sign), activation (solid arrow), and indirect effects (dashed line).