Literature DB >> 25217587

VirusMentha: a new resource for virus-host protein interactions.

Alberto Calderone1, Luana Licata2, Gianni Cesareni3.   

Abstract

Viral infections often cause diseases by perturbing several cellular processes in the infected host. Viral proteins target host proteins and either form new complexes or modulate the formation of functional host complexes. Describing and understanding the perturbation of the host interactome following viral infection is essential for basic virology and for the development of antiviral therapies. In order to provide a general overview of such interactions, a few years ago we developed VirusMINT. We have now extended the scope and coverage of VirusMINT and established VirusMentha, a new virus-virus and virus-host interaction resource build on the detailed curation protocols of the IMEx consortium and on the integration strategies developed for mentha. VirusMentha is regularly and automatically updated every week by capturing, via the PSICQUIC protocol, interactions curated by five different databases that are part of the IMEx consortium. VirusMentha can be freely browsed at http://virusmentha.uniroma2.it/ and its complete data set is available for download.
© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25217587      PMCID: PMC4384001          DOI: 10.1093/nar/gku830

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Viruses exploit the molecular machinery of the infected host to support their own life cycle and target host defense mechanisms to escape host resistance. This is achieved by establishing a virus-specific protein interaction network that perturbs cell processes, such as DNA replication, gene expression, growth and differentiation (1,2). A prerequisite for a complete understanding of a virus life cycle is a proteome-wide description of the complexes formed by viral proteins, either on their own or in combination with host proteins. Over the past decades, a growing number of small- or large-scale studies reported evidence for interactions between viral and host proteins. Although this information can in principle be extracted from generic protein interaction databases, 5 years ago we developed VirusMINT (3), a protein-protein interaction (PPI) database focused on viral interactions. This offered two advantages: an increased focus of the MINT (4) curation team on the curation of viral interactions and the development of a dedicated website that could archive information useful for the community of experimental virologists. VirusMINT only offered data curated by the MINT curation team. VirHostNet (5), a second resource dedicated to virus interactions, adopted the strategy of merging various data sources. This database contains re-curated data extracted from primary resources that use different curation strategies. In addition, VirHostNet complements this information with newly curated interactions. This approach has generated a network consisting of nearly 5000 non-redundant interactions between viral and host proteins. However, the integration strategy adopted by this resource implies substantial work making updates infrequent. As a consequence, the VirHostNet data set has not been updated since its publication. As technologies to identify protein interactions evolve, generic or specialized journals regularly report new viral host protein interaction information (6,7). This abundance of data demands for a strategy to capture and integrate this information regularly, with minimal hands-on effort. In order to automate data merging, we recently implemented a resource called mentha that automatically integrates the content of five different PPI databases (8). By exploiting a similar automatic procedure we created VirusMentha, a resource that builds on the work started in VirusMINT by offering the curated data from our group combined with the data captured from the other IMEx (9) partners. Every week, VirusMentha integrates the virus–host interactions curated by the IMEx databases removing redundancy. Differently from other valuable resources, VirusMentha is not exclusively focused on specific organisms or viruses, such as HCVpro (http://cbrc.kaust.edu.sa/hcvpro/) (10) or HIV host-interaction map (http://www.ncbi.nlm.nih.gov/projects/RefSeq/HIVInteractions/) (11). VirusMentha captures all published virus–host interactions without limitation with respect to virus strain or to host organism. VirusMentha currently contains interactions for 24 viral families. It also offers interactions between viral proteins and different host organisms, such as Homo sapiens, Mus musculus, Arabidopsis thaliana and so on.

IMPLEMENTATION

Molecular interaction evidence is reported in the scientific literature in natural language format, thus making retrieval and processing a difficult task (12,13). The data contained in VirusMINT is manually curated by our curation team in compliance with IMEx standards. In order to further expand the coverage of the virus–virus and virus–host interactome, we decided to integrate our data with data curated and stored in the databases of the IMEx partners. IMEx is an international consortium whose purpose is to make protein interaction curation more efficient by distributing the curation load and by limiting the duplication of effort. IMEx standards represent a seal of quality among the several PPIs repositories (14). The databases that we chose to integrate are MINT, IntAct (15), DIP (16), MatrixDB (17) and BioGRID (18). VirHostNet data were not imported because the database does not fully conform to PSI-MI standards and does not provide enough experimental details. The procedure used for data merging is the same as the one developed for mentha. mentha retrieves data from external resources using the PSICQUIC (Proteomics Standard Initiative Common QUery InterfaCe) protocol (19) which provides standard programmatic access to molecular interaction repositories. Using the ontology trees defined by IMEx, the procedure identifies whether two pieces of evidence are actually the same piece of evidence described at different levels of curation depth. VirusMentha exploits the same procedure and offers direct access to evidence of viral protein interactions. The mentha procedure detects redundancy using the PSI-MI TAB file provided by the five source databases listed above. The key information to identify redundancy are supported by every database of the IMEx consortium, i.e. interaction type, experimental method, publication ID. However, these databases do not offer the same version of PSI-MI TAB file. MINT and IntAct are currently the two major databases in the IMEx consortium that offer PSI-MI TAB 2.7 format defined in PSICQUIC. The PSI-MI TAB file format has been updated to version 2.7 to allow curators to add extra information, such as for instance the polypeptide fragment (of viral polyproteins) involved in the interaction. In order to preserve the richness of such format, VirusMentha implements a detailed view if the evidence is curated by databases supporting PSI-MI TAB 2.7.

TAXONOMY ORGANIZATION

VirusMentha organizes the interactions according to viral families (Figure 1). The classification is based on the Baltimore classification which is based on the nucleic acid content of the virion (20,21). This classification is generally used in conjunction with the ICTV (The International Committee on Taxonomy of Viruses) classification system in order to create a tree of viral species (22).
Figure 1.

Mapping strategy. This figure shows the strategy used by VirusMentha to group all viruses into families. On the right, the diagram shows the strain annotated in the experimental evidence. The taxonomy tree is used to map back to the viral family, highlighted by red ovals.

Mapping strategy. This figure shows the strategy used by VirusMentha to group all viruses into families. On the right, the diagram shows the strain annotated in the experimental evidence. The taxonomy tree is used to map back to the viral family, highlighted by red ovals. In order to give access to this hierarchy of viral strains and families, we have reconstructed the taxonomy tree of viruses using the classification information provided by UniProt (23). In this taxonomy tree, as show in Figure 1, the first level is defined as per the Baltimore classification while families are defined according to the ICTV classification system. In order to handle this taxonomic hierarchy, the retrieval and integration procedure of VirusMentha analyses each interaction and for each viral protein in the interaction it extracts its taxonomy ID (organism). The mapping procedure is illustrated in Figure 1. Using the taxonomy tree, each virus strain (taxonomy number) is mapped to the most general viral family, without dropping the original strain identifier.

DATA GROWTH AND STATISTICS

Over the past years, the MINT curation team has curated a large number of articles using a detailed curation policy. The integration of VirusMINT data with other databases on a weekly basis resulted in the most comprehensive and self-updating resource available so far. On August 2014, VirusMentha contained up to 8084 non-redundant interactions supported by 8450 publications (Figure 2).
Figure 2.

Comparison of VirusMINT and VirusMentha. (A and B) Venn diagrams to show the relative contribution to the VirusMentha data set of the curation effort of VirusMINT and that of the others IMEx databases. (B) The pie chart represents the number of interactions of the main virus families as annotated in VirusMentha.

Comparison of VirusMINT and VirusMentha. (A and B) Venn diagrams to show the relative contribution to the VirusMentha data set of the curation effort of VirusMINT and that of the others IMEx databases. (B) The pie chart represents the number of interactions of the main virus families as annotated in VirusMentha.

WEB INTERFACE

Searches in the VirusMentha data set can be carried out directly from the home page by entering in the search field UniProt IDs, gene names, polypeptide names, keywords or single PMID (PubMed publication ID). To offer direct access to the VirusMentha data set we implemented the possibility of searching and downloading viral interaction evidence from a virus or host perspective. The user can decide to search the entire database or to restrict the search to specific organisms or viral families. The search results are presented as a list of proteins whose gene name, UniProt ID, description or PMID matches the query. The user can select the proteins of interest by adding them to the ‘protein bag’. All the interactions related to the selected proteins can be browsed either as a list in a table format or in a network view by starting a graphical applet. The user who wants to compare the same protein from different isolates can use the ‘Align’ button, to see their similarities; the alignment is performed on the fly using BLOSUM62 as substitution matrix, with the following penalties: open: 10; extend: 0.5. Experimental details linked to each interaction can be accessed by clicking the ‘show evidence’ button. Together with the link to the original paper, the interaction detection method and the interaction type of the experiment are displayed. Moreover, by clicking to the magnifier icon, a pop-up window opens showing the curation details as per PSI-MI TAB 2.7. In the binary interaction result page it is possible to filter relevant interactions using keywords. For instance, the user can search a specific human protein that interacts with several types of Herpes simplex virus strains and, using the field provided, restrict the list only to the ‘Patton’ strain. The VirusMentha interface has been developed to facilitate the assembly and modification of networks of interactions. The graphical applet displays viral proteins in cyan. The networks built by the user through various searches and/or network manipulations can be downloaded as a plain text file. Exported networks always report strain number and family number to facilitate clustering by family. Finally, for each protein VirusMentha reports the associated Gene Ontology terms (24), links to antibodies through Antibodypedia (25), cleaved polypeptides and, for each interaction, the MINT score and links to papers supporting the interaction are reported (Figure 3).
Figure 3.

VirusMentha web interface. (A) VirusMentha search page. By typing the gene name GAGPOL the user is presented with a list a GAG-POL genes from several HIV isolates. Each gene is accompanied with essential information extracted from UniProt including the polypeptide fragments processed inside the virion. (B) If the user loads one of these proteins into the ‘protein bag’ and clicks ‘list’, all binary interactions are presented in a table-like view, together with details about papers supporting the interaction. (C) Alternatively, the collected proteins can be browsed by displaying the interactions with a graphic applet.

VirusMentha web interface. (A) VirusMentha search page. By typing the gene name GAGPOL the user is presented with a list a GAG-POL genes from several HIV isolates. Each gene is accompanied with essential information extracted from UniProt including the polypeptide fragments processed inside the virion. (B) If the user loads one of these proteins into the ‘protein bag’ and clicks ‘list’, all binary interactions are presented in a table-like view, together with details about papers supporting the interaction. (C) Alternatively, the collected proteins can be browsed by displaying the interactions with a graphic applet.

CONCLUSION AND PERSPECTIVES

VirusMint was developed as a public repository to capture and organize manually curated information about interactions between viral and host proteins. VirusMentha extends this concept to offer the most comprehensive and self-updating resource for viral interactions available so far. In addition, VirusMentha exploits the curation potential of five different database curation teams by integrating data curated by databases adhering to the IMEx consortium. In addition, VirusMentha continues to benefit from the VirusMINT curation team that regularly captures data published in peer-reviewed scientific journals. Finally, VirusMentha has the advantage of being regularly and automatically updated every week. According to Google Scholar, VirusMINT has been cited in ∼100 papers (99 in August 2014) testifying the interest of the community of virologists for such a dedicated protein interaction repository. As more relevant information is reported in the scientific literature, we expect the higher coverage and curation depth that characterize VirusMentha to become a valuable resource for more comprehensive analysis of viral mechanisms and interactions.
  24 in total

1.  Antibodypedia, a portal for sharing antibody and antigen validation data.

Authors:  Erik Björling; Mathias Uhlén
Journal:  Mol Cell Proteomics       Date:  2008-07-29       Impact factor: 5.911

2.  HIV-host interactions: a map of viral perturbation of the host system.

Authors:  John W Pinney; Jonathan E Dickerson; William Fu; Brigitte E Sanders-Beer; Roger G Ptak; David L Robertson
Journal:  AIDS       Date:  2009-03-13       Impact factor: 4.177

Review 3.  RNA-directed DNA synthesis and RNA tumor viruses.

Authors:  H M Temin; D Baltimore
Journal:  Adv Virus Res       Date:  1972       Impact factor: 9.937

4.  Unscrambling hepatitis C virus-host interactions.

Authors:  Francis V Chisari
Journal:  Nature       Date:  2005-08-18       Impact factor: 49.962

5.  Activities at the Universal Protein Resource (UniProt).

Authors: 
Journal:  Nucleic Acids Res       Date:  2013-11-18       Impact factor: 16.971

6.  VirusMINT: a viral protein interaction database.

Authors:  Andrew Chatr-aryamontri; Arnaud Ceol; Daniele Peluso; Aurelio Nardozza; Simona Panni; Francesca Sacco; Michele Tinti; Alex Smolyar; Luisa Castagnoli; Marc Vidal; Michael E Cusick; Gianni Cesareni
Journal:  Nucleic Acids Res       Date:  2008-10-30       Impact factor: 16.971

7.  VirHostNet: a knowledge base for the management and the analysis of proteome-wide virus-host interaction networks.

Authors:  Vincent Navratil; Benoît de Chassey; Laurène Meyniel; Stéphane Delmotte; Christian Gautier; Patrice André; Vincent Lotteau; Chantal Rabourdin-Combe
Journal:  Nucleic Acids Res       Date:  2008-11-04       Impact factor: 16.971

8.  Antiviral effect and virus-host interactions in response to alpha interferon, gamma interferon, poly(i)-poly(c), tumor necrosis factor alpha, and ribavirin in hepatitis C virus subgenomic replicons.

Authors:  Robert E Lanford; Bernadette Guerra; Helen Lee; Devron R Averett; Brad Pfeiffer; Deborah Chavez; Lena Notvall; Catherine Bigger
Journal:  J Virol       Date:  2003-01       Impact factor: 5.103

9.  Protein-protein interaction databases: keeping up with growing interactomes.

Authors:  Benjamin Lehne; Thomas Schlitt
Journal:  Hum Genomics       Date:  2009-04       Impact factor: 4.639

View more
  53 in total

1.  Investigating genetic-and-epigenetic networks, and the cellular mechanisms occurring in Epstein-Barr virus-infected human B lymphocytes via big data mining and genome-wide two-sided NGS data identification.

Authors:  Cheng-Wei Li; Bo-Ren Jheng; Bor-Sen Chen
Journal:  PLoS One       Date:  2018-08-22       Impact factor: 3.240

2.  Human cytomegalovirus interactome analysis identifies degradation hubs, domain associations and viral protein functions.

Authors:  Luis V Nobre; Katie Nightingale; Benjamin J Ravenhill; Robin Antrobus; Lior Soday; Jenna Nichols; James A Davies; Sepehr Seirafian; Eddie Cy Wang; Andrew J Davison; Gavin Wg Wilkinson; Richard J Stanton; Edward L Huttlin; Michael P Weekes
Journal:  Elife       Date:  2019-12-24       Impact factor: 8.140

3.  Prediction of MicroRNAs in the Epstein-Barr Virus Reveals Potential Targets for the Viral Self-Regulation.

Authors:  Victor Serrano-Solis; Angelica Cardoso Carlos; Vinicius Maracaja-Coutinho; Sávio Torres de Farias
Journal:  Indian J Microbiol       Date:  2018-12-28       Impact factor: 2.461

4.  Bioinformatics-Based Approaches to Study Virus-Host Interactions During SARS-CoV-2 Infection.

Authors:  Muhammad Saad Khan; Qudsia Yousafi; Shabana Bibi; Muhammad Azhar; Awais Ihsan
Journal:  Methods Mol Biol       Date:  2022

5.  LGCA-VHPPI: A local-global residue context aware viral-host protein-protein interaction predictor.

Authors:  Muhammad Nabeel Asim; Muhammad Ali Ibrahim; Muhammad Imran Malik; Andreas Dengel; Sheraz Ahmed
Journal:  PLoS One       Date:  2022-07-05       Impact factor: 3.752

6.  Understanding of Zaire ebolavirus-human protein interaction for drug repurposing.

Authors:  Mathavan Muthaiyan; Shilpa Sri Pushan; Leimarembi Devi Naorem; Amouda Venkatesan
Journal:  Virusdisease       Date:  2020-03-07

7.  Interface-Based Structural Prediction of Novel Host-Pathogen Interactions.

Authors:  Emine Guven-Maiorov; Chung-Jung Tsai; Buyong Ma; Ruth Nussinov
Journal:  Methods Mol Biol       Date:  2019

8.  Data science in unveiling COVID-19 pathogenesis and diagnosis: evolutionary origin to drug repurposing.

Authors:  Jayanta Kumar Das; Giuseppe Tradigo; Pierangelo Veltri; Pietro H Guzzi; Swarup Roy
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

9.  VirHostNet 2.0: surfing on the web of virus/host molecular interactions data.

Authors:  Thibaut Guirimand; Stéphane Delmotte; Vincent Navratil
Journal:  Nucleic Acids Res       Date:  2014-11-11       Impact factor: 16.971

10.  Shared resources, shared costs--leveraging biocuration resources.

Authors:  Sandra Orchard; Henning Hermjakob
Journal:  Database (Oxford)       Date:  2015-03-16       Impact factor: 3.451

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.