| Literature DB >> 31032837 |
Bo Su1, Zhuang Qian1, Tianshu Li1, Yuwei Zhou1, Aloysius Wong2.
Abstract
Moonlighting proteins are single polypeptide chains capable of executing two or more distinct biochemical and/or biological functions. Here, we describe the development of PlantMP, which is a manually curated online-based database of plant proteins that are known to `moonlight'. The database contains searchable UniProt IDs and names, canonical and moonlighting functions, gene ontology numbers, plant species as well as links to the PubMed indexed articles. Proteins homologous to experimentally confirmed moonlighting proteins from the model plant Arabidopsis thaliana are provided as a separate list of `likely moonlighters'. Additionally, we also provide a list of predicted Arabidopsis moonlighting proteins reported in the literature. Currently, PlantMP contains 110 plant moonlighting proteins, 10 `likely moonlighters' and 27 `predicted moonlighters'. Organizing plant moonlighting proteins in one platform enables researchers to conveniently harvest plant-specific raw and processed data such as the molecular functions, biological roles and structural features essential for hypothesis formulation in basic research and for biotechnological innovations.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31032837 PMCID: PMC6482322 DOI: 10.1093/database/baz050
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1PlantMP database and interface. Information extracted from various databases and articles was manually curated and formatted offline on Excel before importing into MySQL database in the following format: protein name, gene uniprot_id, canonical_function_bp, canonical_function_mf, moonlighting_function_bp, moonlighting_function_mf, go_canonical_bp, go_canonical_mf, go_moonlighting_bp, go_moonlighting_mf, plant_species, pubmed_id, type. `Type’ indicates if the moonlighting function of the protein is predicted, likely or confirmed based on the literature. Database website structure was designed and developed using the front-end and back-end separation, which has the benefit of low-degree coupling. The website front-end will query the database when user invoked either the search function by UniProt ID, protein name or plant species or by selecting one of the four categories provided. At the back-end, data is provided using Restful Web API with JSON format. For example, all moonlighting proteins from A. thaliana can be retrieved by querying the URL https://www.plantmp.com/api/species?species=Arabidopsis in the web browser. For website development, an online web template provided by Colorlib (http://www.cssmoban.com/cssthemes/8065.shtml) was adopted and modulated with feature module for database query. Front-end only requires the setting up of the user query input as a parameter to specify API URL and then displayed in appropriate table HTML format. Users have the option to export all or selected results into Excel format.
Figure 2Selection of moonlighting plant proteins for inclusion in PlantMP database. Proteins with experimentally validated moonlighting functions retrieved from PubMed and other databases such as MoonProt (http://moonlightingproteins.org) (10) and MultiTaskProtDB (http://wallace.uab.es/multitaskII) (11) were included in the list of `confirmed moonlighters’ (indicated as green arrows). Homologs of confirmed moonlighters from A. thaliana were included in the list of `likely moonlighters’ (indicated as blue arrows), while reported candidate GC and AC (which are typically moonlighting in multi-domain plant proteins) from A. thaliana (13) were included in the list of `predicted moonlighters’ (indicated as orange arrows). Both `likely’ and `predicted’ moonlighters were only included if they have known primary functions and harbor domains or motifs that could hint moonlighting functions based on UniProt annotations (17). The candidate GCs and ACs were retrieved from a PatMatch search on TAIR (https://www.arabidopsis.org/cgi-bin/patmatch/nph-patmatch.pl) (28) using search motifs **reported by (13, 20).
Figure 3Layout of the PlantMP web server. Screenshot of the home page (A) beginning with a text search box that allows experienced users to quickly query and retrieve proteins of interest from three search options (protein name, UniProt ID and plant species) and then followed by a 3D model of a representative moonlighting protein (BRI1, UniProt ID: O22476) (14) docked with ATP and GTP and accompanied by clearly labeled primary and moonlighting catalytic activities to illustrate the function of the database. An introduction section that contains a brief description of moonlighting proteins and the database is provided to aid first time users navigate and extract information from the database. Following the Introduction is the `Browse Database’ section (B) where users can select from one of four lists of proteins: (1) All categories, (2) Confirmed moonlighters, (3) Likely moonlighters and (4) Predicted moonlighters. In the result page (C), moonlighting proteins in the PlantMP database contain UniProt IDs and names, canonical and moonlighting functions, GO numbers, plant species and links associated to the PubMed articles. Included in the canonical and moonlighting functions are their MF and BP extracted from UniProt (17). The website allows users to select desired proteins and export their results as Excel.