Literature DB >> 33216895

GPCR-PEnDB: a database of protein sequences and derived features to facilitate prediction and classification of G protein-coupled receptors.

Khodeza Begum1,2, Jonathon E Mohl2,3,4, Fredrick Ayivor1, Eder E Perez4, Ming-Ying Leung1,2,3,4.   

Abstract

G protein-coupled receptors (GPCRs) constitute the largest group of membrane receptor proteins in eukaryotes. Due to their significant roles in various physiological processes such as vision, smell and inflammation, GPCRs are the targets of many prescription drugs. However, the functional and sequence diversity of GPCRs has kept their prediction and classification based on amino acid sequence data as a challenging bioinformatics problem. There are existing computational approaches, mainly using machine learning and statistical methods, to predict and classify GPCRs based on amino acid sequence and sequence derived features. In this paper, we describe a searchable MySQL database, named GPCR-PEnDB (GPCR Prediction Ensemble Database), of confirmed GPCRs and non-GPCRs. It was constructed with the goal of allowing users to conveniently access useful information of GPCRs in a wide range of organisms and to compile reliable training and testing datasets for different combinations of computational tools. This database currently contains 3129 confirmed GPCR and 3575 non-GPCR sequences collected from the UniProtKB/Swiss-Prot protein database, encompassing over 1200 species. The non-GPCR entries include transmembrane proteins for evaluating various prediction programs' abilities to distinguish GPCRs from other transmembrane proteins. Each protein is linked to information about its source organism, classification, sequence lengths and composition, and other derived sequence features. We present examples of using this database along with its graphical user interface, to query for GPCRs with specific sequence properties and to compare the accuracies of five tools for GPCR prediction. This initial version of GPCR-PEnDB will provide a framework for future extensions to include additional sequence and feature data to facilitate the design and assessment of software tools and experimental studies to help understand the functional roles of GPCRs. Database URL: gpcr.utep.edu/database.
© The Author(s) 2020. Published by Oxford University Press.

Entities:  

Year:  2020        PMID: 33216895      PMCID: PMC7678784          DOI: 10.1093/database/baaa087

Source DB:  PubMed          Journal:  Database (Oxford)        ISSN: 1758-0463            Impact factor:   3.451


  27 in total

1.  PRINTS and its automatic supplement, prePRINTS.

Authors:  T K Attwood; P Bradley; D R Flower; A Gaulton; N Maudling; A L Mitchell; G Moulton; A Nordle; K Paine; P Taylor; A Uddin; C Zygouri
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

2.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

3.  SeQuery: an interactive graph database for visualizing the GPCR superfamily.

Authors:  Geng-Ming Hu; M K Secario; Chi-Ming Chen
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

Review 4.  GPCR & company: databases and servers for GPCRs and interacting partners.

Authors:  Noga Kowalsman; Masha Y Niv
Journal:  Adv Exp Med Biol       Date:  2014       Impact factor: 2.622

Review 5.  Thematic Minireview Series: New Directions in G Protein-coupled Receptor Pharmacology.

Authors:  Henrik G Dohlman
Journal:  J Biol Chem       Date:  2015-06-30       Impact factor: 5.157

Review 6.  Trends in GPCR drug discovery: new agents, targets and indications.

Authors:  Alexander S Hauser; Misty M Attwood; Mathias Rask-Andersen; Helgi B Schiöth; David E Gloriam
Journal:  Nat Rev Drug Discov       Date:  2017-10-27       Impact factor: 84.694

7.  GPCRomics: GPCR Expression in Cancer Cells and Tumors Identifies New, Potential Biomarkers and Therapeutic Targets.

Authors:  Paul A Insel; Krishna Sriram; Shu Z Wiley; Andrea Wilderman; Trishna Katakia; Thalia McCann; Hiroshi Yokouchi; Lingzhi Zhang; Ross Corriden; Dongling Liu; Michael E Feigin; Randall P French; Andrew M Lowy; Fiona Murray
Journal:  Front Pharmacol       Date:  2018-05-22       Impact factor: 5.810

8.  Joint use of over- and under-sampling techniques and cross-validation for the development and assessment of prediction models.

Authors:  Rok Blagus; Lara Lusa
Journal:  BMC Bioinformatics       Date:  2015-11-04       Impact factor: 3.169

Review 9.  Engineering therapeutic antibodies targeting G-protein-coupled receptors.

Authors:  Migyeong Jo; Sang Taek Jung
Journal:  Exp Mol Med       Date:  2016-02-05       Impact factor: 8.718

10.  The Pfam protein families database: towards a more sustainable future.

Authors:  Robert D Finn; Penelope Coggill; Ruth Y Eberhardt; Sean R Eddy; Jaina Mistry; Alex L Mitchell; Simon C Potter; Marco Punta; Matloob Qureshi; Amaia Sangrador-Vegas; Gustavo A Salazar; John Tate; Alex Bateman
Journal:  Nucleic Acids Res       Date:  2015-12-15       Impact factor: 16.971

View more
  1 in total

1.  What Makes GPCRs from Different Families Bind to the Same Ligand?

Authors:  Kwabena Owusu Dankwah; Jonathon E Mohl; Khodeza Begum; Ming-Ying Leung
Journal:  Biomolecules       Date:  2022-06-21
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.