Literature DB >> 25367737

The influence of alignment-free sequence representations on the semi-supervised classification of class C G protein-coupled receptors: semi-supervised classification of class C GPCRs.

Raúl Cruz-Barbosa1, Alfredo Vellido, Jesús Giraldo.   

Abstract

G protein-coupled receptors (GPCRs) are integral cell membrane proteins of relevance for pharmacology. The tertiary structure of the transmembrane domain, a gate to the study of protein functionality, is unknown for almost all members of class C GPCRs, which are the target of the current study. As a result, their investigation must often rely on alignments of their amino acid sequences. Sequence alignment entails the risk of missing relevant information. Various approaches have attempted to circumvent this risk through alignment-free transformations of the sequences on the basis of different amino acid physicochemical properties. In this paper, we use several of these alignment-free methods, as well as a basic amino acid composition representation, to transform the available sequences. Novel semi-supervised statistical machine learning methods are then used to discriminate the different class C GPCRs types from the transformed data. This approach is relevant due to the existence of orphan proteins to which type labels should be assigned in a process of deorphanization or reverse pharmacology. The reported experiments show that the proposed techniques provide accurate classification even in settings of extreme class-label scarcity and that fair accuracy can be achieved even with very simple transformation strategies that ignore the sequence ordering.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25367737     DOI: 10.1007/s11517-014-1218-y

Source DB:  PubMed          Journal:  Med Biol Eng Comput        ISSN: 0140-0118            Impact factor:   2.602


  32 in total

1.  Classification of G-protein coupled receptors by alignment-independent extraction of principal chemical properties of primary amino acid sequences.

Authors:  Maris Lapinsh; Alexandrs Gutcaits; Peteris Prusis; Claes Post; Torbjörn Lundstedt; Jarl E S Wikberg
Journal:  Protein Sci       Date:  2002-04       Impact factor: 6.725

2.  Classifying G-protein coupled receptors with support vector machines.

Authors:  Rachel Karchin; Kevin Karplus; David Haussler
Journal:  Bioinformatics       Date:  2002-01       Impact factor: 6.937

3.  Comparing two K-category assignments by a K-category correlation coefficient.

Authors:  J Gorodkin
Journal:  Comput Biol Chem       Date:  2004-12       Impact factor: 2.877

Review 4.  How many drug targets are there?

Authors:  John P Overington; Bissan Al-Lazikani; Andrew L Hopkins
Journal:  Nat Rev Drug Discov       Date:  2006-12       Impact factor: 84.694

5.  Structure of class B GPCR corticotropin-releasing factor receptor 1.

Authors:  Kaspar Hollenstein; James Kean; Andrea Bortolato; Robert K Y Cheng; Andrew S Doré; Ali Jazayeri; Robert M Cooke; Malcolm Weir; Fiona H Marshall
Journal:  Nature       Date:  2013-07-17       Impact factor: 49.962

6.  Structure of a class C GPCR metabotropic glutamate receptor 1 bound to an allosteric modulator.

Authors:  Huixian Wu; Chong Wang; Karen J Gregory; Gye Won Han; Hyekyung P Cho; Yan Xia; Colleen M Niswender; Vsevolod Katritch; Jens Meiler; Vadim Cherezov; P Jeffrey Conn; Raymond C Stevens
Journal:  Science       Date:  2014-03-06       Impact factor: 47.728

7.  Semi-supervised analysis of human brain tumours from partially labeled MRS information, using manifold learning models.

Authors:  Raúl Cruz-Barbosa; Alfredo Vellido
Journal:  Int J Neural Syst       Date:  2011-02       Impact factor: 5.866

8.  Structure of the human glucagon class B G-protein-coupled receptor.

Authors:  Fai Yiu Siu; Min He; Chris de Graaf; Gye Won Han; Dehua Yang; Zhiyun Zhang; Caihong Zhou; Qingping Xu; Daniel Wacker; Jeremiah S Joseph; Wei Liu; Jesper Lau; Vadim Cherezov; Vsevolod Katritch; Ming-Wei Wang; Raymond C Stevens
Journal:  Nature       Date:  2013-07-17       Impact factor: 49.962

9.  The G-protein-coupled receptors in the human genome form five main families. Phylogenetic analysis, paralogon groups, and fingerprints.

Authors:  Robert Fredriksson; Malin C Lagerström; Lars-Gustav Lundin; Helgi B Schiöth
Journal:  Mol Pharmacol       Date:  2003-06       Impact factor: 4.436

10.  A comparison of MCC and CEN error measures in multi-class prediction.

Authors:  Giuseppe Jurman; Samantha Riccadonna; Cesare Furlanello
Journal:  PLoS One       Date:  2012-08-08       Impact factor: 3.240

View more
  5 in total

1.  Prediction of intrinsically disordered regions in proteins using signal processing methods: application to heat-shock proteins.

Authors:  Vuk Vojisavljevic; Elena Pirogova
Journal:  Med Biol Eng Comput       Date:  2016-04-01       Impact factor: 2.602

2.  Using random forests for assistance in the curation of G-protein coupled receptor databases.

Authors:  Aleksei Shkurin; Alfredo Vellido
Journal:  Biomed Eng Online       Date:  2017-08-18       Impact factor: 2.819

3.  Representation Learning for Class C G Protein-Coupled Receptors Classification.

Authors:  Raúl Cruz-Barbosa; Erik-German Ramos-Pérez; Jesús Giraldo
Journal:  Molecules       Date:  2018-03-19       Impact factor: 4.411

4.  Using machine learning tools for protein database biocuration assistance.

Authors:  Caroline König; Ilmira Shaim; Alfredo Vellido; Enrique Romero; René Alquézar; Jesús Giraldo
Journal:  Sci Rep       Date:  2018-07-05       Impact factor: 4.379

5.  Label noise in subtype discrimination of class C G protein-coupled receptors: A systematic approach to the analysis of classification errors.

Authors:  Caroline König; Martha I Cárdenas; Jesús Giraldo; René Alquézar; Alfredo Vellido
Journal:  BMC Bioinformatics       Date:  2015-09-29       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.