Literature DB >> 27812751

A Method for the Annotation of Functional Similarities of Coding DNA Sequences: the Case of a Populated Cluster of Transmembrane Proteins.

Miguel Angel Fuertes1, José Ramón Rodrigo2, Carlos Alonso3.   

Abstract

The analysis of a large number of human and mouse genes codifying for a populated cluster of transmembrane proteins revealed that some of the genes significantly vary in their primary nucleotide sequence inter-species and also intra-species. In spite of that divergence and of the fact that all these genes share a common parental function we asked the question of whether at DNA level they have some kind of common compositional structure, not evident from the analysis of their primary nucleotide sequence. To reveal the existence of gene clusters not based on primary sequence relationships we have analyzed 13574 human and 14047 mouse genes by the composon-clustering methodology. The data presented show that most of the genes from each one of the samples are distributed in 18 clusters sharing the common compositional features between the particular human and mouse clusters. It was observed, in addition, that between particular human and mouse clusters having similar composon-profiles large variations in gene population were detected as an indication that a significant amount of orthologs between both species differs in compositional features. A gene cluster containing exclusively genes codifying for transmembrane proteins, an important fraction of which belongs to the Rhodopsin G-protein coupled receptor superfamily, was also detected. This indicates that even though some of them display low sequence similarity, all of them, in both species, participate with similar compositional features in terms of composons. We conclude that in this family of transmembrane proteins in general and in the Rhodopsin G-protein coupled receptor in particular, the composon-clustering reveals the existence of a type of common compositional structure underlying the primary nucleotide sequence closely correlated to function.

Entities:  

Keywords:  Gene clustering; Gene ontology; Molecular evolution; Triplet composon

Mesh:

Substances:

Year:  2016        PMID: 27812751     DOI: 10.1007/s00239-016-9763-7

Source DB:  PubMed          Journal:  J Mol Evol        ISSN: 0022-2844            Impact factor:   2.395


  43 in total

1.  Multipass membrane protein structure prediction using Rosetta.

Authors:  Vladimir Yarov-Yarovoy; Jack Schonbrun; David Baker
Journal:  Proteins       Date:  2006-03-01

2.  Prediction of G-protein-coupled receptor classes in low homology using Chou's pseudo amino acid composition with approximate entropy and hydrophobicity patterns.

Authors:  Q Gu; Y S Ding; T L Zhang
Journal:  Protein Pept Lett       Date:  2010-05       Impact factor: 1.890

3.  Novel genes derived from noncoding DNA in Drosophila melanogaster are frequently X-linked and exhibit testis-biased expression.

Authors:  Mia T Levine; Corbin D Jones; Andrew D Kern; Heather A Lindfors; David J Begun
Journal:  Proc Natl Acad Sci U S A       Date:  2006-06-15       Impact factor: 11.205

4.  Co-clustering and visualization of gene expression data and gene ontology terms for Saccharomyces cerevisiae using self-organizing maps.

Authors:  Markus Brameier; Carsten Wiuf
Journal:  J Biomed Inform       Date:  2006-05-20       Impact factor: 6.317

5.  Introns form compositional clusters in parallel with the compositional clusters of the coding sequences to which they pertain.

Authors:  Miguel A Fuertes; José M Pérez; Emile Zuckerkandl; Carlos Alonso
Journal:  J Mol Evol       Date:  2010-12-04       Impact factor: 2.395

6.  Genome-scale protein function prediction in yeast Saccharomyces cerevisiae through integrating multiple sources of high-throughput data.

Authors:  Yu Chen; Dong Xu
Journal:  Pac Symp Biocomput       Date:  2005

7.  On the origin and evolution of vertebrate olfactory receptor genes: comparative genome analysis among 23 chordate species.

Authors:  Yoshihito Niimura
Journal:  Genome Biol Evol       Date:  2009-04-30       Impact factor: 3.416

8.  The G-protein-coupled receptors in the human genome form five main families. Phylogenetic analysis, paralogon groups, and fingerprints.

Authors:  Robert Fredriksson; Malin C Lagerström; Lars-Gustav Lundin; Helgi B Schiöth
Journal:  Mol Pharmacol       Date:  2003-06       Impact factor: 4.436

9.  Prediction of G-protein-coupled receptor classes based on the concept of Chou's pseudo amino acid composition: an approach from discrete wavelet transform.

Authors:  Jian-Ding Qiu; Jian-Hua Huang; Ru-Ping Liang; Xiao-Quan Lu
Journal:  Anal Biochem       Date:  2009-04-11       Impact factor: 3.365

10.  Prediction of membrane transport proteins and their substrate specificities using primary sequence information.

Authors:  Nitish K Mishra; Junil Chang; Patrick X Zhao
Journal:  PLoS One       Date:  2014-06-26       Impact factor: 3.240

View more
  2 in total

1.  Conserved Critical Evolutionary Gene Structures in Orthologs.

Authors:  Miguel A Fuertes; José R Rodrigo; Carlos Alonso
Journal:  J Mol Evol       Date:  2019-02-28       Impact factor: 2.395

2.  Evolutionary conserved compositional structures hidden in genomes of the foot-and-mouth disease virus and of the human rhinovirus.

Authors:  Miguel Angel Fuertes; Silvia López-Arguello; Carlos Alonso
Journal:  Sci Rep       Date:  2019-11-12       Impact factor: 4.379

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.