Literature DB >> 19204811

Identifying coevolving partners from paralogous gene families.

Chen-Hsiang Yeang1.   

Abstract

Many methods have been developed to detect coevolution from aligned sequences. However, all the existing methods require a one-to-one mapping of candidate coevolving partners (nucleotides, amino acids) a priori. When two families of sequences have distinct duplication and loss histories, finding the one-to-one mapping of coevolving partners can be computationally involved. We propose an algorithm to identify the coevolving partners from two families of sequences with distinct phylogenetic trees. The algorithm maps each gene tree to a reference species tree, and builds a joint state of sequence composition and assignments of coevolving partners for each species tree node. By applying dynamic programming on the joint states, the optimal assignments can be identified. Time complexity is quadratic to the size of the species tree, and space complexity is exponential to the maximum number of gene tree nodes mapped to the same species tree node. Analysis on both simulated data and Pfam protein domain sequences demonstrates that the paralog coevolution algorithm picks up the coevolving partners with 60% 88% accuracy. This algorithm extends phylogeny-based coevolutionary models and make them applicable to a wide range of problems such as predicting protein-protein, protein-DNA and DNA-RNA interactions of two distinct families of sequences.

Entities:  

Year:  2008        PMID: 19204811      PMCID: PMC2614191          DOI: 10.4137/ebo.s621

Source DB:  PubMed          Journal:  Evol Bioinform Online        ISSN: 1176-9343            Impact factor:   1.625


  26 in total

1.  RNA secondary structure prediction using stochastic context-free grammars and evolutionary history.

Authors:  B Knudsen; J Hein
Journal:  Bioinformatics       Date:  1999-06       Impact factor: 6.937

2.  Co-evolution of proteins with their interaction partners.

Authors:  C S Goh; A A Bogan; M Joachimiak; D Walther; F E Cohen
Journal:  J Mol Biol       Date:  2000-06-02       Impact factor: 5.469

3.  Correlations among amino acid sites in bHLH protein domains: an information theoretic analysis.

Authors:  W R Atchley; K R Wollenberg; W M Fitch; W Terhalle; A W Dress
Journal:  Mol Biol Evol       Date:  2000-01       Impact factor: 16.240

4.  A simple algorithm to infer gene duplication and speciation events on a gene tree.

Authors:  C M Zmasek; S R Eddy
Journal:  Bioinformatics       Date:  2001-09       Impact factor: 6.937

5.  Bayesian gene/species tree reconciliation and orthology analysis using MCMC.

Authors:  Lars Arvestad; Ann-Charlotte Berglund; Jens Lagergren; Bengt Sennblad
Journal:  Bioinformatics       Date:  2003       Impact factor: 6.937

6.  A novel method for detecting intramolecular coevolution: adding a further dimension to selective constraints analyses.

Authors:  Mario A Fares; Simon A A Travers
Journal:  Genetics       Date:  2006-03-17       Impact factor: 4.562

7.  A space-time process model for the evolution of DNA sequences.

Authors:  Z Yang
Journal:  Genetics       Date:  1995-02       Impact factor: 4.562

8.  Dating of the human-ape splitting by a molecular clock of mitochondrial DNA.

Authors:  M Hasegawa; H Kishino; T Yano
Journal:  J Mol Evol       Date:  1985       Impact factor: 2.395

9.  Identification and classification of conserved RNA secondary structures in the human genome.

Authors:  Jakob Skou Pedersen; Gill Bejerano; Adam Siepel; Kate Rosenbloom; Kerstin Lindblad-Toh; Eric S Lander; Jim Kent; Webb Miller; David Haussler
Journal:  PLoS Comput Biol       Date:  2006-04-21       Impact factor: 4.475

10.  Detecting coevolution in and among protein domains.

Authors:  Chen-Hsiang Yeang; David Haussler
Journal:  PLoS Comput Biol       Date:  2007-09-18       Impact factor: 4.475

View more
  3 in total

1.  A multi-scale coevolutionary approach to predict interactions between protein domains.

Authors:  Giancarlo Croce; Thomas Gueudré; Maria Virginia Ruiz Cuevas; Victoria Keidel; Matteo Figliuzzi; Hendrik Szurmant; Martin Weigt
Journal:  PLoS Comput Biol       Date:  2019-10-21       Impact factor: 4.475

2.  Inter-protein residue covariation information unravels physically interacting protein dimers.

Authors:  Sara Salmanian; Hamid Pezeshk; Mehdi Sadeghi
Journal:  BMC Bioinformatics       Date:  2020-12-17       Impact factor: 3.169

3.  Genotype-phenotype associations: substitution models to detect evolutionary associations between phenotypic variables and genotypic evolutionary rate.

Authors:  Timothy D O'Connor; Nicholas I Mundy
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.