On combining protein sequences and nucleic acid sequences in phylogenetic analysis: the homeobox protein case.

D Agosti, D Jacobs, R DeSalle.

Abstract

Amino acid encoding genes contain character state information that may be useful for phylogenetic analysis on at least two levels. The nucleotide sequence and the translated amino acid sequences have both been employed separately as character states for cladistic studies of various taxa, including studies of the genealogy of genes in multigene families. In essence, amino acid sequences and nucleic acid sequences are two different ways of character coding the information in a gene. Silent positions in the nucleotide sequence (first or third positions in codons that can accrue change without changing the identity of the amino acid that the triplet codes for) may accrue change relatively rapidly and become saturated, losing the pattern of historical divergence. On the other hand, non-silent nucleotide alterations and their accompanying amino acid changes may evolve too slowly to reveal relationships among closely related taxa. In general, the dynamics of sequence change in silent and non-silent positions in protein coding genes result in homoplasy and lack of resolution, respectively. We suggest that the combination of nucleic acid and the translated amino acid coded character states into the same data matrix for phylogenetic analysis addresses some of the problems caused by the rapid change of silent nucleotide positions and overall slow rate of change of non-silent nucleotide positions and slowly changing amino acid positions. One major theoretical problem with this approach is the apparent non-independence of the two sources of characters. However, there are at least three possible outcomes when comparing protein coding nucleic acid sequences with their translated amino acids in a phylogenetic context on a codon by codon basis. First, the two character sets for a codon may be entirely congruent with respect to the information they convey about the relationships of a certain set of taxa. 
Second, one character set may display no information concerning a phylogenetic hypothesis while the other character set may impart information to a hypothesis. These two possibilities are cases of non-independence; however, we argue that congruence in such cases can be thought of as increasing the weight of the particular phylogenetic hypothesis that is supported by those characters. In the third case, the two sources of character information for a particular codon may be entirely incongruent with respect to phylogenetic hypotheses concerning the taxa examined. In this last case the two character sets are independent in that information from neither can predict the character states of the other. Examples of these possibilities are discussed and the general applicability of combining these two sources of information for protein coding genes is presented using sequences from the homeobox region of 46 homeobox genes from Drosophila melanogaster to develop a hypothesis of genealogical relationship of these genes in this large multigene family.
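
The combined data matrix described in the abstract can be illustrated with a minimal sketch. This is not the authors' procedure or software. It assumes in-frame, uppercase, aligned coding sequences and the standard genetic code, and the names `translate`, `combined_matrix`, and the toy sequences are hypothetical. Each taxon's row holds the nucleotide characters followed by the translated amino acid characters, so both codings of the same gene enter one matrix.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nt_seq):
    """Translate an in-frame nucleotide string; unknown or gapped codons become '?'."""
    usable = len(nt_seq) - len(nt_seq) % 3  # ignore a trailing partial codon
    codons = (nt_seq[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "?") for codon in codons)


def combined_matrix(aligned_nt):
    """Append each taxon's amino acid characters to its nucleotide characters."""
    return {taxon: seq + translate(seq) for taxon, seq in aligned_nt.items()}


# Hypothetical toy alignment (two codons per sequence), for illustration only.
toy = {"geneA": "AAACTT", "geneB": "AAGCTC", "geneC": "AGACTA"}
matrix = combined_matrix(toy)
```

In this toy alignment, geneA and geneB differ only at silent third positions, so only their nucleotide characters distinguish them, while geneC carries a non-silent change in the first codon that appears in both codings.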

Year:  1996        PMID: 11541749     DOI: 10.1111/j.1096-0031.1996.tb00193.x

Source DB:  PubMed          Journal:  Cladistics        ISSN: 0748-3007            Impact factor:   5.254


  9 in total

1.  sine oculis in basal Metazoa.

Authors:  Ilona G Bebenek; Ruth D Gates; Joshua Morris; Volker Hartenstein; David K Jacobs
Journal:  Dev Genes Evol       Date:  2004-06-25       Impact factor: 0.900

2.  Evidence, content and corroboration and the Tree of Life.

Authors:  E Kurt Lienau; Rob DeSalle
Journal:  Acta Biotheor       Date:  2008-11-18       Impact factor: 1.774

3.  Phylogenetic incongruence among oncogenic genital alpha human papillomaviruses.

Authors:  Apurva Narechania; Zigui Chen; Rob DeSalle; Robert D Burk
Journal:  J Virol       Date:  2005-12       Impact factor: 5.103

4.  Did homeodomain proteins duplicate before the origin of angiosperms, fungi, and metazoa?

Authors:  G Bharathan; B J Janssen; E A Kellogg; N Sinha
Journal:  Proc Natl Acad Sci U S A       Date:  1997-12-09       Impact factor: 11.205

5.  Transformation Series as an Ideographic Character Concept.

Authors:  Taran Grant; Arnold G Kluge
Journal:  Cladistics       Date:  2004-02       Impact factor: 5.254

6.  The evolution of the major hepatitis C genotypes correlates with clinical response to interferon therapy.

Authors:  Phillip S Pang; Paul J Planet; Jeffrey S Glenn
Journal:  PLoS One       Date:  2009-08-11       Impact factor: 3.240

7.  Generation of divergent uroplakin tetraspanins and their partners during vertebrate evolution: identification of novel uroplakins.

Authors:  Rob Desalle; Javier U Chicote; Tung-Tien Sun; Antonio Garcia-España
Journal:  BMC Evol Biol       Date:  2014-01-23       Impact factor: 3.260

8.  Phosphotyrosine phosphatase R3 receptors: Origin, evolution and structural diversification.

Authors:  Javier U Chicote; Rob DeSalle; Antonio García-España
Journal:  PLoS One       Date:  2017-03-03       Impact factor: 3.240

9.  Ancient origins of vertebrate-specific innate antiviral immunity.

Authors:  Krishanu Mukherjee; Bryan Korithoski; Bryan Kolaczkowski
Journal:  Mol Biol Evol       Date:  2013-10-08       Impact factor: 16.240
