Stretch coding and block coding: two new strategies to represent questionably aligned DNA sequences.

Daniel L Geiger

Abstract

Most coding strategies that address the problem of questionable alignment (elision, case sensitive, missing, polymorphic, gaps as presence/absence matrix) conflict with phylogenetic principles, particularly those relating to the concept of homology (shared similarity explained by common ancestry). In some cases, the test of conjunction is failed. In other cases, characters that are coded ambiguously can lead to character-state optimization in the terminal taxa that conflicts with the original observations. Only data exclusion and contraction avoid these pitfalls. In highly dissimilar sequences, additional character states can represent the available information. Two new methods that accomplish this, block coding and stretch coding, are introduced here. These two new coding strategies are not in conflict with the test of conjunction and do not contradict the original observations. They are comparable to coding practices with morphological data once the intrinsic differences due to character-state identity and topographical identity have been taken into account. It is suggested that, of the three recoding methods, the one selected should be the one that preserves the maximum potential phylogenetic information, as measured by the minimum number of steps required for the particular part of the data matrix.

Year:  2002        PMID: 11821912     DOI: 10.1007/s00239-001-0001-5

Source DB:  PubMed          Journal:  J Mol Evol        ISSN: 0022-2844            Impact factor:   2.395


  3 in total

1.  Multiple Sequence Alignment Averaging Improves Phylogeny Reconstruction.

Authors:  Haim Ashkenazy; Itamar Sela; Eli Levy Karin; Giddy Landan; Tal Pupko
Journal:  Syst Biol       Date:  2019-01-01       Impact factor: 15.683

2.  Analyzing multi-locus plant barcoding datasets with a composition vector method based on adjustable weighted distance.

Authors:  Chi Pang Li; Zu Guo Yu; Guo Sheng Han; Ka Hou Chu
Journal:  PLoS One       Date:  2012-07-27       Impact factor: 3.240

3.  Rapid DNA barcoding analysis of large datasets using the composition vector method.

Authors:  Ka Hou Chu; Minli Xu; Chi Pang Li
Journal:  BMC Bioinformatics       Date:  2009-11-10       Impact factor: 3.169
