Literature DB >> 20525566

A Monte Carlo approach successfully identifies randomness in multiple sequence alignments: a more objective means of data exclusion.

Bernhard Misof1, Katharina Misof.   

Abstract

Random similarity of sequences or sequence sections can impede phylogenetic analyses or the identification of gene homologies. Additionally, randomly similar sequences or ambiguously aligned sequence sections can negatively interfere with the estimation of substitution model parameters. Phylogenomic studies have shown that biases in model estimation and tree reconstructions do not disappear even with large data sets. In fact, these biases can become pronounced with more data. It is therefore important to identify possible random similarity within sequence alignments in advance of model estimation and tree reconstructions. Different approaches have been already suggested to identify and treat problematic alignment sections. We propose an alternative method that can identify random similarity within multiple sequence alignments (MSAs) based on Monte Carlo resampling within a sliding window. The method infers similarity profiles from pairwise sequence comparisons and subsequently calculates a consensus profile. This consensus profile represents a summary of all calculated single similarity profiles. In consequence, consensus profiles identify dominating patterns of nonrandom similarity or randomness within sections of MSAs. We show that the approach clearly identifies randomness in simulated and real data. After the exclusion of putative random sections, node support drastically improves in tree reconstructions of both data. It thus appears to be a powerful tool to identify possible biases of tree reconstructions or gene identification. The method is currently restricted to nucleotide data but will be extended to protein data in the near future.

Entities:  

Mesh:

Year:  2009        PMID: 20525566     DOI: 10.1093/sysbio/syp006

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  91 in total

1.  Disentangling the aging gene expression network of termite queens.

Authors:  José Manuel Monroy Kuhn; Karen Meusemann; Judith Korb
Journal:  BMC Genomics       Date:  2021-05-11       Impact factor: 3.969

2.  Phylogenomics reveals deep molluscan relationships.

Authors:  Kevin M Kocot; Johanna T Cannon; Christiane Todt; Mathew R Citarella; Andrea B Kohn; Achim Meyer; Scott R Santos; Christoffer Schander; Leonid L Moroz; Bernhard Lieb; Kenneth M Halanych
Journal:  Nature       Date:  2011-09-04       Impact factor: 49.962

Review 3.  Evolution and Functional Morphology of the Proboscis in Kalyptorhynchia (Platyhelminthes).

Authors:  Julian P S Smith; Marian K Litvaitis; Stefan Gobert; Theodore Uyeno; Tom Artois
Journal:  Integr Comp Biol       Date:  2015-05-21       Impact factor: 3.326

4.  Evolutionary history of Polyneoptera and its implications for our understanding of early winged insects.

Authors:  Benjamin Wipfler; Harald Letsch; Paul B Frandsen; Paschalia Kapli; Christoph Mayer; Daniela Bartel; Thomas R Buckley; Alexander Donath; Janice S Edgerly-Rooks; Mari Fujita; Shanlin Liu; Ryuichiro Machida; Yuta Mashimo; Bernhard Misof; Oliver Niehuis; Ralph S Peters; Malte Petersen; Lars Podsiadlowski; Kai Schütte; Shota Shimizu; Toshiki Uchifune; Jeanne Wilbrandt; Evgeny Yan; Xin Zhou; Sabrina Simon
Journal:  Proc Natl Acad Sci U S A       Date:  2019-01-14       Impact factor: 11.205

5.  Fast evolving 18S rRNA sequences from Solenogastres (Mollusca) resist standard PCR amplification and give new insights into mollusk substitution rate heterogeneity.

Authors:  Achim Meyer; Christiane Todt; Nina T Mikkelsen; Bernhard Lieb
Journal:  BMC Evol Biol       Date:  2010-03-09       Impact factor: 3.260

6.  Parametric and non-parametric masking of randomness in sequence alignments can be improved and leads to better resolved trees.

Authors:  Patrick Kück; Karen Meusemann; Johannes Dambach; Birthe Thormann; Björn M von Reumont; Johann W Wägele; Bernhard Misof
Journal:  Front Zool       Date:  2010-03-31       Impact factor: 3.172

7.  Can comprehensive background knowledge be incorporated into substitution models to improve phylogenetic analyses? A case study on major arthropod relationships.

Authors:  Björn M von Reumont; Karen Meusemann; Nikolaus U Szucsich; Emiliano Dell'Ampio; Vivek Gowri-Shankar; Daniela Bartel; Sabrina Simon; Harald O Letsch; Roman R Stocsits; Yun-xia Luan; Johann Wolfgang Wägele; Günther Pass; Heike Hadrys; Bernhard Misof
Journal:  BMC Evol Biol       Date:  2009-05-27       Impact factor: 3.260

8.  OrthoSelect: a protocol for selecting orthologous groups in phylogenomics.

Authors:  Fabian Schreiber; Kerstin Pick; Dirk Erpenbeck; Gert Wörheide; Burkhard Morgenstern
Journal:  BMC Bioinformatics       Date:  2009-07-16       Impact factor: 3.169

9.  Accurate and efficient reconstruction of deep phylogenies from structured RNAs.

Authors:  Roman R Stocsits; Harald Letsch; Jana Hertel; Bernhard Misof; Peter F Stadler
Journal:  Nucleic Acids Res       Date:  2009-09-01       Impact factor: 16.971

10.  The root of the East African cichlid radiations.

Authors:  Julia Schwarzer; Bernhard Misof; Diethard Tautz; Ulrich K Schliewen
Journal:  BMC Evol Biol       Date:  2009-08-05       Impact factor: 3.260

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.