Literature DB >> 17366135

Conditioned genome reconstruction: how to avoid choosing the conditioning genome.

Matthew Spencer1, David Bryant, Edward Susko.   

Abstract

Genome phylogenies can be inferred from data on the presence and absence of genes across taxa. Logdet distances may be a good method, because they allow expected genome size to vary across the tree. Recently, Lake and Rivera proposed conditioned genome reconstruction (calculation of logdet distances using only those genes present in a conditioning genome) to deal with unobservable genes that are absent from every taxon of interest. We prove that their method can consistently estimate the topology for almost any choice of conditioning genome. Nevertheless, the choice of conditioning genome is important for small samples. For real bacterial genome data, different choices of conditioning genome can result in strong bootstrap support for different tree topologies. To overcome this problem, we developed supertree methods that combine information from all choices of conditioning genome. One of these methods, based on the BIONJ algorithm, performs well on simulated data and may have applications to other supertree problems. However, an analysis of 40 bacterial genomes using this method supports an incorrect clade of parasites. This is a common feature of model-based gene content methods and is due to parallel gene loss.

Mesh:

Year:  2007        PMID: 17366135     DOI: 10.1080/10635150601156313

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  8 in total

1.  Inferring bacterial genome flux while considering truncated genes.

Authors:  Weilong Hao; G Brian Golding
Journal:  Genetics       Date:  2010-06-15       Impact factor: 4.562

2.  A likelihood framework to analyse phyletic patterns.

Authors:  Ofir Cohen; Nimrod D Rubinstein; Adi Stern; Uri Gophna; Tal Pupko
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-12-27       Impact factor: 6.237

3.  On the artefactual parasitic eubacteria clan in conditioned logdet phylogenies: heterotachy and ortholog identification artefacts as explanations.

Authors:  Ajanthah Sangaralingam; Edward Susko; David Bryant; Matthew Spencer
Journal:  BMC Evol Biol       Date:  2010-11-09       Impact factor: 3.260

4.  Telling the whole story in a 10,000-genome world.

Authors:  Robert G Beiko
Journal:  Biol Direct       Date:  2011-06-30       Impact factor: 4.540

5.  Automated reconstruction of whole-genome phylogenies from short-sequence reads.

Authors:  Frederic Bertels; Olin K Silander; Mikhail Pachkov; Paul B Rainey; Erik van Nimwegen
Journal:  Mol Biol Evol       Date:  2014-03-05       Impact factor: 16.240

6.  Inference and characterization of horizontally transferred gene families using stochastic mapping.

Authors:  Ofir Cohen; Tal Pupko
Journal:  Mol Biol Evol       Date:  2009-10-06       Impact factor: 16.240

7.  The tree of genomes: an empirical comparison of genome-phylogeny reconstruction methods.

Authors:  Angela McCann; James A Cotton; James O McInerney
Journal:  BMC Evol Biol       Date:  2008-11-12       Impact factor: 3.260

8.  The deep roots of the rings of life.

Authors:  James A Lake; Janet S Sinsheimer
Journal:  Genome Biol Evol       Date:  2013       Impact factor: 3.416

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.