
Direct maximum parsimony phylogeny reconstruction from genotype data.

Srinath Sridhar, Fumei Lam, Guy E Blelloch, R Ravi, Russell Schwartz.

Abstract

BACKGROUND: Maximum parsimony phylogenetic tree reconstruction from genetic variation data is a fundamental problem in computational genetics with many practical applications in population genetics, whole genome analysis, and the search for genetic predictors of disease. Efficient methods are available for reconstruction of maximum parsimony trees from haplotype data, but such data are difficult to determine directly for autosomal DNA. Data are more commonly available in the form of genotypes, which consist of conflated combinations of pairs of haplotypes from homologous chromosomes. Currently, there are no general algorithms for the direct reconstruction of maximum parsimony phylogenies from genotype data. Phylogenetic applications for autosomal data must therefore rely on other methods that first computationally infer haplotypes from genotypes.
RESULTS: In this work, we develop the first practical method for computing maximum parsimony phylogenies directly from genotype data. We show that the standard practice of first inferring haplotypes from genotypes and then reconstructing a phylogeny on the haplotypes often substantially overestimates phylogeny size. As an immediate application, our method can be used to determine the minimum number of mutations required to explain a given set of observed genotypes.
CONCLUSION: Phylogeny reconstruction directly from unphased data is computationally feasible for moderate-sized problem instances and can lead to substantially more accurate tree size inferences than the standard practice of treating phasing and phylogeny construction as two separate analysis stages. The difference between the approaches is particularly important for downstream applications that require a lower bound on the number of mutations that the genetic region has undergone.
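ILLUSTRATION: The relationship between haplotypes and genotypes described in the background can be sketched in a few lines of Python. This is a minimal illustration, not the method of the paper. It assumes binary (0/1) alleles and a 0/1/2 genotype encoding in which 2 marks a heterozygous site; the function names `conflate` and `phasings` are hypothetical. It shows why phasing is ambiguous: a genotype with k heterozygous sites (k >= 1) is consistent with 2^(k-1) unordered haplotype pairs.

```python
from itertools import product


def conflate(h1, h2):
    """Combine two binary haplotypes into one genotype.

    Assumed encoding: a site where both haplotypes agree keeps that
    allele (0 or 1); a site where they differ is recorded as 2.
    """
    if len(h1) != len(h2):
        raise ValueError("haplotypes must have the same length")
    return tuple(a if a == b else 2 for a, b in zip(h1, h2))


def phasings(genotype):
    """Enumerate the unordered haplotype pairs consistent with a genotype."""
    het_sites = [i for i, g in enumerate(genotype) if g == 2]
    pairs = set()
    for bits in product((0, 1), repeat=len(het_sites)):
        h1 = list(genotype)
        h2 = list(genotype)
        # At each heterozygous site the two haplotypes carry opposite alleles.
        for site, allele in zip(het_sites, bits):
            h1[site] = allele
            h2[site] = 1 - allele
        # Sort so that (h1, h2) and (h2, h1) count as the same pair.
        pairs.add(tuple(sorted((tuple(h1), tuple(h2)))))
    return sorted(pairs)


genotype = conflate((0, 1, 0, 1), (0, 0, 1, 1))
for pair in phasings(genotype):
    print(genotype, "<-", pair)
```

Each candidate pair may imply a different tree, which is why committing to one inferred phasing before building the phylogeny can change the resulting tree size.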

Year:  2007        PMID: 18053244      PMCID: PMC2222657          DOI: 10.1186/1471-2105-8-472

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


