Literature DB >> 18855041

PCA and clustering reveal alternate mtDNA phylogeny of N and M clades.

G Alexe1, R Vijaya Satya, M Seiler, D Platt, T Bhanot, S Hui, M Tanaka, A J Levine, G Bhanot.   

Abstract

Phylogenetic trees based on mtDNA polymorphisms are often used to infer the history of recent human migrations. However, there is no consensus on which method to use. Most methods make strong assumptions which may bias the choice of polymorphisms and result in computational complexity which limits the analysis to a few samples/polymorphisms. For example, parsimony minimizes the number of mutations, which biases the results to minimizing homoplasy events. Such biases may miss the global structure of the polymorphisms altogether, with the risk of identifying a "common" polymorphism as ancient without an internal check on whether it either is homoplasic or is identified as ancient because of sampling bias (from oversampling the population with the polymorphism). A signature of this problem is that different methods applied to the same data or the same method applied to different datasets results in different tree topologies. When the results of such analyses are combined, the consensus trees have a low internal branch consensus. We determine human mtDNA phylogeny from 1737 complete sequences using a new, direct method based on principal component analysis (PCA) and unsupervised consensus ensemble clustering. PCA identifies polymorphisms representing robust variations in the data and consensus ensemble clustering creates stable haplogroup clusters. The tree is obtained from the bifurcating network obtained when the data are split into k = 2,3,4,...,kmax clusters, with equal sampling from each haplogroup. Our method assumes only that the data can be clustered into groups based on mutations, is fast, is stable to sample perturbation, uses all significant polymorphisms in the data, works for arbitrary sample sizes, and avoids sample choice and haplogroup size bias. The internal branches of our tree have a 90% consensus accuracy. In conclusion, our tree recreates the standard phylogeny of the N, M, L0/L1, L2, and L3 clades, confirming the African origin of modern humans and showing that the M and N clades arose in almost coincident migrations. However, the N clade haplogroups split along an East-West geographic divide, with a "European R clade" containing the haplogroups H, V, H/V, J, T, and U and a "Eurasian N subclade" including haplogroups B, R5, F, A, N9, I, W, and X. The haplogroup pairs (N9a, N9b) and (M7a, M7b) within N and M are placed in nonnearest locations in agreement with their expected large TMRCA from studies of their migrations into Japan. For comparison, we also construct consensus maximum likelihood, parsimony, neighbor joining, and UPGMA-based trees using the same polymorphisms and show that these methods give consistent results only for the clade tree. For recent branches, the consensus accuracy for these methods is in the range of 1-20%. From a comparison of our haplogroups to two chimp and one bonobo sequences, and assuming a chimp-human coalescent time of 5 million years before present, we find a human mtDNA TMRCA of 206,000 +/- 14,000 years before present.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18855041     DOI: 10.1007/s00239-008-9148-7

Source DB:  PubMed          Journal:  J Mol Evol        ISSN: 0022-2844            Impact factor:   2.395


  26 in total

1.  Efficiency of the neighbor-joining method in reconstructing deep and shallow evolutionary relationships in large phylogenies.

Authors:  S Kumar; S R Gadagkar
Journal:  J Mol Evol       Date:  2000-12       Impact factor: 2.395

2.  Mitochondrial genome variation in eastern Asia and the peopling of Japan.

Authors:  Masashi Tanaka; Vicente M Cabrera; Ana M González; José M Larruga; Takeshi Takeyasu; Noriyuki Fuku; Li-Jun Guo; Raita Hirose; Yasunori Fujita; Miyuki Kurata; Ken-ichi Shinoda; Kazuo Umetsu; Yoshiji Yamada; Yoshiharu Oshida; Yuzo Sato; Nobutaka Hattori; Yoshikuni Mizuno; Yasumichi Arai; Nobuyoshi Hirose; Shigeo Ohta; Osamu Ogawa; Yasushi Tanaka; Ryuzo Kawamori; Masayo Shamoto-Nagai; Wakako Maruyama; Hiroshi Shimokata; Ryota Suzuki; Hidetoshi Shimodaira
Journal:  Genome Res       Date:  2004-10       Impact factor: 9.043

3.  Tracing modern human origins.

Authors:  Henry Harpending; Vinayak Eswaran
Journal:  Science       Date:  2005-09-23       Impact factor: 47.728

4.  Detection of basepair substitution mutation at a frequency of 1 x 10(-7) by combining two genotypic selection methods, MutEx enrichment and allele-specific competitive blocker PCR.

Authors:  B L Parsons; R H Heflich
Journal:  Environ Mol Mutagen       Date:  1998       Impact factor: 3.216

5.  The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors:  N Saitou; M Nei
Journal:  Mol Biol Evol       Date:  1987-07       Impact factor: 16.240

6.  Optimal alignments in linear space.

Authors:  E W Myers; W Miller
Journal:  Comput Appl Biosci       Date:  1988-03

Review 7.  The powers and pitfalls of parsimony.

Authors:  C B Stewart
Journal:  Nature       Date:  1993-02-18       Impact factor: 49.962

8.  Strand asymmetry in human mitochondrial DNA mutations.

Authors:  M Tanaka; T Ozawa
Journal:  Genomics       Date:  1994-07-15       Impact factor: 5.736

9.  Mitochondrial DNA and human evolution.

Authors:  R L Cann; M Stoneking; A C Wilson
Journal:  Nature       Date:  1987 Jan 1-7       Impact factor: 49.962

10.  Mitochondrial genome variation and the origin of modern humans.

Authors:  M Ingman; H Kaessmann; S Pääbo; U Gyllensten
Journal:  Nature       Date:  2000-12-07       Impact factor: 49.962

View more
  3 in total

1.  Cluster analysis of the origins of the new influenza A(H1N1) virus.

Authors:  A Solovyov; G Palacios; T Briese; W I Lipkin; R Rabadan
Journal:  Euro Surveill       Date:  2009-05-28

2.  Principal Component Analysis applied directly to Sequence Matrix.

Authors:  Tomokazu Konishi; Shiori Matsukuma; Hayami Fuji; Daiki Nakamura; Nozomi Satou; Kunihiro Okano
Journal:  Sci Rep       Date:  2019-12-17       Impact factor: 4.379

3.  The influence of habitats on female mobility in Central and Western Africa inferred from human mitochondrial variation.

Authors:  Valeria Montano; Veronica Marcari; Mariano Pavanello; Okorie Anyaele; David Comas; Giovanni Destro-Bisol; Chiara Batini
Journal:  BMC Evol Biol       Date:  2013-01-29       Impact factor: 3.260

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.