Literature DB >> 25799056

Not seeing the forest for the trees: size of the minimum spanning trees (MSTs) forest and branch significance in MST-based phylogenetic analysis.

Andreia Sofia Teixeira1, Pedro T Monteiro2, João A Carriço3, Mário Ramirez4, Alexandre P Francisco2.   

Abstract

Trees, including minimum spanning trees (MSTs), are commonly used in phylogenetic studies. But, for the research community, it may be unclear that the presented tree is just a hypothesis, chosen from among many possible alternatives. In this scenario, it is important to quantify our confidence in both the trees and the branches/edges included in such trees. In this paper, we address this problem for MSTs by introducing a new edge betweenness metric for undirected and weighted graphs. This spanning edge betweenness metric is defined as the fraction of equivalent MSTs where a given edge is present. The metric provides a per edge statistic that is similar to that of the bootstrap approach frequently used in phylogenetics to support the grouping of taxa. We provide methods for the exact computation of this metric based on the well known Kirchhoff's matrix tree theorem. Moreover, we implement and make available a module for the PHYLOViZ software and evaluate the proposed metric concerning both effectiveness and computational performance. Analysis of trees generated using multilocus sequence typing data (MLST) and the goeBURST algorithm revealed that the space of possible MSTs in real data sets is extremely large. Selection of the edge to be represented using bootstrap could lead to unreliable results since alternative edges are present in the same fraction of equivalent MSTs. The choice of the MST to be presented, results from criteria implemented in the algorithm that must be based in biologically plausible models.

Entities:  

Mesh:

Year:  2015        PMID: 25799056      PMCID: PMC4370493          DOI: 10.1371/journal.pone.0119315

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


  14 in total

Review 1.  Community structure in social and biological networks.

Authors:  M Girvan; M E J Newman
Journal:  Proc Natl Acad Sci U S A       Date:  2002-06-11       Impact factor: 11.205

Review 2.  Population genetics and evolution of the pan-genome of Streptococcus pneumoniae.

Authors:  Alessandro Muzzi; Claudio Donati
Journal:  Int J Med Microbiol       Date:  2011-10-13       Impact factor: 3.473

3.  Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms.

Authors:  M C Maiden; J A Bygraves; E Feil; G Morelli; J E Russell; R Urwin; Q Zhang; J Zhou; K Zurth; D A Caugant; I M Feavers; M Achtman; B G Spratt
Journal:  Proc Natl Acad Sci U S A       Date:  1998-03-17       Impact factor: 11.205

4.  Using allele frequencies and geographic subdivision to reconstruct gene trees within a species: molecular variance parsimony.

Authors:  L Excoffier; P E Smouse
Journal:  Genetics       Date:  1994-01       Impact factor: 4.562

5.  eBURST: inferring patterns of evolutionary descent among clusters of related bacterial genotypes from multilocus sequence typing data.

Authors:  Edward J Feil; Bao C Li; David M Aanensen; William P Hanage; Brian G Spratt
Journal:  J Bacteriol       Date:  2004-03       Impact factor: 3.490

Review 6.  MLST revisited: the gene-by-gene approach to bacterial genomics.

Authors:  Martin C J Maiden; Melissa J Jansen van Rensburg; James E Bray; Sarah G Earle; Suzanne A Ford; Keith A Jolley; Noel D McCarthy
Journal:  Nat Rev Microbiol       Date:  2013-09-02       Impact factor: 60.633

7.  Homologous recombination drives both sequence diversity and gene content variation in Neisseria meningitidis.

Authors:  Ying Kong; Jennifer H Ma; Keisha Warren; Raymond S W Tsang; Donald E Low; Frances B Jamieson; David C Alexander; Weilong Hao
Journal:  Genome Biol Evol       Date:  2013       Impact factor: 3.416

8.  Phylogeographic reconstruction of a bacterial species with high levels of lateral gene transfer.

Authors:  Talima Pearson; Philip Giffard; Stephen Beckstrom-Sternberg; Raymond Auerbach; Heidie Hornstra; Apichai Tuanyok; Erin P Price; Mindy B Glass; Benjamin Leadem; James S Beckstrom-Sternberg; Gerard J Allan; Jeffrey T Foster; David M Wagner; Richard T Okinaka; Siew Hoon Sim; Ofori Pearson; Zaining Wu; Jean Chang; Rajinder Kaul; Alex R Hoffmaster; Thomas S Brettin; Richard A Robison; Mark Mayo; Jay E Gee; Patrick Tan; Bart J Currie; Paul Keim
Journal:  BMC Biol       Date:  2009-11-18       Impact factor: 7.431

9.  Global optimal eBURST analysis of multilocus typing data using a graphic matroid approach.

Authors:  Alexandre P Francisco; Miguel Bugalho; Mário Ramirez; João A Carriço
Journal:  BMC Bioinformatics       Date:  2009-05-18       Impact factor: 3.169

10.  PHYLOViZ: phylogenetic inference and data visualization for sequence based typing methods.

Authors:  Alexandre P Francisco; Cátia Vaz; Pedro T Monteiro; José Melo-Cristino; Mário Ramirez; Joäo A Carriço
Journal:  BMC Bioinformatics       Date:  2012-05-08       Impact factor: 3.169

View more
  1 in total

1.  Evidence for Host-Genotype Associations of Borrelia burgdorferi Sensu Stricto.

Authors:  Samir Mechai; Gabriele Margos; Edward J Feil; Nicole Barairo; L Robbin Lindsay; Pascal Michel; Nicholas H Ogden
Journal:  PLoS One       Date:  2016-02-22       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.