| Literature DB >> 21876769 |
Kristen S Swithers1, Gregory P Fournier, Anna G Green, J Peter Gogarten, Pascal Lapierre.
Abstract
In 2009, James Lake introduced a new hypothesis in which reticulate phylogeny reconstruction is used to elucidate the origin of gram-negative bacteria (Nature 460: 967-971). The presented data supported the gram-negative bacteria originating from an ancient endosymbiosis between the Actinobacteria and Clostridia. His conclusion was based on a presence-absence analysis of protein families that divided all prokaryotes into five groups: Actinobacteria, Double Membrane bacteria (DM), Clostridia, Archaea and Bacilli. Of these five groups, the DM are by far the largest and most diverse group compared to the other groupings. While the fusion hypothesis for the origin of double membrane bacteria is enticing, we show that the signal supporting an ancient symbiosis is lost when the DM group is broken down into smaller subgroups. We conclude that the signal detected in James Lake's analysis in part results from a systematic artifact due to group size and diversity combined with low levels of horizontal gene transfer.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21876769 PMCID: PMC3158100 DOI: 10.1371/journal.pone.0023774
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Protein family counts for the ten possible informative profiles.
The table was adapted from Lake's Table 1 [2] to include the Pfam counts that result if different representative classes are chosen for the DM group. Number of Pfam per group is in parentheses the same number as in Lake's paper was found for all other groups. The circle illustrates Lake's hypothesis that the double membrane bacteria resulted from a fusion between Clostridia and Actinobacteria. The patterns compatible with this hypothesis are boxed (pattern 5,7,8,9 and 10).
Figure 2Posterior bootstrap support values (p-values) for a ring model, tree model or equal probabilities for each of the sampled groups.
The p-values were calculated from 10,000 re-samplings with replacement and extracting the total number of times the tree model, ring model or when both were equally supported from the parsimony counts. Only in the case where all the double membrane prokaryotes as defined by Lake [2], or when all the proteobacteria were included, did a ring model better explain the data.