| Literature DB >> 26541173 |
Mathieu Groussin1, Bastien Boussau2, Gergely Szöllõsi3, Laura Eme4, Manolo Gouy2, Céline Brochier-Armanet2, Vincent Daubin5.
Abstract
In a recent article, Nelson-Sathi et al. (NS) report that the origins of major archaeal lineages (MAL) correspond to massive group-specific gene acquisitions via HGT from bacteria (Nelson-Sathi et al. 2015. Origins of major archaeal clades correspond to gene acquisitions from bacteria. Nature 517(7532):77-80.). If correct, this would have fundamental implications for the process of diversification in microbes. However, a reexamination of these data and results shows that the methodology used by NS systematically inflates the number of genes acquired at the root of each MAL, and incorrectly assumes bacterial origins for these genes. A reanalysis of their data with appropriate phylogenetic models accounting for the dynamics of gene gain and loss between lineages supports the continuous acquisition of genes over long periods in the evolution of Archaea.Entities:
Keywords: Archaea; ancestral genome reconstruction; horizontal gene transfer
Mesh:
Year: 2015 PMID: 26541173 PMCID: PMC4866543 DOI: 10.1093/molbev/msv249
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
FForcing HGTs to the origins of archaeal phyla. (a) Competing evolutionary scenarios for « Cluster 23981 » from NS’s import set. Left: the gene is ancient in Bacteria and was subsequently transferred (red arrow) to the ancestor of Methanosarcinales. A large number of losses (blue lines) in Bacteria and Methanosarcinales is necessary to explain the narrow pattern of presence in extant species. Right: the gene is not ancient in Bacteria and was absent at the origin of Methanosarcinales. It has been transferred twice from Bacteria or Archaea and among Bacteria, and no loss is necessary. (b) Distribution of gene gains at the origin of Methanosarcinales (for other phyla, see the supplementary figure S1, Supplementary Material online). NS estimates are represented in purple. The distribution is very skewed toward sparsely distributed genes. ML expectations of gains on the corresponding set of genes are represented by red crosses.
FThe sporadic distribution of the 2,264 import gene families in Bacteria. The distribution of the 2,264 gene families in the 1,847 bacterial genomes is represented. The distribution is very skewed: half of the gene families have fewer than 21 bacterial homologs out of the 1,847 genomes (1.1%), and a vast majority of them (80%) are present in fewer than 99 bacterial genomes (5%). Because for each family the bacterial homologs are from at least two different phyla, this distribution is highly suggestive of recent HGTs among bacteria and of complex evolutionary scenarios for these families, preventing the inference of a direction of transfer.
FMost of the 2,264 import gene families were acquired after phyla origination in Archaea. We used the tree reconstructed by NS to represent the points of acquisition of genes of the import set in the evolution of Archaea. Colors on branches represent the expected number of gains per branch, relatively to the number of phylum-specific families. Branches with a gain expectation less than 1 are colored in grey. On the left, NS estimations are represented. All phylum-specific families were acquired at the origin of each phylum, and no subsequent gains are inferred. On the right, ML estimates show that gene family gains are spread over the history of diversification of each phyla and that most of the families were acquired after the origination of phyla.