Literature DB >> 27150504

One step beyond a ribosome: The ancient anaerobic core.

Filipa L Sousa1, Shijulal Nelson-Sathi2, William F Martin2.   

Abstract

Life arose in a world without oxygen and the first organisms were anaerobes. Here we investigate the gene repertoire of the prokaryote common ancestor, estimating which genes it contained and to which lineages of modern prokaryotes it was most similar in terms of gene content. Using a phylogenetic approach we found that among trees for all 8779 protein families shared between 134 archaea and 1847 bacterial genomes, only 1045 have sequences from at least two bacterial and two archaeal groups and retain the ancestral archaeal-bacterial split. Among those, the genes shared by anaerobes were identified as candidate genes for the prokaryote common ancestor, which lived in anaerobic environments. We find that these anaerobic prokaryote common ancestor genes are today most frequently distributed among methanogens and clostridia, strict anaerobes that live from low free energy changes near the thermodynamic limit of life. The anaerobic families encompass genes for bifunctional acetyl-CoA-synthase/CO-dehydrogenase, heterodisulfide reductase subunits C and A, ferredoxins, and several subunits of the Mrp-antiporter/hydrogenase family, in addition to numerous S-adenosyl methionine (SAM) dependent methyltransferases. The data indicate a major role for methyl groups in the metabolism of the prokaryote common ancestor. The data furthermore indicate that the prokaryote ancestor possessed a rotor stator ATP synthase, but lacked cytochromes and quinones as well as identifiable redox-dependent ion pumping complexes. The prokaryote ancestor did possess, however, an Mrp-type H(+)/Na(+) antiporter complex, capable of transducing geochemical pH gradients into biologically more stable Na(+)-gradients. The findings implicate a hydrothermal, autotrophic, and methyl-dependent origin of life. This article is part of a Special Issue entitled 'EBEC 2016: 19th European Bioenergetics Conference, Riva del Garda, Italy, July 2-6, 2016', edited by Prof. Paolo Bernardi.
Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Acetogens; Anaerobes; Autotrophy; Early evolution; Geochemistry; Methanogens

Mesh:

Substances:

Year:  2016        PMID: 27150504      PMCID: PMC4906156          DOI: 10.1016/j.bbabio.2016.04.284

Source DB:  PubMed          Journal:  Biochim Biophys Acta        ISSN: 0006-3002


“It must be clear that all the changes and the original life forms are dependent upon energy as well as material capture and flow.”

Introduction

One of the more intriguing enterprises in comparative genomics is to infer the nature of the last universal common ancestor, also called Luca, on the basis of gene content [46], [57], [73], [6], [74], [75], [81], [112]. The standard approach to the problem is to generate a reference tree – sometimes called a backbone tree or species tree – and then to plot the distribution of gene families, usually the COGs, or clusters of orthologous groups [110] onto the leaves of the tree and then to infer presence and absence patterns along the inner branches and nodes of the tree, right down to its root, the presence and absence patterns at the root giving an estimate of Luca's gene content [6], [27], [46], [73], [74], [75], [81], [112]. The models that one assumes for gene gain and loss have a considerable impact on the inferred genome of Luca [23], [24], [46], [73], [100] as does the selected reference tree [31] and the genome collection of the study. In early investigations of Luca gene content, Luca was considered as the last common ancestor of bacteria, archaea and eukaryotes [115]. More recent findings have eukaryotic ribosomes branching within the archaea, rather than as their sister [22], [87], [108], [114], such that in the more modern “two domain” trees, Luca is the last common ancestor of prokaryotes. To stress that, one could introduce the term last prokaryotic common ancestor, or Lpca. But new terms for established concepts are seldom helpful, and Luca means different things to different people anyway. Here we stick to the term Luca, but we use it here to mean the last common ancestor of prokaryotes, which in our view of early evolution was not a free living cell, but rather was an entity that had the genetic code, that had proteins, that had ribosomes and an ATPase [106], that could make DNA as a stable repository for retrievable information, but probably could not replicate DNA as chromosomes [51], [53], and that – we posit – probably was contained within naturally forming inorganic compartments as chemical confines of a geological structure like a hydrothermal vent, which supplied the reduced carbon and continuous chemical disequilibrium (energy supply) that Luca needed to get organized in the first place [65], [66], [59], [107]. But irrespective of where and how it arose, newer phylogenetic data indicate that eukaryotes need to be excluded when it comes to estimating the gene content of Luca. Excluding eukaryotes has an immense effect on Luca gene content estimation. This is because current views and current data have it that eukaryotes arose from a symbiosis of two prokaryotes, the bacterial ancestor of mitochondria and its archaeal host [1], [22], [54], [58], [64], [90], and there are only about 2585 gene families that eukaryotes share widely with prokaryotes [55]. By including eukaryotes in Luca gene content estimation, one would be excluding all enzymes specific to anaerobic chemolithoautotrophy, all enzymes specific to anoxygenic photosynthesis, all enzymes specific to sulfate reduction, and all enzymes specific to all biochemical pathways that eukaryotes do not possess, which comprises the vast majority of genes distributed across prokaryotic genomes. Eukaryotes possess only a very, very small sample of prokaryotic energy metabolic diversity [120] and an even smaller sample of prokaryotic gene diversity in general [55]. It is thus important to estimate Luca gene content based upon all prokaryotic genes, not just the narrow sample of genes that eukaryotes inherited from prokaryotes at eukaryotic (and plastid) origin [55]. Removing the restriction that the inclusion of eukaryotes introduces into Luca gene content estimation is easy, one just excludes eukaryotic gene from the set to be considered for Luca inference. Far more problematic, however, is the issue of lateral gene transfer. This is because – even in studies that exclude eukaryotes from Luca inference – many studies score genes as present in Luca if the genes are present in several archaea and one (or more) bacterium, or present in several bacteria and one (or more) archaeon [9]. But such genes could easily be transdomain lateral gene transfers and not holdovers from Luca at all. In haloarchaea alone, there are more than 1000 well-documented cases of genes that were acquired from bacteria via transdomain lateral gene transfers [76]. In a broader sample of archaeal lineages, Nelson-Sathi et al. [77] identified more than 6000 cases of transdomain lateral gene transfers. In prokaryotes, LGT is not only frequent [10], [45], [50], [61] but it also played an important role in prokaryote lineage diversification [77]. Transdomain LGT generates gene distribution patterns that complicate the inference of Luca's gene content. In an insightful paper, Kannan et al. [46] clearly outlined the problems that LGT introduces regarding Luca: If a gene family was invented relatively late in evolution, in a particular bacterial lineage, and then transferred across broad taxonomic boundaries (for example from bacteria to archaea or vice versa), then its phylogenetic distribution would erroneously mimic presence in Luca. If not recognized as LGTs, such genes lead to a vastly (and artefactually) inflated Luca genome content. If such interdomain LGT is widespread, reconstructing Luca's gene content becomes tedious. How to identify transdomain LGTs so as to remove their inflating effects upon Luca gene content estimation? We have a suggestion. We recently reported a clustering and phylogenetic analysis of over 6 million genes from 1891 prokaryotic genomes, focusing on genes shared by archaea and bacteria [77]. We found that interdomain LGTs from bacteria to archaea vastly outnumbered gene transfers in the other direction and that gene acquisitions from bacteria correspond to the origin of several major archaeal groups [77]. In that study, 4705 protein families showed extensive interdomain LGT, and another 4397 protein families were identified where archaea and bacteria are monophyletic in the corresponding phylogenetic tree. Gene presence in archaea and bacteria, in addition to monophyly of archaea and bacteria, is the minimal condition that should be fulfilled for genes that were present in Luca but not subject to interdomain LGT. Among the 4397 protein families reported in which archaea and bacteria are monophyletic, 3347 cases represent fairly obvious interdomain LGTs in that the genes are widespread in bacteria but present in only one archaeal lineage [77]. The remaining 1045 genes show archaea and bacteria to be monophyletic but show no obvious signs of interdomain LGT. This set of genes is, in principle, a candidate list for genes present in Luca but not transferred between domains since the divergence of bacteria and archaea. These 1045 genes are therefore of interest and compose our starting point for the functional analysis of how the primordial ancestor of bacteria and archaea made a living. Yet there still might be some gene families among those 1045 that, despite bi-domain presence and domain monophyly, were subject to interdomain LGTs that went undetected in our earlier report. For example, oxygen dependent enzymes can hardly have existed in Luca because life arose in a world without oxygen [42], [56], [67], but they might have been passed around promiscuously after the advent of oxygenated environments. Because Luca had to be an anaerobe (oxygen being a biological product), we can introduce one more criterion for Luca presence: oxygen dependent enzymes and pathways cannot have been present in Luca, such that enzymes and pathways specific to, or typically found among, aerobes (but not in anaerobes) can be excluded from Luca's gene set. To gain insights on the primordial metabolism of the common ancestor of bacteria and archaea before its diversification into the bacterial and archaea lineages, we set out here to identify protein families that span the archaeal–bacterial division, that were not subject to interdomain LGT, and that are preferentially found within the genomes of anaerobes. Of course, we cannot exclude the existence of other proteins in Luca, such as the ones widely shared by aerobic and anaerobic organisms, or some of the ones whose evolutionary history involved interdomain LGT events.

Methods

Aerotolerance profiling

The method described in Sousa et al. [105] was employed to identify and classify hemecopper oxygen reductases (HCOs) and nitric oxide reductases (NORs) across the genomes used in this study. Briefly, a manually curated database of 1225 sequences classified as being A1, A2, B and C type hemecopper oxygen reductases [82] or NORs was download from HCO database and used to query the genomes of 1981 prokaryotes from our dataset (blast cut-of 25% identity, E-value 10− 10, alignment coverage of at least 300 amino acids). In a second step, the obtained hits were classified as belonging to one of the five recognized enzyme types. An organism is considered aerobic if it contains genes coding for A1, A2, B or C type enzymes and anaerobic otherwise. Protein families were classified as aerobic or anaerobic if 90% of their representatives and, at least 85% of archaeal organisms and, at least 85% of bacterial organisms belong to the same classification. The remaining cases were classified as mixed families. Results are summarized in Supplemental Table A1.

Functional characterization of protein families

Functional annotations were retrieved from COG [37], KEGG [44], and when justified, Biocyc [17] and Brenda [95] databases. The combined information is available as Supplemental Table A2. In the wordle representation, common words such as hypothetical-protein, protein or subunit were removed as well as the superphyla Proteobacteria, Firmicutes, Crenarchaeota and Euryarchaeota taxonomic descriptions.

Distribution of O2 dependent reactions

Reactions, reaction directionality (according to metabolic pathways), KEGG orthology and Brite hierarchy were parsed from KEGG's database (June 2015) [44]. This allowed the mapping onto KEGG genomes of the reactions depending on O2 as in [88].

Results and discussion

Universal (or nearly so) genes

Genes that are widely distributed across prokaryotic domains were either present in Luca before the divergence of bacteria and archaea [27], [52] or were subject to interdomain LGT [46]. We previously clustered 134 archaeal genomes into 25,762 protein families and identified their corresponding homologs among 1847 bacterial genomes [77]. In that dataset, the genomes span, according to prokaryote systematics, 13 archaeal and 23 bacterial higher taxonomic groups respectively, which roughly correspond to phyla (or class) and are designated for convenience henceforth as phyla here. If we search for nearly universal protein families using the criteria of i) presence in at least 22 bacterial phyla (missing in only one phylum) and at least 12 archaeal phyla (missing in only one phylum) and ii) monophyly of the domains within the corresponding maximum likelihood tree, we end up with a set of 27 nearly-universal protein families (Fig. 1a), corresponding to the familiar set of 30–35 “core” genes for (mostly ribosomal) proteins that are now commonly used to infer lineage relationships in place of rRNA alone [20], [21], [22], [39], [108], [114].
Fig. 1

Wordle representation of the most frequent functional descriptions within a) 27 interdomain nearly universal monophyletic protein families and b) 109 nearly universal protein families. The size of the words relates to the number of times the word appears within the annotations. The larger the word, the higher its frequency.

If we allow for some gene loss during evolution (or rapid sequence divergence in some lineages) and thus opt for a less stringent distribution criterion, and furthermore relax the criterion for domain monophyly, our extended protein set consists of 109 protein families that include 9 aminoacyl-tRNA synthetase families, whose complex evolutionary history is well known [116], [117], several enzymes involved in amino acid biosynthesis and ATP synthase subunits (Fig. 1b, Table 1). This extended, or nearly universal, set corresponds very closely in content to the 102 nearly universal trees, or “nuts”, reported by Puigbò et al. [84]), in that sense we could independently reproduce (109 genes) their nearly universal tree set (102 genes). The core and the extended core thus indicate the (obvious) presence of ribosomes in Luca [33], [74], an ion-gradient-dependent energy harvesting machinery [59], and the presence of some amino acid biosynthesis.
Table 1

Functional category of the nearly universal protein families and the 1045 families that retain the archaeal bacterial division.

Nearly universal mono.Nearly universal1045 mono. familiesAnaerobicAerobicMixed
Cellular processes and signaling
 Cell cycle control, division, chromos part.019009
 Cell motility009009
 Cell wall/membrane/envelope biogenesis05500149
 Defense mechanisms01494144
 Extracellular structures001001
 Intra traff., secretion, and vesicular transport22140212
 Mobilome: prophages, transposons00211020
 Nuclear structure000000
 Post-transl. mod., protein turnover, chaperones13331428
 Signal transduction mechanisms00250223
Information, storage and processing
 Chromatin structure and dynamics000000
 Replication, recombination and repair03301227
 RNA processing and modification000000
 Transcription00462539
 Translation, ribosomal struct. and biogenesis2334916085
Metabolism
 Amino acid transport and metabolism022683560
 Carbohydrate transport and metabolism05754368
 Coenzyme transport and metabolism17533149
 Energy production and conversion0591101170
 Inorganic ion transport and metabolism04562648
 Lipid transport and metabolism01410734
 Nucleotide transport and metabolism013151212
 Synth., transp. cat. metabolites01230716
Poorly characterized
 Function unknown00685459
 General function prediction only0211771298
 Not found006012444
Total2710910456279904
However, neither the core nor the expanded core (or nearly universal set) deliver information regarding the type of carbon and energy metabolism of primordial cells, because microbial metabolism has diversified within and across lineages during 3.5 billion years of microbial evolution. But the size of both the core (~ 30 genes) and the extended core (~ 100 genes) indicates that the clusters that we are using [77] deliver universal and nearly universal gene family distributions that correspond very well with what others have found independently using smaller genome samples and different methods [39], [84]. From this point on, we will focus on the remaining non-universal protein families that might have been present in Luca.

Distinction between recent and ancient protein families

In a previous study [77] we identified 4397 protein families that retained the monophyly of archaea and bacteria in maximum likelihood trees (Fig. 2a). However 3347 of those correspond to protein families in which either i) several bacteria and only one archaeal lineage or ii) several archaea and only one bacterial lineage group is represented. We further include in this group, for thoroughness, five protein families in which the archaeal representatives belong to Desulfurococcales and Fervidicoccus fontis Kam984 (here it is grouped with Desulfurococcales). The narrow phylogenetic distributions of these 3352 protein families (present in only one group in one of the domains) indicate that they correspond to interdomain LGT events that occurred after the prokaryotic domains had already diversified into major lineages. As such, they contain information about LGT frequencies, which is not our focus here, but do not contain direct clues about early metabolism and were excluded from our present analysis.
Fig. 2

Inter-domain gene sharing and aerobic profile network of families that retain the archaeal bacteria division. Number of genes shared between archaeal and bacterial organisms A) within the monophyletic 4397 protein families and B) within the remaining 1045 monophyletic protein families after removal of obvious interdomain LGTs. Each cell in the matrix indicates the number of genes (E-value ≤ 10− 10 and ≥ 25% global identity) shared between the protein families of 134 archaeal and 1847 bacterial genomes whose tree retain the archaea–bacteria division (scale bar at right). C) Inter-domain gene sharing network in terms of the aerobic classification of the organism pairs. Panel A is adapted from [77].

When we exclude those interdomain LGTs, following the suggestion of Kannan et al. [46], what remains is a set of 1045 protein families, containing sequences from at least two archaeal groups and at least two bacterial groups are present in the families and where the domains are monophyletic in phylogenetic trees. Fig. 2b shows which archaeal lineages and which bacterial lineages harbor these Luca genome candidates. Their patterns of gene sharing are not randomly distributed among either archaea or bacteria, rather they are preferentially distributed in pairs of lineages (taxa) from each domain. These taxon pairs are boxed and labeled with numbers in Fig. 2b: 1) clostridial and methanogenic lineages, 2) actinobacterial and Sulfolobales lineages, and 3) deltaproteobacterial and methanogen lineages. The boxed taxon pairs identify archaeal and bacterial lineages that share Luca candidate genes, that is, genes that are i) present in archaea and bacteria, but ii) not present in all bacteria and archaea (which we expect for Luca's genes, because Luca's habitat was different from today's), iii) where the domains do not interleave in the 1045 maximum likelihood trees, and iv) where the gene family is present in more than one archaeal lineage and more than one bacterial lineage (that is, lineage specific interdomain LGTs have been filtered out). Within the Luca candidate genes that identify the lineage pairs 1–3, the COG categories “energy production and conversion” and “carbohydrate metabolism” are among the most prominently represented (Table 1).

Ancient means anaerobic

However, even for these Luca candidate gene protein families, it is still possible that domain monophyly stems from lineage specific interdomain LGT and subsequent within domain transfer. This mechanism of distribution could apply both to ancient genes present in Luca and to later lineage specific inventions and/or later environment specific genes, for example oxygenic environments. Here we seek to identify ancient proteins. Because Luca arose in anaerobic environments [26], [67], proteins that arose in, or are typical for, oxygenated environments cannot be ancient, hence we would like to exclude them from the Luca candidate gene set. If we understand the literature of geochemists correctly, nobody can say for sure at the moment when the first oxygen arose [62], but we can be reasonably sure that it was present in the atmosphere roughly 2.5 billion years ago [42], [99] and accumulated in the oceans roughly 600 million years ago [62], [109]. A few might disagree [79] and argue for the presence of O2 since the early Archaean. Yet despite some uncertainty about when oxygen arose, we can be relatively sure that Luca arose in a world without appreciable amounts of oxygen [56], [67], because oxygen is a product of cyanobacterial photosynthesis involving two photosystems, which is a highly derived form of microbial physiology, having arisen after anoxygenic photosynthesis, cytochrome dependent respirations, fermentations and autotrophy [26], [67], [96]. Thus, in the search for a list of bona fide Luca candidate genes, the next pruning step is to look for the protein families shared only by anaerobic organisms, meaning that we filter aerobes and proteins typical of aerobes from the data. For this, we have to ascertain the oxygen tolerance or oxygen requirements of the 1981 organisms within our dataset. How to do this in the absence of specific growth information for each genome, and taking facultative aerobes into account? Microbial physiology can help. To reduce O2 to water, prokaryotes use two evolutionary unrelated membrane complexes, the bd oxygen reductase and the hemecopper oxygen reductases (HCOs, also known as the complex IV or cytochrome c oxidase). While the bd oxygen reductase is generally associated with oxygen detoxification or very low oxygen environments [11], the HCOs are incorporated in prokaryotic electron transfer chains that are much more diverse than the canonical mitochondrial one, but, as in the case of eukaryotes, that also promote the establishment of an electrochemical cation gradient across the membrane to feed the universal ATP synthase [82], [29]. Since organisms that express bd oxygen reductases usually also possess hemecopper oxygen reductases [11], one way to assess the oxygen requirements of the organisms present in our dataset is simply to look for the presence of HCOs in their genome. This is not trivial, though, because HCOs are related (structurally and sequence-wise) with nitric oxide reductases (NORs), with which they share the same general structural core of subunit I, the presence of a low-spin heme and a similar catalytic center composed of a high-spin heme and a metal ion — copper in the case of HCOs and iron in the case of NORs [18], [25], [105], [29]. However, instead of reducing oxygen to water, NORs perform the two-electron reduction of NO to N2O and are not related with aerobic respiration. Thus, if we can effectively distinguish between HCOs (aerobic) and NORs (anaerobic) we can distinguish at the genome level between organisms that regularly deal with O2 (aerobes, facultative aerobes, having HCOs) and those that shun it (anaerobes, lacking HCO while possessing NOR, or lacking both). For this we used tools developed elsewhere [105] and adapted to this specific problem (see Methods). In the present genome sample, ~ 67% (1332 out of 1981) of the genomes contain one or more HCOs, revealing the adaptation to oxic habitats, while ~ 33% (649 out of 1981) of the genomes are devoid of HCOs (Table 2). Our method sorted 18 organisms (genomes) that only contain NORs into the anaerobic category, which is important, because of the possible existence of NO dependent chemistry in early earth [12] and the presence of NORs at the onset of bioenergetic processes as argued by some [78], [29]. NORs are divided into two main groups, according to the nature of their electron donor. Thus, cNORs represent the enzymes that use soluble electron donors such as cytochrome c, HiPIPs or cupredoxins and qNORs represent the enzymes which oxidize quinols [29]. qNORs have representatives in the two prokaryotic domains although their presence in Archaea can be both attributed exclusively to two interdomain LGT events, one to Crenarchaeota and one to halobacteria [14], [38] or vertical inherence, multiple losses (except in some Crenarchaeota organisms), and an additional LGT of bacterial qNORs to halobacteria [29]. Regarding cNORs, only one sequence has been identified so far within Archaea [29] and the phylogenies of this subfamily, as the authors recognize, are prone to change over time.
Table 2

Taxonomic distribution of HCOs within 1981 genomes.

GroupsAerobicAnaerobicN. genomes
Archaea
 Others
  Korarchaeota011
  Thaumarchaeota202
  Nanoarchaeota011
224
 Crenarchaeota
  Thermoproteales6713
  Desulfurococcalesa11314
  Sulfolobales16016
232043
 Euryarchaeota
  Thermococcales01414
  Thermoplasmatales134
  Archaeoglobales044
  Methanobacteriales088
  Methanococcales01515
  Methanomicrobiales066
  Methanocellales033
  Methanosarcinales01010
  Halobacteriales23023
246387
Bacteria
 Clostridia4105109
 Bacilli137151288
 Negativicutes066
 Tenericutes04747
 Planctomycetes606
 Chlamydiae43438
 Spirochaetes73946
 Bacteroidetes552277
 Actinobacteria16641207
 Chlorobi5611
 Fusobacteria055
 Thermotogae01515
 Aquificae8210
 Chloroflexi10616
 Deinococcus–Thermus17017
 Cyanobacteria44044
 Acidobacteria808
 Deltaproteobacteria311748
 Epsilonproteobacteria72274
 Alphaproteobacteria2044208
 Betaproteobacteria1203123
 Gammaproteobacteria37441415
 Other bacteria111829
12835641847
Total13326491981

Fervidicoccus fontis Kam984 (Fervidicoccales order) is grouped within Desulfurococcales order.

By the HCO criterion, aerobes are present in three archaeal phyla in our sample (Thaumarchaeota, Crenarchaeota, and Euryarchaeota). HCOs are widely distributed among crenarchaeotes, being present in all 16 members of the Sulfolobales sampled, among Thermoproteales (6 out of 13 Pyrobaculum species) and one member of the Desulfurococcales. By contrast, among the euryarchaeotic genomes sampled, only Halobacteriales and one member of the Thermoplasmatales (Picrophilus torridus DSM9790) contain genes coding for HCO (Table 2). The existence of functional HCOs in halobacteria is well documented [121], [122] as is the identification of large influx of gene transfers from bacteria to the halophiles [89] and to the halophile common ancestor [76], [118], that transformed an ancient methanogen into an oxygen-respiring heterotroph. Interestingly, a strictly anaerobic, acetate-oxidizing S0-reducing haloarchaeon has been sequenced [104], showing that aerobic respiration is not anymore a universal feature of the haloarchaea and underscoring ongoing metabolic diversification and gene loss within haloarchaea. Within bacteria, HCOs are present in genomes from 19 out of the 23 bacterial groups, although with different densities of distribution. While aerobic (and facultative aerobic) organisms dominate the proteobacterial, cyanobacterial and actinobacterial lineages, the majority of the bacterial genomes sampled belonging to the fusobacteria, thermotogae, aquificae, negativicutes, tenericutes, chlamydiae and spirochaetes groups do not contain detectable HCOs. Having a genomic proxy for oxygen tolerance among organisms (genome lineages) within the present sample allows us to classify the domain pairs as anaerobes, oxygen tolerant, or mixed (Fig. 2c, Table 2). Of the three most frequent domain pairs, pair 1 contains methanogen (anaerobes) or anaerobic lineages derived from methanogens together with clostridia (anaerobes), pair 2 contains Sulfolobales (aerotolerant) and Actinobacteria (aerotolerant), while pair 3 contains deltaproteobacterial (mixed aerotolerant and anaerobes) and methanogen (anaerobes) or anaerobic lineages derived from methanogens. The pairs also allow us to separate the protein families into aerobic (recent), anaerobic (ancient) or mixed protein families.

Who is new (aerotolerant), who is old (anaerobic)?

HCOs allow us to sort genomes into categories of aerotolerant (having HCOs) or not (lacking HCOs). But the exercise here is to classify protein families as being typical for aerobes or anaerobes. In a simple scheme, we could just say that proteins encoded in genomes labeled as anaerobic should correspond to anaerobic protein families and those families having members that occur in aerotolerant genomes (having HCOs) should be excluded from the LucaGC set. But that criterion is too strict and will falsely exclude many anaerobic protein families, because i) they might be present in facultative anaerobes, and ii) they might have been subject to recent LGTs into aerotolerant species but have not yet been lost. This presents a difficult problem that readily lends itself to years of parameter space exploration, recurrent adjustments, and getting bogged down in endless minutiae of calculations. We took a pragmatic, albeit in some aspects somewhat arbitrary, approach. For a protein family to qualify for designation as an anaerobic family, we chose as our first (arbitrary) criterion that least 90% of all genes present in the family belong to organisms devoid of HCOs. To ensure a similar representation of archaea and bacteria in terms of anaerobic organisms, we included an additional filtering to consider only the protein families where at least 85% of the archaeal organisms represented are anaerobic and at least 85% of the bacterial organisms are anaerobic. This has the effect of excluding protein families whose high anaerobic score is due to an overrepresentation of anaerobic organism from one domain versus the other. Conversely, the criteria of at least 85% of archaeal organisms and 85% of bacterial organisms represented containing HCO were used to identify protein families as aerotolerant. What about the other protein families beyond these thresholds? Could they also be Luca candidate genes? Some might. We recognize that we might be excluding both “aerotolerant” and “anaerobic” protein families from these groups, this initial approach to filtering LGTs can probably be improved upon in future applications. Using these criteria, 79 protein families were classified as “aerotolerant”. The majority of the taxonomic distribution of the Luca candidate genes that identify aerotolerant taxon pairs identify Sulfolobales, Halobacteriales, Actinobacteria and Proteobacteria (Fig. 3b). In terms of functional annotation, these Luca candidate genes consist of mainly cytochrome and copper-containing related proteins, dioxygenases, and some NADH- and FAD-dependent oxidoreductases (Fig 3d). That is, the Luca candidate genes that link these aerobe-containing interdomain taxon pairs are inventions that arose after the cyanobacterial innovation of oxygenic photosynthesis [2]. Their distributions are not likely the result of ancient vertical inheritance, as Luca arose long before oxygen did.
Fig. 3

Wordle representation of the most frequent taxonomic and functional descriptions within the 79 aerobic and the 62 anaerobic protein families. The size of the words relates to the number of times the word appears within the annotations. The larger the word, the higher its frequency.

Perhaps as a slight disappointment, but also not entirely as a surprise, 904 of the 1045 families were classified as “mixed” with respect to occurrence within aerotolerant lineages. This has to do with LGTs and facultative anaerobes and these families deserve further inspection in future studies. What, on the bottom line, did this investigation uncover? We found two things, which we will summarize in the final two sections. 1) Perhaps more important than its role as a terminal acceptor, oxygen is an outstanding oxidant that microbes learned to use widely in many pathways. 2) At the anaerobic core of genes that reflect vertical inheritance from the prokaryotic common ancestor, we find evidence for antiporter-dependent ion gradient conversion, ATP synthase ion-gradient harnessing, FeS-cluster dependent soluble electron transport, and methyl-group dependent metabolism.

What did O2 do for metabolism?

The transition from an anoxic world to the emergence of oxic environments promoted overall modifications in the environment redox potential [26], [48], changing the metal availability and leading to the invention of new proteins and folds [49], [43] that allowed the expansion of the existing metabolic pathways to include oxygen-dependent reactions [19], [43], [88], [99]. Moreover, the irreversibility of oxygen consuming reactions promoted a positive selection pressure for the transfer and maintenance of oxygen related metabolism throughout prokaryotic organisms. Modern genomes harbor many O2-dependent reactions distributed across 11 metabolic pathway categories (Fig. 4).
Fig. 4

Distribution of oxygen dependent reaction within prokaryotic genomes grouped by the 11 Kegg metabolic pathway maps.

In addition to its availability as a terminal acceptor for pre-existing (anaerobic) respiratory chains, O2 allowed several energetically demanding reactions to occur more readily, for example the oxidation and cleavage of aromatic compounds by various dioxygenases [35]. Many existing biosynthetic pathways were also affected, as for example, the O2 independent (more ancient) and alternative O2-dependent (derived) biosynthetic pathways for heme [13], [34], [40], cobalamine [91], [92], and chlorophyll [80] to name a few. About half of the O2-dependent enzymes that we identified in the current sample of 1821 genomes are involved in the degradation of isoprenoids and xenobiotics. Without question, O2 expanded the realm of anabolic and catabolic pathways across genomes [88], but it did not alter the nature of the basic building blocks of life, nor did it fundamentally alter their biosynthesis. In the main, oxygen facilitated the oxidation of the building blocks of life, opening up new pathways of heterotrophic growth. A similar metabolic transition occurred at the origin of heterotrophy, as the first heterotrophs arose from autotrophs by learning to glean energy from the anaerobic oxidation of amino acids and bases at low H2 partial pressures [96]. Earth is now different from when life arose, the main difference for microbes being that today there are oxic habitats [42], [99]. Some have argued that O2 or high potential acceptors like O2 (for example NO), must have existed at life's origin, the argument being that the free energy changes associated with H2-dependent CO2 reduction were not sufficient to get life started [94]. O2 is indeed a strong terminal acceptor for electron transfer phosphorylation (hence valuable once life had already evolved respiratory chains). But is it too often overlooked that the use of oxygen comes at a huge cost. How so? As it relates to organic compounds like amino acids and nucleic acid bases – the substance of life – it is a curious but significant observation that life with oxygen is energetically far more expensive than life without oxygen [5]. The energetic costs to synthesize the cellular building blocks under oxic conditions are on average 12.9 times higher than under anoxic conditions [60]. This is because the reactions that generate the basic building blocks of cells (amino acids in the main) are thermodynamically favorable in anoxic environments but thermodynamically unfavorable in oxidizing environments [60], [70]. The reason is that modest H2 partial pressures (≤ 10− 4–10− 3 atm, [111]) favor the reduction of CO2 to organic compounds (which is why acetogens and methanogens can grow) while low H2 partial pressures favor the slow oxidation of amino acids and bases back to ammonium and CO2 [96], whereas O2 allows fast and highly exergonic oxidations of even unfermentable organics, which is why wood, coal and oil burn well, releasing heat in the process.

The ancient metabolic core: small, but strictly anaerobic

Only 62 protein families that trace to the prokaryote common ancestor fulfilled the criteria set here to be designated as “anaerobic” families. Even though 62 is not a genome-sized number, it is still greater than ~ 30, the usual list of suspect genes used for the reconstruction of deep phylogeny, and importantly, our list of 62 does not include either the core (~ 30) or the extended core (~ 100). The 27 universal and the 102 nearly-universal protein families were filtered out from the ancient anaerobic core due to their presence in both aerobic and anaerobic organisms. At this point we have to make a distinction between prokaryotic interdomain monophyletic families (that is, vertical inherence from Luca) and universal proteins, whose wide distribution with or without interdomain LGT events (e.g. ATP synthases), were present in Luca. The 62 anaerobic protein families identified above are the only remaining markers for ancient metabolism we can still retrieve from extant genomic data using this procedure. Of course, we cannot exclude the existence of other proteins in Luca, such as the ones widely shared by aerobic and anaerobic organisms, or some whose evolutionary history involved interdomain LGT events. But these are the ones we identify with this procedure. Yet these are new insights into Luca's gene complement. Furthermore, the functional classification of these 62 ancient anaerobic proteins goes beyond the traditionally identified informational genes, covering a number of functional categories not previously known about Luca (Table 1). The Luca candidate genes shared by anaerobes are the genes that we sought to identify at the outset of this study. The protein families that i) still retain the ancient archaeal–bacterial division in phylogenies, and ii) that are preferentially encoded in the genomes of anaerobes, identify mainly methanogens and clostridia (Fig. 3a). Here one might ask “identify them as what?” Methanogens (Methanomicrobia and Methanococci) and clostridia (Clostridiales) are the most frequent pairs within the 62 protein families that preserve the archaeal–bacterial split, hence were not obviously distributed via LGT, and that only rarely (if ever) occur in genomes that harbor HCOs. These protein families are typical of anaerobes, whereby Luca had to be an anaerobe. Put another way, if we acknowledge that Luca was an anaerobe, there are only 62 protein families that we can trace as probable vertical inheritances from Luca (as opposed to LGTs) that are typical of strict anaerobes in that they rarely, if ever, occur in genomes of aerotolerant (facultative anaerobic) prokaryotes, and these 62 proteins are mainly shared by clostridia and methanogens. One might object that this observation results from the high number of clostridial and methanogenic organisms devoid of HCOs within our dataset. However, both archaea and bacteria contain other taxonomic groups without HCOs, but that have far fewer Luca candidate genes than clostridia and methanogens. The functional annotations of the anaerobic Luca candidate genes reveal numerous methyl transferases, subunits of the acetyl-CoA synthase complex, the soluble heterodisulfide reductase subunits C and A, several SAM-dependent methyltransferases and ferredoxin (Supplemental Table A2), in addition to several subunits of the H+/Na+-antiporter MRP/hydrogenases and related complexes within this group (Fig. 3c). The Mrp antiporter is surprising and particularly interesting. Based purely on comparative physiology and theoretical considerations, it was recently proposed that a crucial step in bioenergetic evolution at an early stage prior to the emergence of free-living cells, was the advent of an Mrp-type H+/Na+ antiporter that could convert geochemical pH gradients into biologically more useful Na+ gradients [59]. This antiporter, the kind common to Ech and FeNi hydrogenases [63], was suggested to have been the first step en route to replacing geochemical ion gradients with biologically derived ones, to generate Na+ gradients that could be more readily harnessed by Luca's rotor–stator ATPase, yet prior to the invention of redox-chemistry-based (electron-transport-based) ion pumping systems [59]. Later work on simulated gradients provided support for that view [103]. Here, in a completely independent genome based approach, the Mrp-type antiporter suddenly turns up among 62 genes in the ancient anaerobic core. This is surprising, but was also predicted on the basis of bioenergetics by the theory that life arose at an alkaline hydrothermal vent. The present data also indicate the absence of redox-based ion-pumping systems in Luca (Table 3), as previously suggested [66], [59], [106], [107]. The 27 universal and the 102 nearly-universal protein families were filtered out from the ancient anaerobic core because they are present in both aerobic and anaerobic organisms.
Table 3

Gene annotations of the 62 anaerobic core families.

ClusterAnnotationComments
6079radical_SAM_proteinSAM methyltransferase
9248radical_SAM_proteinPyruvate formate lyase activating enzyme EC:1.97.1.4; SAM methyltransferase
21,920Radical_SAM_domain_proteinrlmN; 23S rRNA methyltransferase [EC:2.1.1.192]; SAM
10,466type_11_methyltransferaseSAM methyltransferase
21,520type_11_methyltransferaseSAM methyltransferase
14,283Methylase_involved_in…SAM methyltransferase
166250S_ribosomal_protein_L29RP-L29, rpmC; large subunit ribosomal protein L29
1296acetyl-CoA_decarbonylase/synthase…Acetyl-CoA decarbonylase/synthase complex subunit gamma [EC:2.1.1.245]
1321acetyl-CoA_decarbonylase/synthase…Acetyl-CoA decarbonylase/synthase complex subunit delta [EC:2.1.1.245]
7851ATP_synthase_subunit_cATPVK, ntpK, atpK; V/A-type H +-transporting ATPase subunit K
4404ferredoxinfer; ferredoxin
18,705flavodoxinFlavodoxin
13,779heterodisulfide_reductase_subunit_A…Heterodisulfide-reductase CoB CoM
14,009heterodisulfide_reductase_subunit_C…Heterodisulfide reductase; CoM CoS
4789membrane_bound_hydrogenase_subunit…Multicomponent Na +:H + antiporter
11,625Membrane-bound_hydrogenase_MBH…Membrane-bound-hydrogenase antiporter subunit
11,548L-glutamine_synthetaseglnA, GLUL; glutamine synthetase [EC:6.3.1.2]; glutamine-synthetase
10,709nitrogenase_iron-iron_accessory_protein…Iron–iron nitrogenase accessory protein AnfO
5364acetyltransferaseCoA acyl-CoA acetyltransferase
10,044phenylacetate–CoA_ligasepaaK; phenylacetate-CoA ligase [EC:6.2.1.30]; CoA phenylacetate-CoA-ligase
23,640NADPH-dependent_FMN_reductaseFMN FAD NADH NADH-dependent-FMN-reductase
10,494NADPH-dependent_FMN_reductaseNADH-dependent-FMN-reductase
4774sugar_kinaseSugar-kinase
11,273FeoA_family_proteinfeoA ferrous iron transport protein A
11,762aldo/keto_reductaseK07079; NADH aldo/keto-reductase
12,136putative_ABC_transporterPutative ABC transport system permease protein
6304citrate_transporterCitrate_transporter
16,010cobalamin_biosynthesis_proteincbiN; cobalt/nickel transport protein; cobalamin
14,243cytochrome_c_biogenesis…Cytochrome c-biogenesis transmembrane protein
1071ApbE_family_lipoproteinK09740; hypothetical protein; ApbE_family_lipoprotein
6375beta-lactamase_domain-containing_proteinBeta-lactamase metal dependent
4311deblocking_aminopeptidaseE3.2.1.4; endoglucanase metal metallopeptidase
22,518YcfA-like_proteinhicA; mRNA interferase HicA [EC:3.1.-.-]; YcfA-like-protein
15,732nucleotidyltransferaseK07076; nucleotidyltransferase
4155PP-loop_domain-containing…ttcA; tRNA 2-thiocytidine biosynthesis protein TtcA; PP-loop
9988regulatory_protein_MarRRegulatory-protein-MarR transcriptional-regulator
23,018type_II_site-specific_deoxyribonucleaseType-II-restriction-enzyme type-II-site-specific-deoxyribonuclease
16,528transposase,_IS4Transposase
10,489ATP-dependent_RNA_helicaseCRISPR-associated-endonuclease/helicase Cas3; ATP-dependent
15,539CRISPR-associated_protein…CRISPR-associated protein Cmr3
14,799CRISPR-associated_protein…CRISPR-associated protein Csm1
18,976helicase_domain_proteinHelicase
20,220helicase-like_proteinHelicase
2237xylose_isomerase_domain…Metal-dependent; xylose-isomerase-domain-containing-protein
20,694xylose_isomerase_domain…Metal xylose-isomerase-domain-containing-protein
7805putative_YgiT-type_zinc_fingerZinc zinc-finger domain

16 other protein families annotated as hypothetical proteins.

If Luca could synthesize ATP with the help of proteins that were able to harness and transduce a geochemical pH gradient, where did the biochemical energy come from that allowed Luca to perform protein synthesis in the absence of chemiosmotic harnessing? It remains true that there are only two ways in which cells conserve energy in the form of ATP: chemiosmotic coupling and substrate level phosphorylation (SLP) [98]. Before the origin of chemiosmotic coupling, SLP would thus have been the only option. In essence, there have been only two viable suggestions for how the first cells might have harnessed energy via SLP: a geochemical supply of carbon monoxide [32] or a geochemical supply of methyl groups [66], [68] — on the early Earth, oxidations of organics from space will not support energy metabolism [96]. Both proposals focus on the reactions catalyzed by bifunctional acetyl-CoA synthase/carbon monoxide dehydrogenase (CODH/ACS), which condenses a Ni-bound methyl group with Ni-bound carbon monoxide (carbonyl) to generate a Ni-bound acetyl group that is removed from the enzyme via thiolysis to generate a thioester, which is cleaved by phosphate to yield acetyl phosphate, which phosphorylates ADP via SLP [85], [106]. Consistent with both proposals [32], [66] CODH/ACS is contained within the anaerobic core (Table 3). Additional evidence for the existence of abiotic synthesis within alkaline vents systems comes from laboratory experiments showing that the direct reduction of CO2 by H2 in the presence of pH gradients or electric potential can lead to the formation of formaldehyde, acetate and pyruvate [41], [93], [102]. Though an excellent source of energy and electrons, carbon monoxide has a very restricted role in biochemistry outside of energy metabolism [86]. Studies of abiotic synthesis under the far-from-equilibrium electrochemical vent conditions will help to clarify the presence or absence of CO under these conditions [41]. Methyl groups, on the other hand, have a very central and general role in microbial metabolism, in particular in S-adenosyl methionine (SAM) and in radical SAM enzymes, which have a myriad of functions in biosyntheses, in particular in cofactor biosyntheses [101], [119]. Of the 62 proteins in the ancient core, six (~ 10%) are SAM-dependent enzymes, making these the most common class of proteins in the ancient core behind “unknown function”. Clearly, the ancient core points to a central involvement of methyl groups in ancient metabolism. This is very much in line with the predications from the theory that life arose at hydrothermal vents: “…the biochemical system proposed [] would remain strictly dependent upon geochemically provided methyl groups up until the advent of (protein dependent) chemiosmotic harnessing” ([66], p. 1912). Recent geochemical studies have uncovered evidence for disequilibrium of one carbon species in the Von Damm hydrothermal field, indicating that there are distinct kinetic barriers to methane synthesis in some hydrothermal systems [72], such that a model for the origin of life that requires a geochemical source of chemically accessible methyl moieties is not asking for too much. That methyl groups have such a prominent place in the ancient anaerobic core is interesting and possibly significant. If vent conditions allowed methyl synthesis, then the synthesis of thioesters and acetyl phosphate would also, in principle, be possible [41], [93], [102]. Other proteins in the ancient anaerobic core, besides Mrp subunits, ATP synthase subunits, CODH/ACS subunits and SAM dependent enzymes include heterodisulfide reductase, an electron bifurcating enzyme essential to energy metabolism in methanogens [47], ferredoxin, flavodoxin, an iron (II) transport protein, and a thiocytidine biosynthesis protein involved tRNA modification. Four proteins in the list are membrane transport proteins, which would indicate that Luca existed in an environment where hydrophobic layers corresponding to the thickness of a lipid bilayer existed. That makes sense because neither the Mrp antiporter nor the rotor stator ATP synthase can function without a membrane, though the membrane need not be genetically encoded [16], as the synthesis of hydrophobic compounds at vents is expected from thermodynamics [3] and is observed in some modern hydrothermal systems [83], [97]. Glutamine synthase, the enzyme that introduces nitrogen into metabolism, is in the list, as are CoA-ligases and NADPH dependent flavin reductases, transducers of one electron to two electron transport. The latter, together with ferredoxin, flavodoxin, electron bifurcating enzymes and radical SAM enzymes point to a major role of one electron transport in Luca's metabolism. But where would the reduced nitrogen come from? For life to get started, reduced nitrogen had to be (at least locally) available, and both transition metal catalysts [28] and hydrothermal vent conditions [12] could have been a local source of ammonium. Moreover, the presence of a nitrogenase accessory protein in the 62 anaerobic families list can be an indication for the existence of an ancient nitrogenase in Luca, as previously proposed [69]. Thus, if we i) remove interdomain LGTs as Kannan et al. [46] suggested, ii) identify lineages that have the most Luca candidate genes, and iii) distinguish between anaerobic Luca candidate genes in anaerobic lineages (might be ancient) and aerotolerant Luca candidate genes in aerotolerant lineages (cannot be ancient) we are left with the result in Fig. 3a and c, namely that Luca looks most similar to modern bacterial and archaeal lineages that harbor anaerobic chemolithoautotrophs: clostridial-type acetogens and methanogens. This is noteworthy in the respect that acetogens and methanogens are the groups of organisms that stand in the foreground of theories for the origin of life that are based in microbial physiology and that connect well to geochemistry [66], [8], [59], [106]. The idea that life started at hydrothermal vents has been around for 30 years [7], it continued to be further developed. In our inference, we have ended up with something that is half-alive, a curious but necessary intermediate in the transition from non-living to living things.

Conclusions

The overall picture of core physiology in Luca that we infer from genome sequences is almost indistinguishable from that in Fig. 1c of Lane and Martin [59] that was obtained from comparative physiology, bioenergetics, and theory. That two completely independent approaches converge on the same set of proteins and functions in early bioenergetics is noteworthy. It indicates that a version of the hydrothermal vent theory focusing on the acetyl-CoA pathway, methyl synthesis, acetogens and methanogens [66] possesses an element of robustness in that it interfaces well with the physiology of anaerobic prokaryotes [15], [98], with thermodynamics [3], [4], with findings from geochemistry about serpentinization and hydrothermal vents [71], [97], and as we have shown here, with comparative genomics. Even ribosomal phylogenies tend to agree with the predictions of the model in that newer metagenomic indicate a greater antiquity for methanogen-related metabolism within the archaea than previously assumed [30], and newer phylogenetic data put methanogenesis at the root of the archaeal domain [67], [87]. In contrast to today's oxidized and strongly oxidizing environment, the highly reducing hydrothermal setting on a young, metal-rich, anaerobic Earth that we have in mind as Luca's residence [106] offers very favorable thermodynamic conditions for the synthesis of Luca's building blocks [3], [5], [71], [97]. When we look at Luca as an anaerobe and as the common ancestor of prokaryotes, we obtain a picture of its genome that resembles clostridial acetogens and methanogens. With regard to the most primitive forms of microbial physiology, microbiologists reached the same conclusion 45 years ago [26], namely that methanogens and acetogens probably represent the most ancient lineages [36]. We required 2000 genomes and powerful computers for our conclusions, while Decker et al. just thought about it. Evidently, just thinking about things can be a source of scientific progress. The following are the supplementary data related to this article.

Supplementary Table A1

Oxygen requirements of the organisms present in the 1045 protein families.

Supplementary Table A2

Functional annotation.

Transparency document

Transparency document.
  115 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  A novel scenario for the evolution of haem-copper oxygen reductases.

Authors:  M M Pereira; M Santana; M Teixeira
Journal:  Biochim Biophys Acta       Date:  2001-06-01

3.  Sulphur isotope evidence for an oxic Archaean atmosphere.

Authors:  Hiroshi Ohmoto; Yumiko Watanabe; Hiroaki Ikemi; Simon R Poulson; Bruce E Taylor
Journal:  Nature       Date:  2006-08-24       Impact factor: 49.962

Review 4.  The energetics of organic synthesis inside and outside the cell.

Authors:  Jan P Amend; Douglas E LaRowe; Thomas M McCollom; Everett L Shock
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2013-06-10       Impact factor: 6.237

5.  Oxygen-dependent coproporphyrinogen III oxidase (HemF) from Escherichia coli is stimulated by manganese.

Authors:  Daniela Breckau; Esther Mahlitz; Anselm Sauerwald; Gunhild Layer; Dieter Jahn
Journal:  J Biol Chem       Date:  2003-09-15       Impact factor: 5.157

6.  Comparative genomics of the vitamin B12 metabolism and regulation in prokaryotes.

Authors:  Dmitry A Rodionov; Alexey G Vitreschak; Andrey A Mironov; Mikhail S Gelfand
Journal:  J Biol Chem       Date:  2003-07-17       Impact factor: 5.157

Review 7.  Comparison between the nitric oxide reductase family and its aerobic relatives, the cytochrome oxidases.

Authors:  S de Vries; I Schröder
Journal:  Biochem Soc Trans       Date:  2002-08       Impact factor: 5.407

8.  A bioenergetic basis for membrane divergence in archaea and bacteria.

Authors:  Víctor Sojo; Andrew Pomiankowski; Nick Lane
Journal:  PLoS Biol       Date:  2014-08-12       Impact factor: 8.029

9.  Sequencing of seven haloarchaeal genomes reveals patterns of genomic flux.

Authors:  Erin A Lynch; Morgan G I Langille; Aaron Darling; Elizabeth G Wilbanks; Caitlin Haltiner; Katie S Y Shao; Michael O Starr; Clotilde Teiling; Timothy T Harkins; Robert A Edwards; Jonathan A Eisen; Marc T Facciotti
Journal:  PLoS One       Date:  2012-07-24       Impact factor: 3.240

10.  Algorithms for computing parsimonious evolutionary scenarios for genome evolution, the last universal common ancestor and dominance of horizontal gene transfer in the evolution of prokaryotes.

Authors:  Boris G Mirkin; Trevor I Fenner; Michael Y Galperin; Eugene V Koonin
Journal:  BMC Evol Biol       Date:  2003-01-06       Impact factor: 3.260

View more
  19 in total

1.  Functional Role of MrpA in the MrpABCDEFG Na+/H+ Antiporter Complex from the Archaeon Methanosarcina acetivorans.

Authors:  Ricardo Jasso-Chávez; César Diaz-Perez; José S Rodríguez-Zavala; James G Ferry
Journal:  J Bacteriol       Date:  2016-12-28       Impact factor: 3.490

2.  Integrative modeling of gene and genome evolution roots the archaeal tree of life.

Authors:  Tom A Williams; Gergely J Szöllősi; Anja Spang; Peter G Foster; Sarah E Heaps; Bastien Boussau; Thijs J G Ettema; T Martin Embley
Journal:  Proc Natl Acad Sci U S A       Date:  2017-05-22       Impact factor: 11.205

3.  On the evolution of chaperones and cochaperones and the expansion of proteomes across the Tree of Life.

Authors:  Mathieu E Rebeaud; Saurav Mallik; Pierre Goloubinoff; Dan S Tawfik
Journal:  Proc Natl Acad Sci U S A       Date:  2021-05-25       Impact factor: 11.205

4.  Nitroreductases of bacterial origin in Giardia lamblia: Potential role in detoxification of xenobiotics.

Authors:  Joachim Müller; Norbert Müller
Journal:  Microbiologyopen       Date:  2019-07-25       Impact factor: 3.139

Review 5.  Complement and human T cell metabolism: Location, location, location.

Authors:  Erin E West; Natalia Kunz; Claudia Kemper
Journal:  Immunol Rev       Date:  2020-03-12       Impact factor: 10.983

6.  A Ferredoxin- and F420H2-Dependent, Electron-Bifurcating, Heterodisulfide Reductase with Homologs in the Domains Bacteria and Archaea.

Authors:  Zhen Yan; Mingyu Wang; James G Ferry
Journal:  mBio       Date:  2017-02-07       Impact factor: 7.867

7.  Metabolic and evolutionary patterns in the extremely acidophilic archaeon Ferroplasma acidiphilum YT.

Authors:  Olga V Golyshina; Hai Tran; Oleg N Reva; Sofia Lemak; Alexander F Yakunin; Alexander Goesmann; Taras Y Nechitaylo; Violetta LaCono; Francesco Smedile; Alexei Slesarev; David Rojo; Coral Barbas; Manuel Ferrer; Michail M Yakimov; Peter N Golyshin
Journal:  Sci Rep       Date:  2017-06-16       Impact factor: 4.379

8.  Discovery of anaerobic lithoheterotrophic haloarchaea, ubiquitous in hypersaline habitats.

Authors:  Dimitry Y Sorokin; Enzo Messina; Francesco Smedile; Pawel Roman; Jaap S Sinninghe Damsté; Sergio Ciordia; Maria Carmen Mena; Manuel Ferrer; Peter N Golyshin; Ilya V Kublanov; Nazar I Samarov; Stepan V Toshchakov; Violetta La Cono; Michail M Yakimov
Journal:  ISME J       Date:  2017-01-20       Impact factor: 10.302

Review 9.  Keeping It All Going-Complement Meets Metabolism.

Authors:  Martin Kolev; Claudia Kemper
Journal:  Front Immunol       Date:  2017-01-18       Impact factor: 7.561

10.  One-carbon metabolism, folate, zinc and translation.

Authors:  Antoine Danchin; Agnieszka Sekowska; Conghui You
Journal:  Microb Biotechnol       Date:  2020-03-09       Impact factor: 5.813

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.