| Literature DB >> 31573151 |
Alfonso Esposito1, Sabrina Tamburini2, Luca Triboli1, Luca Ambrosino3, Maria Luisa Chiusano4, Olivier Jousson1.
Abstract
Acetogenic bacteria are obligate anaerobes with the ability of converting carbon dioxide and other one-carbon substrates into acetate through the Wood-Ljungdahl (WL) pathway. These substrates are becoming increasingly important feedstock in industrial microbiology. The main potential industrial application of acetogenic bacteria is the production of metabolites that constitute renewable energy sources (biofuel); such bacteria are of particular interest for this purpose thanks to their low energy requirements for large-scale cultivation. Here, we report new genome sequences for four species, three of them are reported for the first time, namely Acetobacterium paludosum DSM 8237, Acetobacterium tundrae DSM 917, Acetobacterium bakii DSM 8239, and Alkalibaculum bacchi DSM 221123. We performed a comparative genomic analysis focused on the WL pathway's genes and their encoded proteins, using Acetobacterium woodii as a reference genome. The Average Nucleotide Identity (ANI) values ranged from 70% to 95% over an alignment length of 5.4-6.5 Mbp. The core genome consisted of 363 genes, whereas the number of unique genes in a single genome ranged from 486 in A. tundrae to 2360 in A.bacchi. No significant rearrangements were detected in the gene order for the Wood-Ljungdahl pathway however, two species showed variations in genes involved in formate metabolism: A. paludosum harbor two copies of fhs1, and A. bakii a truncated fdhF1. The analysis of protein networks highlighted the expansion of protein orthologues in A. woodii compared to A. bacchi, whereas protein networks involved in the WL pathway were more conserved. This study has increased our understanding on the evolution of the WL pathway in acetogenic bacteria.Entities:
Keywords: Acetogens; Comparative genomics; NGS; Wood-Ljungdahl pathway
Mesh:
Substances:
Year: 2019 PMID: 31573151 PMCID: PMC6925170 DOI: 10.1002/mbo3.938
Source DB: PubMed Journal: Microbiologyopen ISSN: 2045-8827 Impact factor: 3.139
Figure 1Graphic depiction of the Wood–Ljungdahl pathway including the genes involved in each step of the pathway. Colors represent the gene clusters; THF: tetrahydrofolate; fdhF1 and 2: formate dehydrogenase 1 and 2; fhs1: formyl‐THF synthetase; fchA:methenyl‐THF cyclohydrolase, folD: methylene‐THF dehydrogenase; metVF: methylene‐THF reductase; rnfC2: rnfC‐like protein. Redrawn from Poehlein et al. (2012)
NGS data and genome assembly statistics
| # read pairs | # contigs | N50 | Tot. length | % GC | |
|---|---|---|---|---|---|
|
| 553976 | 49 | 186894 | 3,116,598 | 34.71 |
|
| 786768 | 43 | 285194 | 4,163,517 | 41.21 |
|
| 1158287 | 54 | 179628 | 3,691,131 | 40.04 |
|
| 757003 | 66 | 154452 | 3,563,081 | 39.64 |
Figure 2Hierarchically clustered heatmap of ANI calculated using blastn (left), and alignment length (right) between the five genomes
Genome annotation statistics, including number of CDS predicted by Prokka, antiSMASH gene clusters analysis and protein family annotation by eggNOG mapper (for A. woodii the analysis was done on the reference strain with acc.no. CP002987)
| Coding sequences (CDS) | Avg. # CDS per Kb | Avg. gene length | % genome containing CDS | #rRNA | #tRNA | # Protein Families | Secondary metabolites gene clusters found by antiSMASH | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
Bacteriocin/ Microcin | Terpene | NRPS | fatty acids | saccharide | others | ||||||||
|
| 3618 | 0.89 | 951.6 | 85.11 | 16 | 58 | 2698 | 1 | 0 | 2 | 1 | 4 | 9 |
|
| 2860 | 0.92 | 898.7 | 82.48 | 6 | 55 | 2205 | 1 | 0 | 1 | 1 | 4 | 5 |
|
| 3822 | 1.23 | 936.6 | 85.97 | 5 | 48 | 2740 | 2 | 1 | 0 | 1 | 4 | 8 |
|
| 3363 | 1.08 | 947.2 | 86.3 | 6 | 53 | 2487 | 2 | 0 | 0 | 1 | 3 | 9 |
|
| 3330 | 1.07 | 919.2 | 85.13 | 6 | 54 | 2411 | 3 | 0 | 1 | 1 | 3 | 10 |
Annotation of the genes in the core genome
| RefSeq name in | Cluster number | Gene name |
|
| ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Contig | Start | End | Length | Contig | Start | End | Length | |||
| WP_014355214.1 | 1 | fdhF1 | NC_016894.1 | 944951 | 947125 | 2174 | NODE_17_length_58697_cov_40.1842 | 55126 | 57810 | 2684 |
| WP_014355215.1 | 1 | hycB1 | NC_016894.1 | 947122 | 947655 | 533 | not found | |||
| WP_014355216.1 | 1 | fdhF2 | NC_016894.1 | 947921 | 950089 | 2168 | NODE_29_length_7652_cov_43.4377 | 4056 | 6758 | 2702 |
| WP_014355217.1 | 1 | hycB2 | NC_016894.1 | 950093 | 950623 | 530 | not found | |||
| WP_083837833.1 | 1 | fdhD | NC_016894.1 | 950758 | 951549 | 791 | NODE_17_length_58697_cov_40.1842 | 50333 | 51133 | 800 |
| WP_014355219.1 | 1 | hycB3 | NC_016894.1 | 951566 | 952126 | 560 | not found | |||
| WP_014355220.1 | 1 | hydA1 | NC_016894.1 | 952144 | 953523 | 1379 | not found | |||
| WP_014355320.1 | 2 | fhs1 | NC_016894.1 | 1080969 | 1082645 | 1676 | NODE_3_length_279548_cov_33.281 | 195911 | 197584 | 1673 |
| WP_014355321.1 | 2 | fchA | NC_016894.1 | 1082745 | 1083404 | 659 | NODE_3_length_279548_cov_33.281 | 197704 | 198330 | 626 |
| WP_014355322.1 | 2 | folD | NC_016894.1 | 1083442 | 1084347 | 905 | NODE_3_length_279548_cov_33.281 | 198346 | 199197 | 851 |
| WP_014355323.1 | 2 | rnfC2 | NC_016894.1 | 1084375 | 1086339 | 1964 | NODE_7_length_185859_cov_36.1889 | 108899 | 110863 | 1964 |
| WP_014355324.1 | 2 | metV | NC_016894.1 | 1086341 | 1086958 | 617 | NODE_7_length_185859_cov_36.1889 | 108265 | 108897 | 632 |
| WP_014355325.1 | 2 | metF | NC_016894.1 | 1086992 | 1087888 | 896 | NODE_7_length_185859_cov_36.1889 | 107312 | 108193 | 881 |
| WP_014355456.1 | 3 | cooC1 | NC_016894.1 | 1235110 | 1235895 | 785 | NODE_3_length_279548_cov_33.281 | 182407 | 183177 | 770 |
| WP_014355457.1 | 3 | acsV | NC_016894.1 | 1235961 | 1237886 | 1925 | NODE_3_length_279548_cov_33.281 | 187232 | 188480 | 1248 |
| WP_014355458.1 | 3 | orf1 | NC_016894.1 | 1237902 | 1238549 | 647 | not found | |||
| WP_014355459.1 | 3 | orf2 | NC_016894.1 | 1238546 | 1239205 | 659 | not found | |||
| WP_014355460.1 | 3 | acsD | NC_016894.1 | 1239392 | 1240327 | 935 | NODE_3_length_279548_cov_33.281 | 183192 | 184139 | 947 |
| WP_014355461.1 | 3 | acsC | NC_016894.1 | 1240347 | 1241687 | 1340 | NODE_3_length_279548_cov_33.281 | 184168 | 185508 | 1340 |
| WP_014355462.1 | 3 | acsE | NC_016894.1 | 1241757 | 1242542 | 785 | NODE_3_length_279548_cov_33.281 | 185552 | 186337 | 785 |
| WP_014355463.1 | 3 | acsA | NC_016894.1 | 1242813 | 1244711 | 1898 | NODE_3_length_279548_cov_33.282 | 177291 | 179183 | 1892 |
| WP_014355464.1 | 3 | cooC2 | NC_016894.1 | 1244738 | 1245523 | 785 | NODE_3_length_279548_cov_33.282 | 179205 | 179794 | 589 |
| WP_041670690.1 | 3 | acsB1 | NC_016894.1 | 1245585 | 1247753 | 2168 | NODE_3_length_279548_cov_33.282 | 180358 | 182149 | 1791 |
Figure 3Venn diagram summarizing the number of shared and unique genes as inferred by Roary
Figure 4Organization of the three gene clusters in the four Acetobacterium genomes. Orthologues are connected with purple shades
Genomic coordinates of the WL pathway genes in A. woodii in comparison with A. bacchi
| Gene name | Annotation |
|---|---|
| ackA | Acetate kinase |
| acoA | "Acetoin:2,6‐dichlorophenolindophenol oxidoreductase subunit alpha" |
| acsC | Corrinoid/iron‐sulfur protein large subunit |
| acsE | 5‐methyltetrahydrofolate:corrinoid/iron‐sulfur protein co‐methyltransferase |
| alaA | Glutamate‐pyruvate aminotransferase AlaA |
| alaS | Alanine‐‐tRNA ligase |
| apbC | Iron‐sulfur cluster carrier protein |
| apeA | putative M18 family aminopeptidase 1 |
| arcB | "Ornithine carbamoyltransferase 2, catabolic" |
| argC | N‐acetyl‐gamma‐glutamyl‐phosphate reductase |
| argD | acetylornithine aminotransferase ArgD1 |
| argG | Argininosuccinate synthase |
| argH | Argininosuccinate lyase |
| argS | Arginine‐‐tRNA ligase |
| artM | Arginine transport ATP‐binding protein ArtM |
| asd2 | Aspartate‐semialdehyde dehydrogenase 2 |
| aspS | Aspartate‐‐tRNA ligase |
| asrA | Anaerobic sulfite reductase subunit A |
| asrB | Anaerobic sulfite reductase subunit B |
| asrC | Anaerobic sulfite reductase subunit C |
| atpA | ATP synthase subunit alpha |
| atpB | ATP synthase subunit a |
| atpD | "ATP synthase subunit beta, sodium ion specific" |
| azr | FMN‐dependent NADPH‐azoreductase |
| bfmB | Methoxymalonate biosynthesis protein |
| carE | Caffeyl‐CoA reductase‐Etf complex subunit CarE |
| cbiF | Cobalt‐precorrin‐4 C(11)‐methyltransferase |
| cbiH | putative cobalt‐factor III C(17)‐methyltransferase |
| cfiB | 2‐oxoglutarate carboxylase small subunit |
| cheY | Chemotaxis protein CheY |
| clpP | ATP‐dependent Clp protease proteolytic subunit |
| clpX | ATP‐dependent Clp protease ATP‐binding subunit ClpX |
| clpY | ATP‐dependent protease ATPase subunit ClpY |
| coaX | Type III pantothenate kinase |
| cooS1 | Carbon monoxide dehydrogenase 1 |
| crh | HPr‐like protein Crh |
| csd | putative cysteine desulfurase |
| cysK1 | O‐acetylserine sulfhydrylase |
| cysS | Cysteine‐‐tRNA ligase |
| dcd | dCTP deaminase |
| ddpD | putative D%2CD‐dipeptide transport ATP‐binding protein DdpD |
| der | GTPase Der |
| dmdA | 2%2C3‐dimethylmalate dehydratase large subunit |
| dnaA | Chromosomal replication initiator protein DnaA |
| dnaE | DNA polymerase III subunit alpha |
| drrA | Daunorubicin/doxorubicin resistance ATP‐binding protein DrrA |
| dtd | D‐aminoacyl‐tRNA deacylase |
| dut | Deoxyuridine 5'‐triphosphate nucleotidohydrolase |
| dxs | 1‐deoxy‐D‐xylulose‐5‐phosphate synthase |
| ecfA1 | Energy‐coupling factor transporter ATP‐binding protein EcfA1 |
| ecfA2 | Energy‐coupling factor transporter ATP‐binding protein EcfA2 |
| ecfT | Energy‐coupling factor transporter transmembrane protein EcfT |
| ecsA | ABC‐type transporter ATP‐binding protein EcsA |
| efp | Elongation factor P |
| eno | Enolase |
| era | GTPase Era |
| fba | Fructose‐bisphosphate aldolase |
| fbp | Fructose‐1%2C6‐bisphosphatase class 3 |
| fchA | Methenyltetrahydrofolate cyclohydrolase |
| ffh | Signal recognition particle protein |
| fom3 | 2‐hydroxyethylphosphonate methyltransferase |
| frr | Ribosome‐recycling factor |
| ftsH | ATP‐dependent zinc metalloprotease FtsH |
| ftsZ | Cell division protein FtsZ |
| fumA | Fumarate hydratase class I%2C aerobic |
| fusA | Elongation factor G |
| gap | Glyceraldehyde‐3‐phosphate dehydrogenase |
| gatA | Glutamyl‐tRNA(Gln) amidotransferase subunit A |
| gatB | Aspartyl/glutamyl‐tRNA(Asn/Gln) amidotransferase subunit B |
| gatC | Aspartyl/glutamyl‐tRNA(Asn/Gln) amidotransferase subunit C |
| glmM | Phosphoglucosamine mutase |
| glmS | Glutamine‐‐fructose‐6‐phosphate aminotransferase [isomerizing] |
| glnH | Glutamine‐binding periplasmic protein |
| glnS | Glutamine‐‐tRNA ligase |
| glpK | Glycerol kinase |
| gltB | Ferredoxin‐dependent glutamate synthase 1 |
| gltD | Glutamate synthase [NADPH] small chain |
| glyA | Serine hydroxymethyltransferase |
| glyQS | Glycine‐‐tRNA ligase |
| gmk | Guanylate kinase |
| gpmI | 2%2C3‐bisphosphoglycerate‐independent phosphoglycerate mutase |
| graR | Response regulator protein GraR |
| groS | 10 kDa chaperonin |
| gtaB | UTP‐‐glucose‐1‐phosphate uridylyltransferase |
| guaA | GMP synthase [glutamine‐hydrolyzing] |
| guaB | Inosine‐5'‐monophosphate dehydrogenase |
| gyrA | DNA gyrase subunit A |
| gyrB | DNA gyrase subunit B |
| hadI | 2‐hydroxyisocaproyl‐CoA dehydratase activator |
| hcp | Hydroxylamine reductase |
| hemL | Glutamate‐1‐semialdehyde 2%2C1‐aminomutase |
| hicd | Homoisocitrate dehydrogenase |
| hinT | Purine nucleoside phosphoramidase |
| hisD | Histidinol dehydrogenase |
| hisF | Imidazole glycerol phosphate synthase subunit HisF |
| hisG | ATP phosphoribosyltransferase |
| hisH | Imidazole glycerol phosphate synthase subunit HisH |
| hisI | Phosphoribosyl‐AMP cyclohydrolase |
| hrb | High molecular weight rubredoxin |
| hslR | Heat shock protein 15 |
| hslV | ATP‐dependent protease subunit HslV |
| htpG | Chaperone protein HtpG |
| hup | DNA‐binding protein HU |
| ileS | Isoleucine‐‐tRNA ligase |
| ilvB | Acetolactate synthase large subunit |
| ilvC | Ketol‐acid reductoisomerase (NADP(+)) |
| ilvD | Dihydroxy‐acid dehydratase |
| ilvH | Putative acetolactate synthase small subunit |
| ilvK | Branched‐chain‐amino‐acid aminotransferase 2 |
| infA | Translation initiation factor IF‐1 |
| infC | Translation initiation factor IF‐3 |
| iscS | Cysteine desulfurase IscS |
| iscU | Iron‐sulfur cluster assembly scaffold protein IscU |
| ispF | 2‐C‐methyl‐D‐erythritol 2%2C4‐cyclodiphosphate synthase |
| ispG | 4‐hydroxy‐3‐methylbut‐2‐en‐1‐yl diphosphate synthase (flavodoxin) |
| lepA | Elongation factor 4 |
| leuB | 3‐isopropylmalate dehydrogenase |
| leuD1 | 3‐isopropylmalate dehydratase small subunit 1 |
| leuS | Leucine‐‐tRNA ligase |
| livF | High‐affinity branched‐chain amino acid transport ATP‐binding protein LivF |
| livH | High‐affinity branched‐chain amino acid transport system permease protein LivH |
| lon1 | Lon protease 1 |
| lptB | Lipopolysaccharide export system ATP‐binding protein LptB |
| lysC | Aspartokinase |
| lysS | Lysine‐‐tRNA ligase |
| map | Methionine aminopeptidase 1 |
| metA | Homoserine O‐succinyltransferase |
| metG | Methionine‐‐tRNA ligase |
| metH | Methionine synthase |
| metI | D‐methionine transport system permease protein MetI |
| metN | Methionine import ATP‐binding protein MetN |
| metQ | Methionine‐binding lipoprotein MetQ |
| mgl | L‐methionine gamma‐lyase |
| miaB | tRNA‐2‐methylthio‐N(6)‐dimethylallyladenosine synthase |
| minD | Septum site‐determining protein MinD |
| mnmA | tRNA‐specific 2‐thiouridylase MnmA |
| mnmG | tRNA uridine 5‐carboxymethylaminomethyl modification enzyme MnmG |
| mog | Molybdopterin adenylyltransferase |
| mop | Aldehyde oxidoreductase |
| mprA | Response regulator MprA |
| mraZ | Transcriptional regulator MraZ |
| murAB | UDP‐N‐acetylglucosamine 1‐carboxyvinyltransferase 2 |
| nikB | Nickel transport system permease protein NikB |
| nrdD | Anaerobic ribonucleoside‐triphosphate reductase |
| nrdJ | Vitamin B12‐dependent ribonucleotide reductase |
| nrdR | Transcriptional repressor NrdR |
| nspC | Carboxynorspermidine/carboxyspermidine decarboxylase |
| nth | Endonuclease III |
| ntpB | V‐type sodium ATPase subunit B |
| nusA | Transcription termination/antitermination protein NusA |
| nusG | Transcription termination/antitermination protein NusG |
| obg | GTPase Obg |
| oppF | Oligopeptide transport ATP‐binding protein OppF |
| paaK | Phenylacetate‐coenzyme A ligase |
| pduL | Phosphate propanoyltransferase |
| pfkA | ATP‐dependent 6‐phosphofructokinase |
| pgk | Phosphoglycerate kinase |
| pgsA | CDP‐diacylglycerol‐‐glycerol‐3‐phosphate 3‐phosphatidyltransferase |
| pheS | Phenylalanine‐‐tRNA ligase alpha subunit |
| pmpR | Transcriptional regulatory protein PmpR |
| pncB2 | Nicotinate phosphoribosyltransferase pncB2 |
| pnp | Polyribonucleotide nucleotidyltransferase |
| ppdK | Pyruvate%2C phosphate dikinase |
| ppiB | Peptidyl‐prolyl cis‐trans isomerase B |
| prfA | Peptide chain release factor 1 |
| prfB | Peptide chain release factor 2 |
| proA | Gamma‐glutamyl phosphate reductase |
| proS | Proline‐‐tRNA ligase |
| prs | Ribose‐phosphate pyrophosphokinase |
| pstB3 | Phosphate import ATP‐binding protein PstB 3 |
| pstC | Phosphate transport system permease protein PstC |
| pstS | Phosphate‐binding protein PstS |
| ptsI | Phosphoenolpyruvate‐protein phosphotransferase |
| purC | Phosphoribosylaminoimidazole‐succinocarboxamide synthase |
| purD | Phosphoribosylamine‐‐glycine ligase |
| purE | N5‐carboxyaminoimidazole ribonucleotide mutase |
| purF | Amidophosphoribosyltransferase |
| purH | Bifunctional purine biosynthesis protein PurH |
| purU | Formyltetrahydrofolate deformylase |
| pyrB | Aspartate carbamoyltransferase catalytic subunit |
| pyrD | Dihydroorotate dehydrogenase B (NAD(+))%2C catalytic subunit |
| pyrE | Orotate phosphoribosyltransferase |
| pyrF | Orotidine 5'‐phosphate decarboxylase |
| pyrG | CTP synthase |
| pyrH | Uridylate kinase |
| pyrI | Aspartate carbamoyltransferase regulatory chain |
| queA | S‐adenosylmethionine:tRNA ribosyltransferase‐isomerase |
| rarA | Replication‐associated recombination protein A |
| recA | Protein RecA |
| recU | Holliday junction resolvase RecU |
| rffG | dTDP‐glucose 4%2C6‐dehydratase 2 |
| rhlE | ATP‐dependent RNA helicase RhlE |
| rho | Transcription termination factor Rho |
| ribH | 6%2C7‐dimethyl‐8‐ribityllumazine synthase |
| rlmH | Ribosomal RNA large subunit methyltransferase H |
| rlmL | Ribosomal RNA large subunit methyltransferase K/L |
| rmlA | Glucose‐1‐phosphate thymidylyltransferase |
| rnfC | Electron transport complex subunit RnfC |
| rnfE | Electron transport complex subunit RnfE |
| rnhA | Ribonuclease H |
| rnjA | Ribonuclease J1 |
| rny | Ribonuclease Y |
| rph | Ribonuclease PH |
| rplA | 50S ribosomal protein L1 |
| rplB | 50S ribosomal protein L2 |
| rplC | 50S ribosomal protein L3 |
| rplD | 50S ribosomal protein L4 |
| rplE | 50S ribosomal protein L5 |
| rplF | 50S ribosomal protein L6 |
| rplJ | 50S ribosomal protein L10 |
| rplK | 50S ribosomal protein L11 |
| rplL | 50S ribosomal protein L7/L12 |
| rplM | 50S ribosomal protein L13 |
| rplN | 50S ribosomal protein L14 |
| rplO | 50S ribosomal protein L15 |
| rplP | 50S ribosomal protein L16 |
| rplQ | 50S ribosomal protein L17 |
| rplR | 50S ribosomal protein L18 |
| rplS | 50S ribosomal protein L19 |
| rplT | 50S ribosomal protein L20 |
| rplU | 50S ribosomal protein L21 |
| rplV | 50S ribosomal protein L22 |
| rplW | 50S ribosomal protein L23 |
| rplX | 50S ribosomal protein L24 |
| rpmA | 50S ribosomal protein L27 |
| rpmB | 50S ribosomal protein L28 |
| rpmC | 50S ribosomal protein L29 |
| rpmD | 50S ribosomal protein L30 |
| rpmE | 50S ribosomal protein L31 |
| rpmF | 50S ribosomal protein L32 |
| rpmG | 50S ribosomal protein L33 |
| rpmI | 50S ribosomal protein L35 |
| rpoA | DNA‐directed RNA polymerase subunit alpha |
| rpoB | DNA‐directed RNA polymerase subunit beta |
| rpoC | DNA‐directed RNA polymerase subunit beta' |
| rpoZ | DNA‐directed RNA polymerase subunit omega |
| rpsB | 30S ribosomal protein S2 |
| rpsC | 30S ribosomal protein S3 |
| rpsD | 30S ribosomal protein S4 |
| rpsE | 30S ribosomal protein S5 |
| rpsF | 30S ribosomal protein S6 |
| rpsG | 30S ribosomal protein S7 |
| rpsH | 30S ribosomal protein S8 |
| rpsI | 30S ribosomal protein S9 |
| rpsJ | 30S ribosomal protein S10 |
| rpsK | 30S ribosomal protein S11 |
| rpsL | 30S ribosomal protein S12 |
| rpsM | 30S ribosomal protein S13 |
| rpsO | 30S ribosomal protein S15 |
| rpsP | 30S ribosomal protein S16 |
| rpsQ | 30S ribosomal protein S17 |
| rpsR | 30S ribosomal protein S18 |
| rpsS | 30S ribosomal protein S19 |
| rpsT | 30S ribosomal protein S20 |
| rpsU | 30S ribosomal protein S21 |
| rsfS | Ribosomal silencing factor RsfS |
| rsmH | Ribosomal RNA small subunit methyltransferase H |
| rsxA | Electron transport complex subunit RsxA |
| rsxB | Electron transport complex subunit RsxB |
| rsxD | Electron transport complex subunit RsxD |
| ruvB | Holliday junction ATP‐dependent DNA helicase RuvB |
| sbcD | Nuclease SbcCD subunit D |
| secA | Protein translocase subunit SecA |
| secY | Protein translocase subunit SecY |
| serC | Phosphoserine aminotransferase |
| serS | Serine‐‐tRNA ligase |
| sigA | RNA polymerase sigma factor SigA |
| smpB | SsrA‐binding protein |
| soj | Sporulation initiation inhibitor protein Soj |
| speA | Arginine decarboxylase |
| speB | Agmatinase |
| speD | S‐adenosylmethionine decarboxylase proenzyme |
| speE | Polyamine aminopropyltransferase |
| spoIIIE | DNA translocase SpoIIIE |
| spoVG | Putative septation protein SpoVG |
| sucB | Dihydrolipoyllysine‐residue succinyltransferase component of 2‐oxoglutarate dehydrogenase complex |
| tdcB | L‐threonine ammonia‐lyase |
| tgt | Queuine tRNA‐ribosyltransferase |
| thiC | Phosphomethylpyrimidine synthase |
| thiD | Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase |
| thiH | 2‐iminoacetate synthase |
| thiM | Hydroxyethylthiazole kinase |
| thiQ | Thiamine import ATP‐binding protein ThiQ |
| thrZ | Threonine‐‐tRNA ligase 2 |
| thyX | Flavin‐dependent thymidylate synthase |
| tktA | Transketolase 1 |
| trmL | tRNA (cytidine(34)‐2'‐O)‐methyltransferase |
| trpB | Tryptophan synthase beta chain |
| trpS | Tryptophan‐‐tRNA ligase |
| tsf | Elongation factor Ts |
| typA | GTP‐binding protein TypA/BipA |
| tyrS | Tyrosine‐‐tRNA ligase |
| ung | Uracil‐DNA glycosylase |
| upp | Uracil phosphoribosyltransferase |
| uppP | Undecaprenyl‐diphosphatase |
| uvrA | UvrABC system protein A |
| uvrB | UvrABC system protein B |
| valS | Valine‐‐tRNA ligase |
| walR | Transcriptional regulatory protein WalR |
| xpt | Xanthine phosphoribosyltransferase |
| ybiT | putative ABC transporter ATP‐binding protein YbiT |
| ychF | Ribosome‐binding ATPase YchF |
| ydcP | putative protease YdcP |
| yitJ | Bifunctional homocysteine S‐methyltransferase/5%2C10‐methylenetetrahydrofolate reductase |
| yknY | putative ABC transporter ATP‐binding protein YknY |
| yrrK | Putative pre‐16S rRNA nuclease |
| yxdL | ABC transporter ATP‐binding protein YxdL |
Figure 5Venn diagram summarizing the number of networks that include proteins from the five considered species
Figure 6Overview of the defined protein networks highlighting the respective distribution per species. (a) Bar chart showing the number of networks classified according to their size; (b) Scatter plots showing the distribution of the networks based on the respective number of proteins from A. woodii compared to the other considered species. Circle diameter is proportional to the number of BBHs within each network
Figure A1Extended version of Figure 7 showing the proteins of the three clusters of the WLP
Figure 7Selected networks displaying different amplification patterns in genes involved in the Wood–Ljungdahl pathway. An extended version of this figure including all proteins of the WL pathway is presented in Figure A1