| Literature DB >> 32082281 |
Arkadiy I Garber1,2, Kenneth H Nealson1, Akihiro Okamoto3, Sean M McAllister4, Clara S Chan2,4, Roman A Barco1, Nancy Merino1,5,6.
Abstract
Iron is a micronutrient for nearly all life on Earth. It can be used as an electron donor and electron acceptor by iron-oxidizing and iron-reducing microorganisms and is used in a variety of biological processes, including photosynthesis and respiration. While it is the fourth most abundant metal in the Earth's crust, iron is often limiting for growth in oxic environments because it is readily oxidized and precipitated. Much of our understanding of how microorganisms compete for and utilize iron is based on laboratory experiments. However, the advent of next-generation sequencing and surge in publicly available sequence data has made it possible to probe the structure and function of microbial communities in the environment. To bridge the gap between our understanding of iron acquisition, iron redox cycling, iron storage, and magnetosome formation in model microorganisms and the plethora of sequence data available from environmental studies, we have created a comprehensive database of hidden Markov models (HMMs) based on genes related to iron acquisition, storage, and reduction/oxidation in Bacteria and Archaea. Along with this database, we present FeGenie, a bioinformatics tool that accepts genome and metagenome assemblies as input and uses our comprehensive HMM database to annotate provided datasets with respect to iron-related genes and gene neighborhood. An important contribution of this tool is the efficient identification of genes involved in iron oxidation and dissimilatory iron reduction, which have been largely overlooked by standard annotation pipelines. We validated FeGenie against a selected set of 28 isolate genomes and showcase its utility in exploring iron genes present in 27 metagenomes, 4 isolate genomes from human oral biofilms, and 17 genomes from candidate organisms, including members of the candidate phyla radiation. We show that FeGenie accurately identifies iron genes in isolates. Furthermore, analysis of metagenomes using FeGenie demonstrates that the iron gene repertoire and abundance of each environment is correlated with iron richness. While this tool will not replace the reliability of culture-dependent analyses of microbial physiology, it provides reliable predictions derived from the most up-to-date genetic markers. FeGenie's database will be maintained and continually updated as new genes are discovered. FeGenie is freely available: https://github.com/Arkadiy-Garber/FeGenie.Entities:
Keywords: hidden Markov model (HMM) database; iron gene regulation; iron oxidation; iron reduction; iron storage; iron transport; magnetosome; siderophore
Year: 2020 PMID: 32082281 PMCID: PMC7005843 DOI: 10.3389/fmicb.2020.00037
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Scheme of known iron-oxidizers and iron-reducers. There are several different types of iron-oxidizers known, with more information on Gram-negative (A) bacteria compared to Gram-positive (B) bacteria (note: the acidophilic aerobic iron-oxidizers can use either a copper protein or cytochrome c to transfer electrons in the periplasm). (C) For iron-reducers, there are only two mechanisms known and under anaerobic conditions. The genes identified by FeGenie are in boxes above each type, with the exception of Cyt b573, which has yet to be confirmed for iron oxidation (White et al., 2016). FeGenie does not include pili and flavin-related genes since these genes are commonly associated with other functions/metabolisms. Modified from White et al. (2016) and Wang et al. (2019). OM, outer membrane; P, periplasm; and CM, cytoplasmic membrane.
FIGURE 2Scheme of known iron acquisition, storage, and regulation pathways. Gram-negative (A) and Gram-positive (B) bacteria have different mechanisms to uptake iron due to differences in the cell membrane structure. Iron(II)/(III) uptake can also be mediated extracellularly by redox cycling secondary metabolites, such as phenazine-1-carboxylic acid (Cornelis and Dingemans, 2013). OM, outer membrane; P, periplasm; and CM, cytoplasmic membrane. Modified from Anzaldi and Skaar (2010); Caza and Kronstad (2013), Contreras et al. (2014); Kranzler et al. (2014), Lau et al. (2016). *Fe(III) release from siderophores intracellularly could include Fe(III) reduction (e.g., fpvG, Ganne et al., 2017) or modification/hydrolysis of the siderophore (e.g., esterase, Brickman and McIntosh, 1992).
Summary of iron-related protein families that are represented as pHMMs in FeGenie.
| Iron acquisition | Iron(II)/(III) transport | Efe |
| Heme oxygenase | ChuS8, ChuZ9, HemO10,11, PigA10,11, Hem | |
| Heme transport | Has | |
| Transferrin/Lactoferrin | Tbp | |
| Siderophore synthesis | Acs | |
| Siderophore transport | BesA48, CbrA | |
| Iron Gene regulation | Transcriptional regulation | |
| Iron oxidation and reduction | Iron oxidation | Cyc166,67, Cyc266,67,68, FoxABC69, FoxEYZ70, Sulfocyanin71, PioA |
| Probable iron oxidation and possible iron reduction | MtoA | |
| Dissimilatory iron reduction | CymA74, Mtr | |
| Probable iron reduction | MtrCB, MtrAB, MtoAB-MtrC | |
| Iron storage | Iron storage | |
| Magnetosome-related | Magnetosome formation | MamABEKLMOPQI82,83 (Note: These genes are found in all known magnetotactic microorganisms, except for |
Summary of metagenomes analyzed.
| Amazon River Plume (Station 3) | River/ocean mixing, intermediate salinity | SAMN02628402 | 377,266 | |
| Amazon River Plume (Station 10) | River/ocean mixing, low salinity | SAMN02628416 | 143,340 | |
| Amazon River Plume (Station 27) | River/ocean mixing, high salinity | SAMN02628424 | 278,301 | |
| The Cedars (BS5 2011) | Serpentinizing, alkaline groundwater (shallow source) | GCA_002583255.1 | 32,646 | |
| The Cedars (BS5 2012) | Serpentinizing, alkaline groundwater (shallow source) | GCA_002581825.1 | 50,323 | |
| The Cedars (GPS1 2011) | Serpentinizing, alkaline groundwater (deep source) | GCA_002581705.1 | 86,466 | |
| The Cedars (GPS1 2012) | Serpentinizing, alkaline groundwater (deep source) | GCA_002581605.1 | 78,321 | |
| Jinata Hot Springs | Iron-rich groundwater, mixed with seawater | PRJNA392119 | 992,695 | |
| Loihi Seamount (S1) (i.e., Syringe Sample) | Marine hydrothermal vent Fe microbial mat (surficial syringe sample) | SRR6114197 | 146,898 | |
| Loihi Seamount (S6) (i.e., Scoop Sample 1) | Marine hydrothermal vent Fe microbial mat (bulk scoop sample) | Gp0295815 | 390,888 | |
| Loihi Seamount (S19) (i.e., Scoop Sample 2) | Marine hydrothermal vent Fe microbial mat (bulk scoop sample) | Gp0295816 | 827,472 | |
| Mid-Atlantic Ridge, Rainbow (664-BS3) (i.e., Syringe Sample 1) | Marine hydrothermal vent Fe microbial mat (surface syringe sample) | Gp0295819 | 414,137 | |
| Mid-Atlantic Ridge, Rainbow (664-SC8) (i.e., Scoop Sample) | Marine hydrothermal vent Fe microbial mat (bulk scoop sample) | Gp0295820 | 597,486 | |
| Mid-Atlantic Ridge, TAG (665-MMA12) (i.e., Syringe Sample 2) | Marine hydrothermal vent Fe microbial mat (surface syringe sample) | Gp0295821 | 255,314 | |
| Mid-Atlantic Ridge, Snakepit (667-BS4) (i.e., Syringe Sample 3) | Marine hydrothermal vent Fe microbial mat (surface syringe sample) | Gp0295823 | 422,234 | |
| Mariana Backarc, Urashima (801-BM1-B4, S7) (i.e., Scoop Sample) | Marine hydrothermal vent Fe microbial mat (surface syringe sample) | Gp0295817 | 365,851 | |
| Arabian Sea metagenome ( | Marine surface water | PRJNA391943 | 398,870 | |
| Chile/Peru Coast metagenome ( | Marine surface water | PRJNA391943 | 375,779 | |
| East Africa Coast metagenome ( | Marine surface water | PRJNA391943 | 464,070 | |
| Indian Ocean metagenome ( | Marine surface water | PRJNA391943 | 178,873 | |
| Mediterranean metagenome ( | Marine surface water | PRJNA391943 | 607,005 | |
| North Atlantic metagenome ( | Marine surface water | PRJNA391943 | 673,120 | |
| North Pacific metagenome ( | Marine surface water | PRJNA391943 | 601,358 | |
| Red Sea metagenome ( | Marine surface water | PRJNA391943 | 331,387 | |
| South Atlantic metagenome ( | Marine surface water | PRJNA391943 | 735,385 | |
| South Pacific metagenome ( | Marine surface water | PRJNA391943 | 1,128,901 | |
| Rifle Aquifer | Terrestrial subsurface aquifer | 203,744 |
FIGURE 3FeGenie algorithm overview. Color-coded to represent various aspects of the program, including external programs/dependencies, optional databases for cross-reference, and custom Python scripts.
FIGURE 4Dot plot showing the relative abundance of different iron gene categories within 28 representative isolate genomes. The isolate genomes were selected as model microorganisms to demonstrate the accuracy of FeGenie for identifying genes involved in iron oxidation and reduction, iron transport (including siderophores and heme), iron storage, and iron gene regulation. The genomes were obtained from the NCBI RefSeq and GenBank databases and analyzed by FeGenie. The size of each dot reflects the number of genes identified for each category and normalized to the number of protein-coding genes predicted within each genome. *This category is reserved for genes related to the mtoAB/pioAB gene family.
FIGURE 5(A) Dot plot showing the distribution of iron genes on 27 metagenomes and (B) a scaled heatmap with accompanying dendrogram that represents hierarchical clustering of metagenome datasets based on identified iron genes. The dot plot shows the relative abundance of iron genes across 27 metagenomes. The size of each dot reflects the number of genes identified for each category, normalized to the number of protein-coding genes predicted within each metagenome. To generate the dendrogram, Ward’s method for hierarchical clustering was used, along with the Euclidian distance metric. The heatmap was created from a scaled version of FeGenie’s matrix output, which summarizes the amount of iron genes for each category present in each metagenome assembly. GPS, Grotto Pool Spring; BS, Barnes Spring. *This category is reserved for genes related to the mtoAB/pioAB gene family.
FIGURE 6Dot plot showing the relative abundance of different iron gene categories in 17 genomes from the Candidate Phyla Radiation and other candidate taxa. The genomes were obtained from the NCBI RefSeq and GenBank databases and analyzed by FeGenie. *This category is reserved for genes related to the MtoAB/PioAB gene family.