| Literature DB >> 32350143 |
Song-Can Chen1,2,3, Guo-Xin Sun1, Yu Yan4, Konstantinos T Konstantinidis5,6, Si-Yu Zhang5, Ye Deng1, Xiao-Min Li1,3, Hui-Ling Cui1,3, Florin Musat2, Denny Popp7, Barry P Rosen8, Yong-Guan Zhu9,10.
Abstract
The rise of oxygen on the early Earth about 2.4 billion years ago reorganized the redox cycle of harmful metal(loids), including that of arsenic, which doubtlessly imposed substantial barriers to the physiology and diversification of life. Evaluating the adaptive biological responses to these environmental challenges is inherently difficult because of the paucity of fossil records. Here we applied molecular clock analyses to 13 gene families participating in principal pathways of arsenic resistance and cycling, to explore the nature of early arsenic biogeocycles and decipher feedbacks associated with planetary oxygenation. Our results reveal the advent of nascent arsenic resistance systems under the anoxic environment predating the Great Oxidation Event (GOE), with the primary function of detoxifying reduced arsenic compounds that were abundant in Archean environments. To cope with the increased toxicity of oxidized arsenic species that occurred as oxygen built up in Earth's atmosphere, we found that parts of preexisting detoxification systems for trivalent arsenicals were merged with newly emerged pathways that originated via convergent evolution. Further expansion of arsenic resistance systems was made feasible by incorporation of oxygen-dependent enzymatic pathways into the detoxification network. These genetic innovations, together with adaptive responses to other redox-sensitive metals, provided organisms with novel mechanisms for adaption to changes in global biogeocycles that emerged as a consequence of the GOE.Entities:
Keywords: arsenic; biogeochemistry; detoxification; evolution; oxygen
Year: 2020 PMID: 32350143 PMCID: PMC7229686 DOI: 10.1073/pnas.2001063117
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.Phylogenetic distribution of 13 arsenic detoxification genes. (Left) Reference phylogenetic trees of major lineages of Bacteria, Archaea, and Eukaryotes. (Right) Relative abundance of 13 arsenic detoxification genes present within each major lineage. The 13 genes were divided into three sets (A-C) according to their phyletic distribution patterns. The reference phylogeny was reconstructed from concatenate alignment of 16 ribosomal proteins, as previously reported (19). Divergent times and corresponding confidence intervals (95%) were estimated using PhyloBayes (analysis 7; Table 2). Timescale: Hd, Hadean; Ph, Phanerozoic; Ga, billions of years.
Birth Age of 13 arsenic resistance genes estimated under analytical scenarios 7 to 12
| Analysis | 7 | 8 | 9 | 10 | 11 | 12 |
| Model assumptions and calibrations | ||||||
| Rate model | Uncorrelated | Uncorrelated | Uncorrelated | Uncorrelated | Uncorrelated | Uncorrelated |
| Calibration | Full set | Full set | −Cyanobacteria | −Rhodophyta | Full set | Full set |
| Root prior | U(3.35,4.38) | Γ(3.95;0.23) | U(3.35,4.38) | U(3.35,4.38) | U(3.35,4.38) | U(3.35,4.38) |
| Topology | ML | ML | ML | ML | MT | Three-domain tree |
| Gene age (Gyr) | ||||||
| | 3.40 (3.23–3.61) | 3.37 (3.23–3.72) | 3.44 (3.03–3.68) | 3.40 (3.24–3.73) | 3.40 (3.24–3.72) | 3.45 (3.24–3.76) |
| | 2.79 (2.55–2.96) | 2.77 (2.59–3.05) | 2.86 (2.51–3.06) | 2.79 (2.56–3.04) | 2.78 (2.60–3.04) | 2.81 (2.59–3.04) |
| | 2.39 (2.03–2.68) | 2.39 (2.03–2.76) | 2.45 (2.04–2.72) | 2.38 (2.01–2.76) | 2.40 (2.06–2.76) | 2.39 (2.07–2.74) |
| | 2.47 (2.12–2.75) | 2.46 (2.09–2.86) | 2.53 (2.12–2.79) | 2.46 (2.06–2.84) | 2.47 (2.12–2.83) | 2.46 (2.17–2.82) |
| | 2.47 (2.12–2.75) | 2.46 (2.09–2.86) | 2.53 (2.12–2.79) | 2.46 (2.06–2.84) | 2.47 (2.12–2.83) | 2.46 (2.17–2.82) |
| | 1.99 (1.70–2.21) | 1.99 (1.73–2.34) | 2.04 (1.71–2.30) | 1.98 (1.68–2.30) | 2.00 (1.73–2.30) | 2.01 (1.75–2.28) |
| | 1.36 (0.84–2.02) | 1.40 (0.84–2.04) | 1.42 (0.79–2.01) | 1.37 (0.78–2.00) | 1.39 (0.82–2.15) | 1.38 (0.79–2.05) |
| | 1.70 (1.61–1.81) | 1.61 (1.43–1.82) | 1.70 (1.61–1.82) | 1.70 (1.61–1.84) | 1.58 (1.41–1.78) | 1.64 (1.43–1.83) |
| | 1.63 (1.57–1.73) | 1.63 (1.56–1.76) | 1.63 (1.56–1.74) | 1.63 (1.57–1.77) | 1.63 (1.56–1.73) | 1.63 (1.57–1.75) |
| | 1.53 (1.33–1.70) | 1.53 (1.34–1.74) | 1.57 (1.31–1.73) | 1.53 (1.37–1.73) | 1.69 (1.53–1.88) | 1.56 (1.36–1.77) |
| | 1.31 (0.93–1.58) | 1.29 (1.02–1.59) | 1.35 (0.95–1.58) | 1.31 (1.00–1.59) | 1.30 (1.01–1.58) | 1.34 (1.02–1.62) |
| | 1.11 (0.92–1.29) | 1.11 (0.96–1.31) | 1.17 (0.95–1.32) | 1.09 (0.93–1.28) | 1.10 (0.94–1.28) | 1.18 (1.01–1.38) |
| | 1.02 (0.89–1.14) | 1.02 (0.92–1.19) | 1.05 (0.89–1.17) | 1.02 (0.90–1.19) | 0.79 (0.65–0.94) | 1.05 (0.93–1.19) |
Autocorrelated, autocorrelated rate model; Uncorrelated, uncorrelated rate model.
−Cyanobacteria, subsampled calibration points without Cyanobacteria; −Rhodophyta, subsampled calibration points without Rhodophyta.
U, uniform distribution (upper, lower); Γ: Gamma distribution (mean; SD).
ML, maximum likelihood tree of ribosomal proteins; MT: alternative topology reflecting minority bipartitions; Three-domain tree: tree topology where archaea and eukaryotes are sister group.
Median age estimates of gene birth nodes, with 95% confidence intervals in parentheses; Gyr, billion years.
Birth Age of 13 arsenic resistance genes estimated under analytical scenarios 1 to 6
| Analysis | 1 | 2 | 3 | 4 | 5 | 6 |
| Model assumptions and calibrations | ||||||
| Rate model | Autocorrelated | Autocorrelated | Autocorrelated | Autocorrelated | Autocorrelated | Autocorrelated |
| Calibration | Full set | Full set | −Cyanobacteria | −Rhodophyta | Full set | Full set |
| Root prior | U(3.35,4.38) | Γ(3.95;0.23) | U(3.35,4.38) | U(3.35,4.38) | U(3.35,4.38) | U(3.35,4.38) |
| Topology | ML | ML | ML | ML | MT | Three-domain tree |
| Gene age (Gyr) | ||||||
| | 3.55 (3.27–3.72) | 3.59 (3.31–3.79) | 3.69 (3.45–3.89) | 3.50 (3.28–3.76) | 3.57 (3.29–3.83) | 3.62 (3.40–3.86) |
| | 2.97 (2.71–3.09) | 3.10 (2.77–3.31) | 3.18 (2.87–3.38) | 2.99 (2.77–3.30) | 3.08 (2.78–3.34) | 3.00 (2.81–3.23) |
| | 2.70 (2.34–2.89) | 2.74 (2.39–2.95) | 2.82 (2.44–2.98) | 2.64 (2.41–2.91) | 2.74 (2.42–3.01) | 2.73 (2.50–3.01) |
| | 2.79 (2.45–2.97) | 2.83 (2.45–3.03) | 2.91 (2.55–3.10) | 2.73 (2.49–2.98) | 2.83 (2.52–3.09) | 2.82 (2.61–3.08) |
| | 2.79 (2.45–2.97) | 2.83 (2.45–3.03) | 2.91 (2.55–3.10) | 2.73 (2.49–2.98) | 2.83 (2.52–3.09) | 2.82 (2.61–3.08) |
| | 2.07 (1.57–2.36) | 2.10 (1.57–2.41) | 2.16 (1.58–2.46) | 2.03 (1.61–2.39) | 2.10 (1.60–2.47) | 2.10 (1.73–2.47) |
| | 2.26 (1.78–2.47) | 2.27 (1.94–2.54) | 2.36 (1.99–2.61) | 2.19 (1.85–2.51) | 2.26 (1.90–2.62) | 2.28 (1.90–2.58) |
| | 1.91 (1.79–2.04) | 1.92 (1.79–2.03) | 1.93 (1.80–2.05) | 1.90 (1.79–2.04) | 1.82 (1.59–2.00) | 1.91 (1.80–2.06) |
| | 1.80 (1.70–1.91) | 1.81 (1.70–1.91) | 1.81 (1.70–1.92) | 1.79 (1.70–1.91) | 1.86 (1.50–2.14) | 1.80 (1.70–1.93) |
| | 1.71 (1.48–1.84) | 1.71 (1.51–1.90) | 1.77 (1.55–1.93) | 1.63 (1.49–1.86) | 1.91 (1.69–2.09) | 1.71 (1.54–1.94) |
| | 1.72 (1.48–1.85) | 1.73 (1.53–1.94) | 1.79 (1.57–1.92) | 1.64 (1.51–1.89) | 1.74 (1.54–1.97) | 1.72 (1.58–1.95) |
| | 0.97 (0.80–1.11) | 0.99 (0.80–1.16) | 1.02 (0.78–1.11) | 0.95 (0.79–1.13) | 1.00 (0.85–1.16) | 1.17 (0.99–1.40) |
| | 1.14 (0.97–1.26) | 1.15 (0.98–1.31) | 1.18 (1.02–1.28) | 1.08 (0.97–1.26) | 1.00 (0.81–1.16) | 1.14 (1.02–1.32) |
Autocorrelated, autocorrelated rate model; Uncorrelated, uncorrelated rate model.
−Cyanobacteria, subsampled calibration points without Cyanobacteria; −Rhodophyta, subsampled calibration points without Rhodophyta.
U, uniform distribution (upper, lower); Γ: Gamma distribution (mean; SD).
ML, maximum likelihood tree of ribosomal proteins; MT: alternative topology reflecting minority bipartitions; Three-domain tree: tree topology where archaea and eukaryotes are sister group.
Median age estimates of gene birth nodes, with 95% confidence intervals in parentheses; Gyr, billion years.
Fig. 2.Gene birth date for each of 13 arsenic detoxification genes. Gene ages were derived from reconciliation results (cycle), using fully dated species trees (n = 1200) sampled from 12 PhyloBayes analyses. The median age estimates under each analytical scenario (Tables 1 and 2) were shown as diamond. The uncertainties associated with the results from all PhyloBayes analyses were integrated as 95% composite confidence intervals (whisker of the boxplot). Age estimates of genes evolved before, around, and after GOE were shown as blue, yellow, and green, respectively. Atmospheric oxygen content throughout Earth’s history was overlaid on the gene’s age (red line) (9). Right y axis, pO2, relative to the present atmospheric level (PAL); left y axis, gene names. Genes found in both anaerobes and aerobes, or only in aerobes were denoted as blue and green, respectively (Fig. 3). Oxygen-dependent genes (arsI and arsH) were indicated by star. AsIII, AsV, and MAsIII were used to delineate genes acting on inorganic arsenite, arsenate, or methylarsenite, respectively. Ga, billions of years.
Fig. 3.Distribution of 13 arsenic detoxification genes among strict anaerobes and aerobes. Species were classified either as aerobes (including facultative anaerobes) or anaerobes based on their capability to use oxygen as a terminal electron acceptor. Each black tick indicated the presence of the corresponding gene in a taxon. Genes evolved before or at beginning of GOE were denoted as blue, and those after as green. Oxygen-dependent genes (arsI and arsH) were indicated with the star symbol.
Fig. 4.Arsenic resistance systems before (A) and after (B) the GOE. As(III), arsenite; As(V), arsenate; MAs(III), trivalent methylarsenite; MAs(V), pentavalent methylarsenate; SAM, S-adenosylmethionine; GSH, reduced glutathione; GSSG, oxidized glutathione; Grxred: reduced glutaredoxin; Grxox, oxidized glutaredoxin; Trxred, reduced thioredoxin; Trxred, oxidized thioredoxin.