| Literature DB >> 28587624 |
María Sofía Urbieta1,2, Nicolás Rascovan3, Martín P Vázquez3, Edgardo Donati4.
Abstract
BACKGROUND: Several archaeal species from the order Sulfolobales are interesting from the biotechnological point of view due to their biomining capacities. Within this group, the genus Acidianus contains four biomining species (from ten known Acidianus species), but none of these have their genome sequenced. To get insights into the genetic potential and metabolic pathways involved in the biomining activity of this group, we sequenced the genome of Acidianus copahuensis ALE1 strain, a novel thermoacidophilic crenarchaeon (optimum growth: 75 °C, pH 3) isolated from the volcanic geothermal area of Copahue at Neuquén province in Argentina. Previous experimental characterization of A. copahuensis revealed a high biomining potential, exhibited as high oxidation activity of sulfur and sulfur compounds, ferrous iron and sulfide minerals (e.g.: pyrite). This strain is also autotrophic and tolerant to heavy metals, thus, it can grow under adverse conditions for most forms of life with a low nutrient demand, conditions that are commonly found in mining environments.Entities:
Keywords: Acidianus copahuensis; Biomining genes; Thermoacidophilic archaea
Mesh:
Substances:
Year: 2017 PMID: 28587624 PMCID: PMC5461723 DOI: 10.1186/s12864-017-3828-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Digital DDH estimation in silico of Acidianus copahuensis genome against all other available Sulfolobales genomes
| Query genome | Reference genome | DDH (%) | +/− | Distance | Prob. DDH > = 70% | G + C difference |
|---|---|---|---|---|---|---|
|
|
| 30.1 | 2.45 | 0.1416 | 0.11 | 16.04 |
|
|
| 28.2 | 2.43 | 0.1526 | 0.05 | 2.85 |
|
|
| 25.3 | 2.4 | 0.1719 | 0.01 | 0.15 |
|
|
| 23.8 | 2.38 | 0.1833 | 0 | 1.07 |
|
|
| 21.9 | 2.35 | 0.2007 | 0 | 0.53 |
|
|
| 20.6 | 2.32 | 0.2133 | 0 | 10.59 |
|
|
| 19.4 | 2.29 | 0.2263 | 0 | 7.68 |
|
|
| 19.2 | 2.28 | 0.2294 | 0 | 2.97 |
|
|
| 18.6 | 2.27 | 0.2368 | 0 | 6.36 |
|
|
| 17.9 | 2.24 | 0.2453 | 0 | 12.11 |
|
|
| 15.7 | 2.16 | 0.278 | 0 | 1.49 |
|
|
| 15.6 | 2.15 | 0.2804 | 0 | 11.35 |
Fig. 1Acidianus copahuensis proteins compared to all other Sulfolobales species. a Network analysis representing the results of an “all vs. all” BLASTP comparison of all Sulfolobales proteins from sequenced genomes. Only hits with more than 65% of both proteins aligned and E-value lower that 1E-20 were considered as a match between two proteins. The line width is proportional to the number of proteins shared by two species. b Heatmap analysis representing the Acidianus copahuensis proteins that were found in other Sulfolobales species based on the BLASTP comparison mentioned above. Proteins were classified depending on the number of genomes where each protein is present (see heatmap side bar and the upper-right bar chart as a reference for the categories used). Clustering was performed using the Euclidean distance
Fig. 2Phylogeny of Sulfolobales based on SOR proteins and 16S rRNA genes. Phylogenetic trees were obtained by the Maximum Likelihood method for all known SOR proteins found in NCBI database (a) and the corresponding 16S rRNA genes (b) in the same organisms. Bootstrap supports for nodes were obtained using 1000 repetitions and are expressed as the proportion of times (in decimals) that each node was supported. Archaea branches from the phylum Euryarchaeota are colored in blue and from the phylum Crenarchaeota in red while Bacteria branches are in green
Fig. 3Organization of dsrE-tusA-hdr-like gene clusters in Sulfolobales. Genomic organization of the cluster is almost identical in all Sulfolobales to that shown by Liu et al. 2014 for the M. cuprina genome [28]. Acidianus copahuensis proteins were used as references to calculate the percentage similarity to corresponding proteins from all other species. The homologous genes from different species are represented using the same color. Arrow orientation indicates the orientation of the ORF in the genome and lengths are proportional to the real length of the protein. “Cons. H.P.” is an acronym for conserved hypothetical protein. Double bar indicates that the gene is located in a different genomic region. All proteins from each species were concatenated and the resulting poly-proteins were used to build a Maximum Likelihood phylogenetic tree, placed on the left of the figure. Bootstrap supports for nodes were obtained using 1000 repetitions and are expressed as the proportion of times (in decimals) that each node was supported
Comparison of the genes and proteins in the hydrogenase cluster described in A. ambivalens [72] with the correspondent homologous found in A. copahuensis genome
|
| Homologs in A. copahuensis | ||||||
|---|---|---|---|---|---|---|---|
| Gene | Protein acc. Number (NCBI) | ORF (aa) | ORF (aa) | Protein acc. Number (NCBI) | Aa identical positions (%) | Prevalence in Sulfolobales (# of genomes) | Predicted function |
|
| CAC86887 | 628 | 633 | EZQ01597 | 82 | 4 (M.S / M.Y / S.AZ1 / S.acd1) | membrane-bound NiFe hydrogenase large subunit |
|
| CAC86884 | 417 | 417 | EZQ01596 | 85 | 4 (M.S / M.Y / S.AZ1 / S.acd1) | NiFe hydrogenase small FeS subunit |
|
| CAC86885 | 269 | 278 | EZQ01616 | 68 | 2 (S.AZ1, S.acd1) | hydrogenase membrane anchor, heme b-binding |
|
| CAC86886 | 454 | 452 | EZQ01617 | 74 | 2 (S.AZ1, S.acd1) | hypotetical FeS subunit |
|
| CAC86889 | 201 | 206 | EZQ01618 | 55 | 1 (S. acd1) | hypothetical protein from hydrogenase cluster |
|
| CAC86888 | 113 | 92 | EZQ01598 | 70 | 1 (S. acd1) | Rieske-type FeS protein |
|
| CAC86890 | 394 | 395 | EZQ01599 | 69 | 4 (M.S / M.Y / S.AZ1 / S.acd1) | hydrogenase maturation protein |
|
| CAC86891 | 101 | 104 | EZQ01600 | 65 | 2 (S.AZ1, S.acd1) | hydrogenase maturation protein |
|
| CAC86892 | 339 | 335 | EZQ01601 | 67 | 4 (M.S / M.Y / S.AZ1 / S.acd1) | hydrogenase maturation protein |
|
| CAC86893 | 70 | 72 | EZQ01602 | 45 | 0 | Unknown/hypothetical protein in hydrogenase cluster |
|
| CAC86894 | 143 | 142 | EZQ01603 | 64 | 2 (S.AZ1, S.acd1) | Unknown/hypothetical protein in hydrogenase cluster |
|
| CAC86895 | 163 | 159 | EZQ01604 | 54 | 2 (S.AZ1, S.acd1) | Maturation protease for HynL |
The proteins are identified by their NCBI accession number. ORF (aa): indicates length of the open reading frame in amino acids. The prevalence in other Sulfolobales species was calculated based on the all vs. all BLASTP comparison used for Fig. 1. For some proteins, more than one hit was found in certain species (See Additional file 2: Table S1). Acronyms: M.S. (Metallosphaera sedula), M.Y (Metallosphaera yellowstonensis), S.AZ1 (Sulfolobales archaeon AZ1), S.acd1 (Sulfolobales archaeon Acd1). Aa: amino acids
Fig. 4Organization of fox genes clusters in Sulfolobales. Sulfolobus metallicus genes were used as reference because it is the model organism where this complex was first described (*). Similarity of the amino acid sequences of all Sulfolobales with fox genes to the S.metallicus references was estimated using a BLASTP analysis. The homologous genes from different species are represented using the same color, arrow orientation indicates the orientation of the ORF in the genome and lengths are proportional to the real length of the proteins. Homologous genes are linked by lines to track changes in genome organization among species. Double bar indicates that the gene is located in a different genomic region
Carbon fixation proteins from the hydroxypropionate–hydroxybutyrate carbon fixation pathway in M. sedula and the homologous found in A. copahuensis genome
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|
| Gene name | Uniprot ID | ORF (aa) | NCBI accession | ORF (aa) | E-value | Aa identical positions (%) | HhcR (position) | Protein Function |
|
| A4YIR2 | 115 | EZQ01893 | 115 | 1.00E-46 | 64 | HhcR transcription factor | |
|
| A4YD22 | 510 | EZQ11106 | 507 | 0 | 78 | + (−46) | Acetyl-CoA/propionyl-CoA carboxylase, alpha subunit |
|
| A4YD23 | 167 | EZQ11107 | 167 | 2.00E-84 | 64 | - | Acetyl-CoA/propionyl-CoA carboxylase, beta subunit |
|
| A4YGI1 | 524 | EZQ11108 | 523 | 0 | 81 | + (−53) | Acetyl-CoA/propionyl-CoA carboxylase, gamma subunit |
|
| A4YEN2 | 357 | EZQ11004 | 356 | 0 | 82 | ? (−274) | Malonyl-CoA/succinyl-CoA reductase |
|
| A4YI81 | 314 | EZQ04857 | 317 | 2.00E-174 | 75 | + (−20) | Malonate semialdehyde reductase |
|
| A4YGR1 | 661 | EZQ11066 | 653 | 0 | 78 | - | 3-Hydroxypropionyl-CoA synthetase |
|
| A4YGN2 | 332 | EZQ01730 | 333 | 0 | 76 | + (−19) | Acryloyl-CoA reductase |
|
| A4YEG2 | 140 | EZQ10835 | 141 | 4.00E-71 | 70 | + (−65) | Methylmalonyl-CoA epimerase |
|
| A4YEG1 | 553 | EZQ10834 | 550 | 0 | 83 | ? (−242) | Methylmalonyl-CoA mutase (catalytic subunit) |
|
| A4YIE3 | 155 | EZQ04694 | 138 | 8.00E-75 | 78 | + (−39) | Methylmalonyl-CoA mutase (coenzyme B12-binding subunit) |
|
| A4YGN0 | 360 | EZQ01731 | 360 | 0 | 77 | + (−41) | Succinate semialdehyde reductase |
|
| A4YDR9 | 549 | - | - | - | - | - | 4-Hydroxybutyryl-CoA synthetase |
|
| A4YDT1 | 564 | EZQ11368 | 559 | 0 | 58 | - | 4-Hydroxybutyryl-CoA synthetase |
|
| A4YGM8 | 472 | EZQ01733 | 471 | 0 | 85 | + (−37) | 4-Hydroxybutyryl-CoA synthetase |
|
| A4YGC7 | 507 | EZQ02033 | 506 | 0 | 75 | + (−30) | 4-Hydroxybutyryl-CoA dehydratase |
|
| A4YEH9 | 395 | EZQ11076 | 396 | 0 | 71 | + (−40) | Acetoacetyl-CoA-ketothiolase |
|
| A4YGM9 | 391 | EZQ01732 | 388 | 0 | 72 | + (−46) | Probable 3-hydroxybutyryl-CoA dehydrogenase |
|
| A4YDS4 | 651 | EZQ11226 | 661 | 0 | 49 | + (−91) | Bifunctional crotonyl-CoA hydratase/3-hydroxybutyryl-CoA dehydrogenase |
| A4YI89 | 259 | EZQ04864 | 257 | 2.00E-133 | 71 | + (−15) | 3-hydroxypropionyl-coenzyme A dehydratase | |
The proteins are identified by their NCBI accession number (A. copahuensis) or Uniprot identification number (M. sedula). ORF (aa) indicates length of the open reading frame in amino acids, E-values and amino acid identities were obtained by BLASTP analysis. The presence of the HhcR regulatory motifs in A. copahuensis at the promoter of each gene was determined following the procedures described by Leyn et al. 2015 [53]. Presence is indicated with (+), absence by (−) and cases where it was found more distant than average, with (?), and positions upstream the start codon are indicated in parenthesis. Aa: amino acids