| Literature DB >> 29270251 |
Raquel Quatrini1, Lorena V Escudero2,3, Ana Moya-Beltrán1, Pedro A Galleguillos2,3, Francisco Issotta1, Mauricio Acosta2, Juan Pablo Cárdenas4, Harold Nuñez1, Karina Salinas5, David S Holmes1,6, Cecilia Demergasso2,3.
Abstract
10.1601/nm.2199 CLST is an extremely acidophilic gamma-proteobacteria that was isolated from the Gorbea salt flat, an acidic hypersaline environment in northern Chile. This kind of environment is considered a terrestrial analog of ancient Martian terrains and a source of new material for biotechnological applications. 10.1601/nm.2199 plays a key role in industrial bioleaching; it has the capacity of generating and maintaining acidic conditions by producing sulfuric acid and it can also remove sulfur layers from the surface of minerals, which are detrimental for their dissolution. CLST is a strain of 10.1601/nm.2199 able to tolerate moderate chloride concentrations (up to 15 g L-1 Cl-), a feature that is quite unusual in extreme acidophilic microorganisms. Basic microbiological features and genomic properties of this biotechnologically relevant strain are described in this work. The 3,974,949 bp draft genome is arranged into 40 scaffolds of 389 contigs containing 3866 protein-coding genes and 75 RNAs encoding genes. This is the first draft genome of a halotolerant 10.1601/nm.2199 strain. The release of the genome sequence of this strain improves representation of these extreme acidophilic Gram negative bacteria in public databases and strengthens the framework for further investigation of the physiological diversity and ecological function of 10.1601/nm.2199 populations.Entities:
Keywords: Acidithiobacillaceae; Bioleaching; Flexible gene complement; Halotolerance; Mars analog; Osmotolerance; Salar de Gorbea; Sulfur oxidization
Year: 2017 PMID: 29270251 PMCID: PMC5735861 DOI: 10.1186/s40793-017-0305-8
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Phylogenetic tree based on the 16S rRNA gene sequences highlighting the position of 10.1601/nm.2199 strain CLST relative to other type and non-type strains of the genus 10.1601/nm.2198. The GenBank database accession codes are indicated between brackets. The evolutionary history was inferred by using the Maximum Parsimony and the Subtree-Pruning-Regrafting (SPR) algorithm with search level 1 [52]. The initial trees were obtained by the random addition of sequences. The analysis involved 16 nucleotide sequences and a total of 1307 non-ambiguous positions in the final dataset. Evolutionary analyses were conducted in MEGA version 6.22 [53]. Tree construction used a bootstrapping process repeated 1000 times to generate a majority consensus tree. A sequence from 10.1601/nm.2205 was used as outgroup. The tree is drawn to scale, with branch lengths calculated using the average pathway method [53]; the scale bar corresponds to the number of changes over the whole sequence
Fig. 2Scanning electron microscopy image of 10.1601/nm.2199 strain CLST
Classification and general features of 10.1601/nm.2199 CLST
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum 10.1601/nm.808 | TAS [ | ||
| Class 10.1601/nm.24436 | TAS [ | ||
| Order 10.1601/nm.2196 | TAS [ | ||
| Family 10.1601/nm.2197 | TAS [ | ||
| Genus 10.1601/nm.2198 | TAS [ | ||
| Species 10.1601/nm.2199 | TAS [ | ||
| Strain: CLST (10.1601/strainfinder?urlappend=%3Fid%3DDSM+103717) | IDA | ||
| Gram stain | Negative | IDA | |
| Cell shape | Rod | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Not reported | IDA | |
| Temperature range | 25 -35 °C | IDA | |
| Optimum temperature | 28 °C | IDA | |
| Optimum pH | 1.7 | IDA | |
| Carbon source | CO2 | TAS [ | |
| MIGS-6 | Habitat | Brine, acidic hypersaline environment | IDA |
| MIGS-6.3 | Salinity | 10 -15 gL−1 chloride | IDA |
| MIGS-22 | Oxygen requirement | Aerobic and facultative anaerobic | IDA |
| MIGS-15 | Biotic relationship | Free-living | TAS [ |
| MIGS-14 | Pathogenicity | Non-pathogen | NAS |
| MIGS-4 | Geographic location | Gorbea salt flat, Antofagasta region, Chile | IDA |
| MIGS-5 | Sample collection | 11/20/2007 | IDA |
| MIGS-4.1 | Latitude | 25°25′72.2´´S | IDA |
| MIGS-4.2 | Longitude | 68°41′53.2´´W | IDA |
| MIGS-4.4 | Altitude | 4000 m.a.s.l. | IDA |
aEvidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [49]. Data is in compliance with MIGS version 2.0 [50] and the NamesforLife database [51]
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Draft |
| MIGS-28 | Libraries used | GS FLX Titanium paired end libraries |
| MIGS 29 | Sequencing platforms | Roche 454 GS FLX |
| MIGS 31.2 | Fold coverage | 36 × |
| MIGS 30 | Assemblers | Newbler 2.0.01.14 |
| MIGS 32 | Gene calling method | Glimmer 3.02 |
| Genbank ID |
| |
| GenBank Date of Release | 2017-04-05 | |
| GOLD ID | Gp0136483 | |
| BIOPROJECT |
| |
| MIGS 13 | Source Material Identifier | Gorbea-A |
| Project relevance | Territorial biodiversity, Tree of Life, Biomining, Astrobiology |
Genome statistics
| Attribute | Value | % of Totala |
|---|---|---|
| Genome size (bp) | 3,974,949 | 100.00 |
| DNA coding (bp) | 3,051,435 | 76.76 |
| DNA G + C (bp) | 1,939,775 | 48.80 |
| DNA scaffolds | 40 | 100.00 |
| Total genesb | 3941 | 100.00 |
| Protein coding genes | 3866 | 98.09 |
| RNA genesc | 75 | 1.91 |
| Pseudo genesd | n.d. | n.d. |
| Genes in internal clusters | 2118 | 54.78 |
| Genes with function prediction | 2468 | 63.38 |
| Genes assigned to COGs | 1803 | 46.63 |
| Genes with Pfam domains | 2634 | 68.13 |
| Genes with signal peptides | 292 | 7.55 |
| Genes with transmembrane helices | 880 | 22.33 |
| CRISPR repeats | 0 | 0.00 |
aThe total is based on either the size of the genome in base pairs or the total number of genes in the annotated genome
bIncludes tRNA, tmRNA, rRNA
cIncludes 23S, 16S and 5S rRNA
dn.d.: Not determined
Number of genes associated with general COG functional categories
| Code | Value | %age | Description |
|---|---|---|---|
| J | 165 | 4.27 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.03 | RNA processing and modification |
| K | 102 | 2.64 | Transcription |
| L | 121 | 3.13 | Replication, recombination and repair |
| B | 0 | 0.00 | Chromatin structure and dynamics |
| D | 31 | 0.80 | Cell cycle control, Cell division, chromosome partitioning |
| V | 59 | 1.53 | Defense mechanisms |
| T | 111 | 2.87 | Signal transduction mechanisms |
| M | 138 | 3.57 | Cell wall/membrane biogenesis |
| N | 69 | 1.78 | Cell motility |
| U | 46 | 1.19 | Intracellular trafficking and secretion |
| O | 88 | 2.28 | Posttranslational modification, protein turnover, chaperones |
| C | 124 | 3.21 | Energy production and conversion |
| G | 75 | 1.94 | Carbohydrate transport and metabolism |
| E | 116 | 3.00 | Amino acid transport and metabolism |
| F | 56 | 1.45 | Nucleotide transport and metabolism |
| H | 103 | 2.66 | Coenzyme transport and metabolism |
| I | 53 | 1.37 | Lipid transport and metabolism |
| P | 93 | 2.41 | Inorganic ion transport and metabolism |
| Q | 22 | 0.57 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 99 | 2.56 | General function prediction only |
| S | 131 | 3.38 | Function unknown |
| – | 2063 | 53.36 | Not in COGs |
The total is based on the total number of protein coding genes in the genome
Fig. 3Exclusive gene complement of 10.1601/nm.2199 strain CLST relative to other strains of the species. All possible pairwise comparisons were performed. The number of exclusive genes resulting from the pairwise comparisons between the genomes of CLST and the other 10.1601/nm.2199 strains is plotted against the number of genomes compared. Genes were color coded according to their COG predicted function, as indicated in the lateral bar