Rubén Sánchez-Nieves1, Marc Facciotti2, Sofía Saavedra-Collado1, Lizbeth Dávila-Santiago1, Roy Rodríguez-Carrero1, Rafael Montalvo-Rodríguez1.
Abstract
The genus Haloarcula, which belongs to the family Halobacteriaceae, currently has 10 valid species. Here we report the draft genome sequence of strain SL3, a new species within this genus, isolated from the Solar Salterns of Cabo Rojo, Puerto Rico. Genome assembly with NGEN Assembler resulted in 18 contigs (N50 = 601,911 bp), the largest of which contains 1,023,775 bp. The genome is 3.97 Mb in size and has a GC content of 61.97%. Like all species of Haloarcula, the genome encodes heterogeneous copies of the small subunit ribosomal RNA. In addition, the genome includes 6 rRNAs, 48 tRNAs, and 3797 protein-coding sequences. Several carbohydrate-active enzyme genes were found, as well as enzymes involved in the dihydroxyacetone processing pathway, which are not found in other Haloarcula species. The NCBI accession number for this genome is LIUF00000000 and the strain deposit number is CECT9001.
Keywords: Halophilic archaea; Genome sequence; Puerto Rico
Year: 2016 PMID: 26981428 PMCID: PMC4778667 DOI: 10.1016/j.gdata.2016.02.005
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
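The abstract reports two standard assembly statistics, N50 and GC content. As a minimal sketch of how such values are commonly computed (not the authors' pipeline, which used NGEN Assembler), the Python below defines two hypothetical helpers, `n50` and `gc_percent`, and runs them on toy inputs. The contig lengths and sequence are invented for illustration and are not the SL3 assembly.

```python
def n50(contig_lengths):
    """Return the N50 of an assembly.

    N50 is the length L such that contigs of length >= L together
    cover at least half of the total assembly length.
    """
    total = sum(contig_lengths)
    running = 0
    # Walk contigs from longest to shortest, accumulating length.
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("no contigs given")


def gc_percent(sequence):
    """Return GC content as a percentage of unambiguous (A/C/G/T) bases."""
    seq = sequence.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        raise ValueError("sequence has no A/C/G/T bases")
    return 100.0 * (seq.count("G") + seq.count("C")) / acgt


# Toy inputs (hypothetical, NOT the SL3 contigs).
toy_lengths = [100, 80, 60, 40, 20]
toy_sequence = "GGCCATGCAT"

print("N50:", n50(toy_lengths))
print("GC %:", gc_percent(toy_sequence))
```

Applied to the 18 contig lengths and the concatenated contig sequences of a real assembly, the same two functions would yield figures comparable to the reported N50 and GC content, assuming the same conventions for ambiguous bases.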
Fig. 1. The subsystem category distribution of strain SL3 (Haloarcula rubripromontorii). The chart represents the coverage of proteins that were grouped into subsystems. Each section represents a subsystem and the number of proteins within that subsystem. A total of 2064 proteins were categorized within these subsystems. This chart was generated by RAST (Rapid Annotation using Subsystem Technology).
CAZyme annotation of strain SL3 (H. rubripromontorii).
| Class | Family | Number of members |
| Auxiliary Activity (AA) | 2 | 6 |
| | 3 | 2 |
| | 6 | 1 |
| | 7 | 1 |
| Carbohydrate-Binding Module (CBM) | 6 | 3 |
| | 13 | 1 |
| | 16 | 1 |
| | 35 | 2 |
| | 40 | 1 |
| | 44 | 3 |
| Carbohydrate Esterase (CE) | 1 | 4 |
| | 4 | 3 |
| | 8 | 2 |
| | 10 | 3 |
| | 14 | 1 |
| Dockerin | | 2 |
| Glycoside Hydrolase (GH) | 2 | 1 |
| | 3 | 2 |
| | 13 | 6 |
| | 15 | 3 |
| | 32 | 1 |
| | 36 | 1 |
| | 42 | 1 |
| | 68 | 1 |
| | 74 | 3 |
| | 77 | 1 |
| | 97 | 1 |
| | 99 | 1 |
| | 109 | 8 |
| | 120 | 1 |
| Glycosyl Transferase (GT) | 1 | 1 |
| | 2 | 12 |
| | 4 | 21 |
| | 19 | 1 |
| | 20 | 1 |
| | 40 | 1 |
| | 66 | 3 |
| | 75 | 1 |
| | 81 | 1 |
| | 83 | 3 |
| | 94 | 1 |
| Polysaccharide Lyase (PL) | 5 | 1 |
| | 12 | 1 |
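
The CAZyme table lists members per family but gives no per-class totals. A minimal Python sketch for tallying them is shown below. The counts are transcribed from the table; the dictionary layout and variable names are illustrative, and the Dockerin row is assumed to mean 2 members with no family number.

```python
# Family -> number of members, grouped by CAZyme class (from the table above).
cazyme_counts = {
    "AA": {2: 6, 3: 2, 6: 1, 7: 1},
    "CBM": {6: 3, 13: 1, 16: 1, 35: 2, 40: 1, 44: 3},
    "CE": {1: 4, 4: 3, 8: 2, 10: 3, 14: 1},
    # Assumption: the Dockerin row has no family number and 2 members.
    "Dockerin": {None: 2},
    "GH": {2: 1, 3: 2, 13: 6, 15: 3, 32: 1, 36: 1, 42: 1, 68: 1,
           74: 3, 77: 1, 97: 1, 99: 1, 109: 8, 120: 1},
    "GT": {1: 1, 2: 12, 4: 21, 19: 1, 20: 1, 40: 1, 66: 3, 75: 1,
           81: 1, 83: 3, 94: 1},
    "PL": {5: 1, 12: 1},
}

# Sum members within each class, then across all classes.
class_totals = {cls: sum(fams.values()) for cls, fams in cazyme_counts.items()}
grand_total = sum(class_totals.values())

# Print classes from most to least populated.
for cls, total in sorted(class_totals.items(), key=lambda kv: -kv[1]):
    print(f"{cls:9s} {total}")
print("Total annotated CAZyme domains:", grand_total)
```

Under this reading of the table, glycosyl transferases are the largest class, driven mainly by families GT2 and GT4, followed by glycoside hydrolases.
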
| Specifications: | |
| Organism | Haloarcula rubripromontorii |
| Strain | SL3 |
| Sequencer or array type | MiSeq Systems (Illumina) |
| Data format | Analyzed |
| Experimental factors | Microbial strain |
| Experimental features | Assembled and annotated whole genome |
| Consent | |
| Sample source location | Solar Salterns of Cabo Rojo, Puerto Rico (17°57′12″N, 67°11′45″W) |
| Organism | Haloarcula rubripromontorii |
| Source | Cabo Rojo, Puerto Rico |
| Genome Size, Mb | 3.97 |
| GC content, % | 61.97 |
| tRNA | 48 |
| rRNA | 6 |
| Protein coding sequences | 3797 |