| Literature DB >> 25197453 |
Gro Elin Kjæreng Bjerga1, Erik Hjerde1, Concetta De Santi2, Adele Kim Williamson1, Arne Oskar Smalås1, Nils Peder Willassen1, Bjørn Altermark1.
Abstract
Here we report the 8 Mb high quality draft genome of Streptomyces sp. strain AW19M42, together with specific properties of the organism and the generation, annotation and analysis of its genome sequence. The genome encodes 7,727 putative open reading frames, of which 6,400 could be assigned with COG categories. Also, 62 tRNA genes and 8 rRNA operons were identified. The genome harbors several gene clusters involved in the production of secondary metabolites. Functional screening of the isolate was positive for several enzymatic activities, and some candidate genes coding for those activities are listed in this report. We find that this isolate shows biotechnological potential and is an interesting target for bioprospecting.Entities:
Keywords: Actinobacteria; Bioprospecting; Streptomyces; enzymes; metabolites
Year: 2014 PMID: 25197453 PMCID: PMC4148980 DOI: 10.4056/sigs.5038901
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of sp. strain AW19M42 according to the MIGS recommendations [17]
| | | | |
|---|---|---|---|
| Domain | TAS [ | ||
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Subclass | TAS [ | ||
| Current classification | Order | TAS [ | |
| Suborder | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | NAS | ||
| Strain AW19M42 | IDA | ||
| Gram stain | Gram positive | NDA | |
| Cell shape | Branched mycelia | NDA | |
| Motility | Dispersion of spores | NDA | |
| Sporulation | Sporulating | NDA | |
| Temperature range | Range not determined, grows at 15°C and 28°C | IDA | |
| MIGS-6.3 | Salinity | Not determined, but survives 50% natural sea water | IDA |
| MIGS-22 | Oxygen requirements | Aerobic | NDA |
| Carbon source | Not reported | ||
| Energy source | Not reported | ||
| MIGS-6 | Habitat | Inner organs of sea squirt | IDA |
| MIGS-15 | Biotic relationship | Free-living | IDA |
| MIGS-14 | Pathogenicity | Non-pathogenic | NDA |
| Biosafety level | 1 | ||
| MIGS-4 | Geographic location | Hellmofjorden, Norway | IDA |
| MIGS-5 | Sample collection time | April 2010 | IDA |
| MIGS-4.1 | Latitude | N67 49.24316 | IDA |
| MIGS-4.2 | Longitude | E16 28.99465 | IDA |
| MIGS-4.3 | Depth | 77.35 m | IDA |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from of the Gene Ontology project [28]. If the evidence code is IDA, then the property was directly observed for a live isolate by one of the authors or an expert or mentioned in the acknowledgements.
Figure 1Phylogenetic tree indicating the phylogenetic relationship of sp. strain AW19M42 relative to other species. The phylogenetic tree was made by comparing the 16S rDNA sequence of the sp. strain AW19M42 to the closest related sequences from both validated type strains and unidentified isolates. S. venezuelea is used as outgroup. All positions containing gaps and missing data were eliminated. There were a total of 1,389 positions in the final dataset. The bar shows the number of base substitutions per site.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Improved high quality draft |
| MIGS-28 | Libraries used | One Illumina Paired-End library |
| MIGS-29 | Sequencing platforms | Illumina HiSeq2000 |
| MIGS-31.2 | Fold coverage | 350× |
| MIGS-30 | Assemblers | CLC paired-end assembly |
| MIGS-32 | Gene calling method | Glimmer 3 |
| Genbank ID | CBRG000000000 | |
| Genbank Date of Release | September 11, 2013 | |
| GOLD ID | Gi0070794 | |
| Project relevance | Bioprospecting |
Genome statistics, including nucleotide content and gene count levels
| | | |
|---|---|---|
| Genome size (bp) | 8,008,851 | 100 |
| DNA coding region (bp) | 6,979,999 | 87.2 |
| DNA G+C content (bp) | 4,951,797 | 70.6 |
| Total genes | 7,813 | n/a |
| rRNA operons | 8 | n/a |
| tRNA genes | 62 | n/a |
| Protein-coding genes | 7,727 | 100 |
| Genes assigned to COGs | 6,400 | 82.8 |
| Genes with signal peptides | 987 | 12.8 |
| Genes with transmembrane helices | 1,660 | 21.5 |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
Number of genes associated with the 25 general COG functional categories
| | | | |
|---|---|---|---|
| J | 264 | 3.4 | Translation |
| A | 1 | 0.0 | RNA processing and modification |
| K | 836 | 10.8 | Transcription |
| L | 330 | 4.3 | Replication, recombination and repair |
| B | 5 | 0.1 | Chromatin structure and dynamics |
| D | 71 | 0.9 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.0 | Nuclear structure |
| V | 159 | 2.1 | Defense mechanisms |
| T | 442 | 5.7 | Signal transduction mechanisms |
| M | 338 | 4.3 | Cell wall/membrane biogenesis |
| N | 28 | 0.4 | Cell motility |
| Z | 6 | 0.1 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 79 | 1.0 | Intracellular trafficking and secretion |
| O | 200 | 2.6 | Posttranslational modification, protein turnover, chaperones |
| C | 409 | 5.3 | Energy production and conversion |
| G | 665 | 8.6 | Carbohydrate transport and metabolism |
| E | 730 | 9.4 | Amino acid transport and metabolism |
| F | 123 | 1.6 | Nucleotide transport and metabolism |
| H | 262 | 3.4 | Coenzyme transport and metabolism |
| I | 330 | 4.3 | Lipid transport and metabolism |
| P | 435 | 5.6 | Inorganic ion transport and metabolism |
| Q | 417 | 5.4 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 1,181 | 15.3 | General function prediction only |
| S | 465 | 6.0 | Function unknown |
| - | 1,327 | 17.2 | Not in COGs |
aThe total is based on the total number of protein coding genes in the annotated genome.
Figure 2Degradation halos around colonies of sp. AW19M42 growing on agar plates supplemented with A, skim milk, B, gelatin, C, tributyrin, D, DNA, E, chitin and F, starch.
Candidate genes coding for putative lipase, caseinase, gelatinase and DNase activities identified in sp. strain AW19M42 draft genome.
| | | |
|---|---|---|
| STREP_0737 | Lipase | 273 |
| STREP_1671 | Triacylglycerol lipase | 266 |
| STREP_1821 | G-D-S-L family lipolytic protein | 281 |
| STREP_2698 | Lipase class 2 | 297 |
| STREP_2704 | Triacylglycerol lipase | 269 |
| STREP_4585 | Secreted hydrolase | 268 |
| STREP_5662 | Lipase or acylhydrolase family protein | 367 |
| STREP_6665 | Esterase/lipase | 259 |
| STREP_6850 | Esterase/lipase | 429 |
| STREP_7611 | Triacylglycerol lipase | 366 |
| STREP_5784 | Peptidase M4 thermolysin | 523 |
| STREP_6038 | Peptidase M4 thermolysin | 680 |
| STREP_3662 | Peptidase M4 thermolysin | 358 |
| STREP_0198 | Putative secreted serine protease | 361 |
| STREP_0258 | Protease | 278 |
| STREP_0974 | Protease | 488 |
| STREP_1078 | Serine protease | 388 |
| STREP_1313 | M6 family metalloprotease domain-containing protein | 398 |
| STREP_1389 | M6 family metalloprotease domain protein | 1,389 |
| STREP_2216 | Putative secreted subtilisin-like serine protease | 511 |
| STREP_2239 | metalloprotease | 296 |
| STREP_3135 | Metalloprotease domain protein | 127 |
| STREP_3964 | ATP-dependent protease La | 808 |
| STREP_3975 | ATP-dependent metalloprotease FtsH | 673 |
| STREP_4000 | Streptogrisin-B - Pronase enzyme B SGPB/Serine protease B | 299 |
| STREP_5179 | ATP-dependent Clp protease proteolytic subunit | 222 |
| STREP_5180 | ATP-dependent Clp protease, ATP-binding subunit ClpX | 432 |
| STREP_5944 | Protease | 527 |
| STREP_5945 | Protease | 534 |
| STREP_6196 | Protease | 383 |
| STREP_6570 | Protease | 701 |
| STREP_6821 | Putative protease | 352 |
| STREP_7179 | Serine protease | 635 |
| STREP_7580 | Protease | 856 |
| STREP_0436 | Exodeoxyribonuclease VII, large subunit | 403 |
| STREP_0437 | Exodeoxyribonuclease VII small subunit | 91 |
| STREP_1352 | Exodeoxyribonuclease III Xth | 268 |
| STREP_1969 | TatD-related deoxyribonuclease | 1,969 |
| STREP_2155 | Deoxyribonuclease V | 220 |
| STREP_2430 | Deoxyribonuclease/rho motif-related TRAM | 452 |
| STREP_4206 | Deoxyribonuclease | 776 |
| STREP_6678 | Probable endonuclease 4 - Endodeoxyribonuclease | 275 |
| STREP_2729 | Chitinase, glycosyl hydrolase 18 family | 628 |
| STREP_5817 | Chitinase, glycosyl hydrolase 18 family | 424 |
| STREP_5513 | Carbohydrate-binding CenC domain protein | 577 |
| STREP_3508 | Glycoside hydrolase family protein | 609 |
| STREP_4257 | Putative endochitinase | 350 |
| STREP_6187 | Chitinase, glycosyl hydrolase 19 family | 297 |
| STREP_6188 | Chitinase, glycosyl hydrolase 19 family | 291 |
| STREP_1696 | Glycoside hydrolase starch-binding protein | 573 |
| STREP_5789 | Secreted alpha-amylase | 458 |
| STREP_7405 | Malto-oligosyltrehalose synthase | 834 |
| STREP_1697 | Alpha-1,6-glucosidase, pullulanase-type | 1,774 |