| Literature DB >> 31763402 |
Huong Van Nguyen1, Phung Minh Truong1, Huy Thuc Duong2, Hiep Minh Dinh3, Chuong Hoang Nguyen4.
Abstract
We report here the biosynthesis of daidzein in Streptomyces sp. SS52, its genome sequence and the analysis of its genome for finding putative genes involved in daidzein biosynthesis. The Streptomyces sp. SS52 strain was isolated from the plant Phyllanthus urinaria in Tra Vinh province, Vietnam. This endophytic strain is capable of producing the isoflavone daidzein in the culture medium. Streptomyces sp. SS52 possesses a linear genome of 8,184,045 bp and the GC content of this genome is 72.5%. The preliminary genome analysis identified homologs of genes involved in the de novo biosynthesis of daidzein in the genome of Streptomyces sp. SS52. The genome sequencing of Streptomyces sp. SS52 was essential for the study of the biosynthesis of daidzein in Streptomyces bacteria.Entities:
Keywords: Daidzein; Genome sequence; NGS; SS52; Streptomyces
Year: 2019 PMID: 31763402 PMCID: PMC6859222 DOI: 10.1016/j.dib.2019.104746
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
The 1H NMR and13C NMR spectra of daidzein.
| 1 (DMSO‑ | ||
|---|---|---|
| δΗ, J (Hz) | ||
| 1 | ||
| 2 | 8.28, br | 152.9 |
| 3 | 104.1 | |
| 4 | 174.6 | |
| 5 | 7.96, d, 8.5 | 127.0 |
| 6 | 6.92, dd, 8.5, 2.0 | 115.0 |
| 7 | 162.3 | |
| 8 | 6.86, d, 2.0 | 101.8 |
| 9 | 156.9 | |
| 10 | 104.8 | |
| 5-OH | ||
| 7-OH | ||
| 8-OH | ||
| 1’ | 122.6 | |
| 2’/6’ | 7.37, d, 8.5 | 130.0 |
| 3’/5’ | 6.79, d, 8.5 | 114.9 |
| 4’ | 157.4 | |
| 4′-OH | 9.51, br | |
Features of the genome of Streptomyces sp. SS52.
| Feature | |
|---|---|
| Source of isolation | |
| Genome size (bp) | 8,184,045 |
| GC content (%) | 72.5 |
| Gene total | 7320 |
| Protein coding sequences | 6843 |
| tRNA | 67 |
| rRNA | 6 (5S), 6 (16S), 6 (23S) |
| Pseudogenes | 389 |
Putative genes involved in the biosynthesis of daidzein in the Streptomyces sp. SS52 genome.
| Locus tag | Genome position | Annotated function | Analogous protein | Amino acid identity (%) | Functionally conserved amino acids (%) |
|---|---|---|---|---|---|
| E5N77_22775 ( | 4999095..5000633 | Histidine ammonia-lyase | Phenylalanine ammonia-lyase ( | 31 | 49 |
| E5N77_23955 | 5262703..5264088 | Cytochrome P450 | Cinnamate 4-hydroxylase ( | 23 | 40 |
| E5N77_15975 | 3550902..3552470 (complement) | 4-coumarate-CoA ligase family protein | 4-coumarate-CoA ligase ( | 42 | 60 |
| E5N77_05755 | 1315157..1316263 | Type III polyketide synthase | Chalcone synthase ( | 28 | 45 |
| E5N77_07535 | 1678955..1679926 (complement) | Aldo/keto reductase | Chalcone reductase ( | 35 | 54 |
| E5N77_05760 | 1316260..1317474 | Cytochrome P450 | Cytochrome P450 ( | 34 | 48 |
| E5N77_23955 | 5262703..5264088 | Cytochrome P450 | Isoflavone synthase ( | 23 | 40 |
Specification Table
| Subject | Biology |
| Specific subject area | Microbiology, Genomics, Biotechnology |
| Type of data | Table, complete genome sequence |
| How data were acquired | Chromatographic techniques, Nuclear Magnetic Resonance (NMR) spectroscopy, genome sequencing by PacBio Sequel and Illumina HiSeq 4000 |
| Data format | Raw and Analyzed |
| Parameters for data collection | Isolation and identification of daidzein in the culture of |
| Description of data collection | |
| Data source location | Center for Research and Application in Bioscience, Ho Chi Minh City, Vietnam |
| Data accessibility | Data are available within this article and the genome sequence of |
The genome sequence data of In Data presented here could contribute to clarify the molecular mechanism of daidzein biosynthesis which is now poorly understood in |