| Literature DB >> 33022985 |
Adeel Malik1,2, Yu Ri Kim1, Seung Bum Kim1.
Abstract
The genus Streptacidiphilus represents a group of acidophilic actinobacteria within the family Streptomycetaceae, and currently encompasses 15 validly named species, which include five recent additions within the last two years. Considering the potential of the related genera within the family, namely Streptomyces and Kitasatospora, these relatively new members of the family can also be a promising source for novel secondary metabolites. At present, 15 genome data for 11 species from this genus are available, which can provide valuable information on their biology including the potential for metabolite production as well as enzymatic activities in comparison to the neighboring taxa. In this study, the genome sequences of 11 Streptacidiphilus species were subjected to the comparative analysis together with selected Streptomyces and Kitasatospora genomes. This study represents the first comprehensive comparative genomic analysis of the genus Streptacidiphilus. The results indicate that the genomes of Streptacidiphilus contained various secondary metabolite (SM) producing biosynthetic gene clusters (BGCs), some of them exclusively identified in Streptacidiphilus only. Several of these clusters may potentially code for SMs that may have a broad range of bioactivities, such as antibacterial, antifungal, antimalarial and antitumor activities. The biodegradation capabilities of Streptacidiphilus were also explored by investigating the hydrolytic enzymes for complex carbohydrates. Although all genomes were enriched with carbohydrate-active enzymes (CAZymes), their numbers in the genomes of some strains such as Streptacidiphilus carbonis NBRC 100919T were higher as compared to well-known carbohydrate degrading organisms. These distinctive features of each Streptacidiphilus species make them interesting candidates for future studies with respect to their potential for SM production and enzymatic activities.Entities:
Keywords: CAZyme; Kitasatospora; NRPS; PKS; Streptacidiphilus; Streptomyces; Streptomycetaceae; acidophile; biodegradation; biosynthetic gene cluster; comparative genomics; pan-genome; specialized (secondary) metabolite
Year: 2020 PMID: 33022985 PMCID: PMC7601586 DOI: 10.3390/genes11101166
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Features of 11 Streptacidiphilus genomes and their comparison with representatives from Streptomyces and Kitasatospora. Species: 1, Streptacidiphilus albus JL83T; 2, Streptacidiphilus anmyonensis NBRC 103185T; 3, Streptacidiphilus bronchialis DSM 106435T; 4, Streptacidiphilus carbonis NBRC 100919T; 5, Streptacidiphilus jeojiense NRRL B-24555T; 6, Streptacidiphilus jiangxiensis NBRC 100920T; 7, Streptacidiphilus melanogenes NBRC 103184T; 8, Streptacidiphilus neutrinimicus NBRC 100921T; 9, Streptacidiphilus oryzae TH49T; 10, Streptacidiphilus pinicola KCTC 49008T; 11, Streptacidiphilus rugosus AM-16T; 12, Streptomyces albus DSM 41398T; 13, Streptomyces avermitilis MA-4680T; 14, Streptomyces coelicolor A3(2); 15, Kitasatospora azatica KCTC 9699T; 16, Kitasatospora mediocidica KCTC 9733T; 17, Kitasatospora setae KM-6054T.
| Species | BioProject Accession | Size (Mb) | No. of Contigs | GC Content (%) | CDS | Pseudogenes | tRNA (rRNA) | Accessory Genes | Unique Genes | No. of Biosynthetic Gene Clusters (BGCs) | Reference |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | PRJNA234786 | 9.91 | 1 | 71.8 | 8,166 | 366 | 84 (27) | 3,458 | 2,066 | 35 | [ |
| 2 | PRJDB3297 | 9.38 | 153 | 72 | 7941 | 453 | 73 (3) | 4678 | 759 | 30 | [ |
| 3 | PRJNA438532 | 7.09 | 1 | 72.70 | 5681 | 247 | 66 (24) | 1915 | 1665 | 19 | [ |
| 4 | PRJDB3297 | 8.45 | 139 | 71.3 | 7062 | 454 | 72 (3) | 3591 | 1162 | 32 | [ |
| 5 | PRJNA238534 | 9.15 | 144 | 71.5 | 7769 | 257 | 73 (19) | 3990 | 1376 | 18 | - |
| 6 | PRJDB3297 | 9.52 | 96 | 72 | 8285 | 76 | 69 (5) | 4545 | 1247 | 30 | [ |
| 7 | PRJDB3297 | 8.77 | 107 | 71.9 | 7481 | 390 | 74 (3) | 4547 | 554 | 31 | [ |
| 8 | PRJDB3297 | 8.41 | 184 | 71.9 | 7119 | 459 | 71 (3) | 4193 | 567 | 38 | [ |
| 9 | PRJNA234788 | 7.81 | 6 | 73.4 | 6526 | 272 | 63 (23) | 2447 | 1884 | 15 | [ |
| 10 | PRJNA475452 | 8.43 | 281 | 71.8 | 7440 | 311 | 68 (8) | 4137 | 846 | 35 | [ |
| 11 | PRJNA234778 | 9 | 4 | 71.8 | 7750 | 356 | 74 (27) | 4101 | 1171 | 22 | [ |
| 12 | PRJNA271625 | 8.38 | 1 | 72.60 | 7330 | 58 | 65 (18) | NA | NA | 35 | [ |
| 13 | PRJNA189 | 9.11 | 2 | 70.68 | 7446 | 0 | 68 (18) | NA | NA | 37 | [ |
| 14 | PRJNA242 | 9.05 | 1 | 72 | 8152 | 60 | 65 (18) | NA | NA | 29 | [ |
| 15 | PRJNA234862 | 8.27 | 3 | 71.6 | 6844 | 292 | 74 (29) | NA | NA | 30 | [ |
| 16 | PRJNA234781 | 8.68 | 7 | 71.9 | 7103 | 254 | 71 (33) | NA | NA | 32 | [ |
| 17 | PRJDA19951 | 8.78 | 1 | 74.20 | 7182 | 0 | 74 (27) | NA | NA | 38 | [ |
Figure 1Neighbor-joining tree based on 16S rRNA gene sequences showing the relationships between 15 currently recognized species of Streptacidiphilus. Numbers at the nodes represent the bootstrap support (%), and the scale bar the substitutions rates per nucleotide position.
Figure 2Distribution of COG categories in Streptacidiphilus genomes. (A) Full proteomes of Streptacidiphilus species, and for additional comparison representative strains from Kitasatospora and Streptomyces are added. (B) Core genome, (C) accessory genome, (D) unique genome, (E) average comparison, and (F) KEGG orthologs. The COG categories indicate RNA processing and modification (A), chromatin structure and dynamics (B), energy production and conversion (C), cell cycle control, cell division, chromosome partitioning (D), amino acid metabolism and transport (E), nucleotide metabolism and transport (F), carbohydrate metabolism and transport (G), coenzyme metabolism and transport (H), lipid metabolism and transport (I), translation, ribosomal structure and biogenesis (J), transcription (K), replication, recombination and repair (L), cell wall/membrane/envelop biogenesis (M), post-translational modification, protein turnover, chaperone functions (O), inorganic ion transport and metabolism (P), secondary metabolites biosynthesis, transport and catabolism (Q), function unknown (S), signal transduction mechanisms (T), intracellular trafficking, secretion, and vesicular transport (U), and defense mechanisms (V), respectively.
Figure 3The pan-genome of Streptacidiphilus based on 11 species. (A) Core vs. pan developmental plot of eleven Streptacidiphilus genomes. The blue line indicates an increase in the number of genes in a pan-genome on sequential addition of genomes, and suggests an open pan-genome for this genus. In contrast, the number of core genes shared across these genomes decreases with the sequential addition of genomes, indicated by the red line. (B) Plot representing the number of new unique genes as a function of the number of strains added sequentially.
Distribution of biosynthetic gene clusters (BGCs) in the genomes of Streptacidiphilus and their comparison with representative Kitasatospora and Streptomyces. Cluster type with an overall aggregate of 10 or above are shown only. Species: 1, S. albus JL83T; 2, S. anmyonensis NBRC 103185T; 3, S. bronchialis DSM 106435T; 4, S. carbonis NBRC 100919T; 5, S. jeojiense NRRL B-24555T; 6, S. jiangxiensis NBRC 100920T; 7, S. melanogenes NBRC 103184T; 8, S. neutrinimicus NBRC 100921T; 9, S. oryzae TH49T; 10, S. pinicola KCTC 49008T; 11, S. rugosus AM-16T; 12, Str. albus DSM 41398T; 13, Str. avermitilis MA-4680T; 14, Str. coelicolor A3(2); 15, K. azatica KCTC 9699T; 16, K. mediocidica KCTC 9733T; 17, K. setae KM-6054T.
| Cluster Type/Species | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Terpene | 3 | 4 | 3 | 6 | 5 | 6 | 4 | 4 | 8 | 5 | 5 | 2 | 7 | 5 | 3 | 3 | 5 | 78 |
| T1PKS | 0 | 6 | 0 | 8 | 1 | 1 | 6 | 11 | 0 | 3 | 0 | 3 | 4 | 1 | 2 | 5 | 1 | 52 |
| NRPS | 4 | 2 | 1 | 4 | 3 | 5 | 1 | 2 | 0 | 4 | 2 | 2 | 4 | 3 | 0 | 3 | 2 | 42 |
| Siderophore | 2 | 2 | 1 | 2 | 2 | 2 | 3 | 2 | 1 | 2 | 2 | 2 | 4 | 3 | 2 | 2 | 2 | 36 |
| Lantipeptide | 7 | 1 | 3 | 2 | 2 | 1 | 0 | 2 | 1 | 1 | 1 | 2 | 0 | 3 | 3 | 1 | 4 | 34 |
| Bacteriocin | 0 | 1 | 1 | 2 | 1 | 0 | 1 | 1 | 2 | 3 | 1 | 1 | 1 | 2 | 3 | 0 | 3 | 23 |
| Butyrolactone | 2 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 2 | 2 | 1 | 0 | 0 | 2 | 1 | 6 | 22 |
| T2PKS | 0 | 2 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 2 | 2 | 0 | 1 | 12 |
| T3PKS | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 12 |
| Lassopeptide | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 3 | 0 | 0 | 0 | 2 | 11 |
| T1PKS_NRPS | 1 | 0 | 3 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 10 |
| Thiopeptide | 2 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 1 | 0 | 0 | 2 | 0 | 0 | 9 |
| T1PKS_OtherKS | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 1 | 2 | 1 | 1 | 1 | 8 |
| Butyrolactone_OtherKS | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 6 |
| Other hybrids | 9 | 6 | 4 | 1 | 1 | 5 | 6 | 6 | 0 | 2 | 1 | 8 | 5 | 3 | 4 | 8 | 6 | 75 |
| Others | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 2 | 5 | 3 | 3 | 1 | 3 | 2 | 23 |
| Total | 35 | 30 | 19 | 32 | 18 | 30 | 31 | 38 | 15 | 35 | 22 | 35 | 37 | 29 | 30 | 32 | 38 | 506 |
Figure 4Most abundant known BGCs in Streptacidiphilus. Bar color indicates the classification of each cluster type in MIBiG database. Numbers above the bars indicate the range of percent similarity shown the clusters against known BGCs.
List of known BGCs observed only in Streptacidiphilus genomes.
| Known BGC | Organism Name | Cluster No. | Cluster Type | Similarity (%) | MIBiG Reference |
|---|---|---|---|---|---|
| Cacibiocin | 10 | Aminocoumarin | 64 | BGC0001154 | |
| Cyanopeptin | 30 | NRPS | 50 | BGC0000331 | |
| Eicoseicosapentaenoic acid | 18 | Otherks | 10 | BGC0000865 | |
| Erythrochelin | 18 | NRPS | 28 | BGC0000349 | |
| Frenolicin | 9 | T2PKS | 50 | BGC0000225 | |
| Lankacidin | 25 | Other | 26 | BGC0001100 | |
| Micromonolactam | 2 | T1PKS | 100 | BGC0000095 |
Distribution of carbohydrate-active enzymes (CAZymes) and their various families in Streptacidiphilus and their comparison with representative Streptomyces and Kitasatospora species. For comparison, the CAZy profile of a well-known cellulose degrading Streptomyces sp. SirexAA strain is also shown. Values in parentheses (columns 3–8) represent different types of CAZy families. GH = glycoside hydrolase, GT = glycosyl transferase, CE = carbohydrate esterases, PL = polysaccharide lyases, CBM = carbohydrate-binding modules, AA = auxiliary activities.
| Organism Name | Genes (%) | GH | GT | CE | PL | CBM | AA |
|---|---|---|---|---|---|---|---|
| 366 (4.48) | 145 (46) | 84 (14) | 76 (7) | 9 (6) | 120 (16) | 15 (5) | |
| 339 (4.27) | 134 (42) | 79 (14) | 71 (7) | 4 (2) | 108 (15) | 24 (6) | |
| 268 (4.72) | 132 (49) | 56 (13) | 60 (8) | 4 (3) | 86 (20) | 11 (4) | |
| 422 (5.98) | 235 (69) | 77 (13) | 77 (8) | 9 (5) | 152 (26) | 15 (6) | |
| 440 (5.66) | 236 (60) | 93 (12) | 74 (8) | 17 (5) | 129 (21) | 22 (6) | |
| 370 (4.47) | 171 (56) | 88 (14) | 70 (8) | 1 (1) | 100 (18) | 29 (6) | |
| 329 (4.40) | 133 (43) | 80 (14) | 64 (8) | 4 (3) | 96 (16) | 27 (6) | |
| 313 (4.40) | 130 (47) | 75 (14) | 62 (7) | 5 (4) | 83 (16) | 25 (6) | |
| 326 (5.00) | 154 (53) | 83 (15) | 64 (8) | 4 (4) | 48 (16) | 20 (5) | |
| 390 (5.24) | 190 (61) | 87 (13) | 59 (9) | 9 (7) | 131 (20) | 24 (5) | |
| 334 (4.31) | 147 (47) | 81 (14) | 66 (8) | 4 (4) | 99 (16) | 25 (6) | |
| 262 (3.57) | 120 (48) | 55 (11) | 54 (8) | 3 (3) | 30 (12) | 21 (7) | |
| 347 (4.52) | 169 (55) | 75 (13) | 60 (9) | 12 (9) | 75 (18) | 25 (7) | |
| 383 (4.70) | 187 (60) | 68 (14) | 71 (9) | 15 (11) | 98 (19) | 23 (7) | |
| 297 (4.67) | 137 (50) | 63 (13) | 63 (9) | 8 (5) | 80 (21) | 17 (5) | |
| 364 (5.34) | 172 (56) | 70 (15) | 67 (8) | 2 (2) | 160 (21) | 18 (5) | |
| 333 (4.69) | 128 (48) | 84 (14) | 71 (9) | 8 (2) | 96 (18) | 15 (4) | |
| 308 (4.29) | 122 (41) | 78 (13) | 62 (8) | 3 (2) | 100 (19) | 21 (5) |
Distribution of key CAZymes involved in the degradation of complex carbohydrates among Streptacidiphilus and their comparison with representative Streptomyces and Kitasatospora.
| Enzyme Class | CAZy Family | Main Activity | ||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cellulose degrading CAZymes | GH1 | β-Glucosidase, β-galactosidase | 4 | 3 | 5 | 3 | 3 | 6 | 3 | 3 | 4 | 6 | 4 | 3 | 3 | 5 | 6 | 5 | 2 | 4 |
| GH3 | β-Glucosidases, β-D-xylopyranosidase | 8 | 7 | 8 | 9 | 14 | 10 | 8 | 7 | 11 | 9 | 7 | 6 | 5 | 9 | 7 | 7 | 2 | 4 | |
| GH5 | Endo-β-1,4-glucanase, β-mannosidase | 2 | 5 | 5 | 11 | 5 | 7 | 5 | 6 | 1 | 9 | 5 | 1 | 3 | 5 | 3 | 3 | 6 | 2 | |
| GH6 | Cellobiohydrolase, endo-β-1,4-glucanase | 3 | 1 | 4 | 1 | 4 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 4 | 3 | 1 | 3 | 2 | 3 | |
| GH8 | Endo-β-1,4-glucanase | 1 | 2 | 0 | 0 | 1 | 2 | 2 | 2 | 0 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | |
| GH9 | Endo-β-1,4-glucanase | 2 | 0 | 2 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 2 | 1 | 0 | 1 | 2 | |
| GH12 | Endo-β-1,4-glucanase | 0 | 0 | 1 | 2 | 2 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 3 | 2 | 1 | 1 | 0 | 0 | |
| GH44 | Endoglucanase | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| GH48 | Cellobiohydrolase | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | |
| GH74 | Endoglucanase | 1 | 3 | 1 | 4 | 4 | 2 | 2 | 2 | 3 | 3 | 4 | 0 | 4 | 3 | 1 | 2 | 1 | 1 | |
| Subtotal | 22 | 21 | 28 | 32 | 35 | 31 | 22 | 22 | 21 | 33 | 26 | 12 | 23 | 30 | 21 | 22 | 16 | 18 | ||
| Hemicellulose degrading CAZymes | GH2 | β-Galactosidase, β-glucuronidase | 1 | 1 | 2 | 4 | 4 | 1 | 1 | 0 | 6 | 2 | 2 | 4 | 6 | 7 | 4 | 1 | 1 | 0 |
| GH10 | Endo-β-1,4-xylanase | 0 | 0 | 3 | 3 | 1 | 1 | 0 | 2 | 3 | 2 | 1 | 0 | 2 | 2 | 1 | 3 | 2 | 3 | |
| GH11 | Endo-β-1,4-xylanase | 0 | 0 | 2 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 2 | 1 | 1 | 0 | 1 | |
| GH16 | Xyloglycosyltransferase | 10 | 5 | 3 | 8 | 9 | 5 | 8 | 4 | 3 | 6 | 7 | 4 | 3 | 5 | 6 | 9 | 10 | 7 | |
| GH26 | Endo-β-1,4-mannanase | 0 | 2 | 0 | 3 | 3 | 5 | 3 | 3 | 0 | 3 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | |
| GH30 | Endo-β-1,4-xylanase | 0 | 3 | 3 | 3 | 2 | 2 | 3 | 2 | 1 | 2 | 0 | 2 | 3 | 1 | 1 | 2 | 1 | 1 | |
| GH31 | α-Glucosidases, α-xylosidase | 0 | 0 | 1 | 4 | 4 | 1 | 0 | 0 | 5 | 1 | 0 | 2 | 2 | 2 | 4 | 4 | 0 | 0 | |
| GH39 | α-L-iduronidase, β-xylosidase | 3 | 0 | 0 | 2 | 2 | 1 | 1 | 1 | 2 | 0 | 2 | 0 | 2 | 0 | 0 | 1 | 0 | 0 | |
| GH42 | β-Galactosidase | 0 | 2 | 2 | 6 | 4 | 5 | 2 | 1 | 2 | 6 | 2 | 0 | 4 | 2 | 3 | 2 | 1 | 0 | |
| GH43 | α-l-arabinofuranosidase, β-xylosidase | 1 | 1 | 3 | 4 | 5 | 1 | 0 | 0 | 4 | 2 | 3 | 2 | 6 | 6 | 6 | 6 | 1 | 2 | |
| GH53 | Endo-β-1,4-galactanase | 1 | 0 | 0 | 7 | 4 | 3 | 2 | 4 | 3 | 8 | 1 | 0 | 0 | 0 | 0 | 5 | 4 | 0 | |
| Subtotal | 16 | 14 | 19 | 45 | 38 | 25 | 20 | 17 | 30 | 32 | 18 | 15 | 29 | 28 | 27 | 35 | 20 | 14 | ||
| Chitin and chitosan degrading CAZymes | GH18 | Chitinase | 14 | 12 | 6 | 12 | 17 | 16 | 12 | 10 | 7 | 12 | 14 | 5 | 8 | 12 | 10 | 14 | 12 | 19 |
| GH19 | Chitinase | 1 | 2 | 2 | 1 | 2 | 2 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 2 | 3 | 0 | 1 | 2 | |
| GH20 | Exo-β- | 2 | 4 | 2 | 3 | 3 | 5 | 4 | 2 | 4 | 4 | 4 | 5 | 3 | 4 | 2 | 2 | 2 | 2 | |
| GH46 | Endo-β-1,4-chitosanase | 1 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 2 | 2 | 2 | 3 | 1 | 0 | 3 | |
| Subtotal | 18 | 18 | 11 | 17 | 23 | 23 | 17 | 13 | 12 | 17 | 19 | 13 | 13 | 20 | 18 | 17 | 15 | 26 | ||
| Oxidative enzymes | AA10 | Lytic polysaccharide monooxygenase (LPMO) | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 4 | 7 | 6 | 0 | 0 | 5 |