| Literature DB >> 31848398 |
G Othoum1, S Prigent2, A Derouiche2, L Shi2, A Bokhari3, S Alamoudi4, S Bougouffa1, X Gao1, R Hoehndorf1, S T Arold1, T Gojobori1,3, H Hirt3, F F Lafi1,5, J Nielsen2,6,7, V B Bajic1, I Mijakovic8,9, M Essack10.
Abstract
Recent advancements in the use of microbial cells for scalable production of industrial enzymes encourage exploring new environments for efficient microbial cell factories (MCFs). Here, through a comparison study, ten newly sequenced Bacillus species, isolated from the Rabigh Harbor Lagoon on the Red Sea shoreline, were evaluated for their potential use as MCFs. Phylogenetic analysis of 40 representative genomes with phylogenetic relevance, including the ten Red Sea species, showed that the Red Sea species come from several colonization events and are not the result of a single colonization followed by speciation. Moreover, clustering reactions in reconstruct metabolic networks of these Bacillus species revealed that three metabolic clades do not fit the phylogenetic tree, a sign of convergent evolution of the metabolism of these species in response to special environmental adaptation. We further showed Red Sea strains Bacillus paralicheniformis (Bac48) and B. halosaccharovorans (Bac94) had twice as much secreted proteins than the model strain B. subtilis 168. Also, Bac94 was enriched with genes associated with the Tat and Sec protein secretion system and Bac48 has a hybrid PKS/NRPS cluster that is part of a horizontally transferred genomic region. These properties collectively hint towards the potential use of Red Sea Bacillus as efficient protein secreting microbial hosts, and that this characteristic of these strains may be a consequence of the unique ecological features of the isolation environment.Entities:
Mesh:
Year: 2019 PMID: 31848398 PMCID: PMC6917714 DOI: 10.1038/s41598-019-55726-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of sequencing features of the ten Red Sea genomes.
| Strain | Species based on 16S RNA assignment | Source | GenBank accession number | Genome size (Mb) | No. contigs | No. ORFs | No. rRNA genes (5S, 16S, 23S) |
|---|---|---|---|---|---|---|---|
| Bac48 | MN | CP023666 | 4.46 | 1 | 4366 | 24 | |
| Bac84 | MM | P023665 | 4.38 | 1 | 4306 | 24 | |
| Bac44 | MN | CP033044- CP033045 | 5.43 | 2 | 5639 | 34 | |
| Bac144 | BS | CP033051 | 4.59 | 1 | 4440 | 39 | |
| Bac94 | MM | CP033043 | 5.23 | 1 | 5055 | 58 | |
| Bac111 | MM | CP033052 | 3.96 | 1 | 3899 | 30 | |
| Bac57 | MN | CP033053- CP033054 | 4.23 | 2 | 4107 | 27 | |
| Bac330 | MN | CP033048 | 4.46 | 1 | 4221 | 18 | |
| Bac332 | MN | CP033046- CP033047 | 4.56 | 2 | 4492 | 18 | |
| Bac324 | MN | CP033049- CP033050 | 4.06 | 2 | 4306 | 24 |
Figure 1Phylogenetic tree of the ten Red Sea strains and 30 other species. The Red Sea species are displayed in grey while other previously sequenced strains are displayed in black.
Predicted genomic islands and prophage regions in the Red Sea genomes showing their overlap with biosynthetic genes
| Genome | GI % | Prophage % | Overlap with biosynthetic genes | Assigned product | |
|---|---|---|---|---|---|
| Number of genes | Cluster type(s) | ||||
| 5.05 | 2.43 | 38 | terpene, trans-acyltransferase PKS/NRPS | — | |
| 5.77 | 3.20 | 21 | terpene, bacteriocin | — | |
| 2.89 | 2.75 | 0 | |||
| 5.29 | 0.61 | 53 | terpene | — | |
| 4.31 | 0.28 | 19 | terpene | — | |
| 6.19 | 0.88 | — | |||
| 9.91 | 7.04 | 46 | phosphonate, trans-acyltransferase PKS/NRPS, lantipeptide, NRPS | Bacillaene | |
| 6.95 | 3.11 | 43 | NRPS, NRPS-PKS | — | |
| 9.39 | 4.11 | 72 | trans-acyltransferase PKS/NRPS, NRPS | ||
| 5.19 | 3.68 | 0 | — | ||
Figure 2Evaluation of sporulation (A) and protein secretion (B) in the Red Sea strains through gene prediction and in vitro measurements. Strains with the third TatA subunit missing, are indicated with *.
Figure 3Metabolic networks reconstruction. The left part of the figure corresponds to the number of reactions present in the metabolic networks. The right part of the figure corresponds to a clustering performed on the 32 species based on the presence and absence of reactions. The 32 species have been divided into seven metabolic clades based on this clustering.
Figure 4Number of predicted secondary metabolic gene clusters in the genomes of the Red Sea strains (red) and 22 other Bacillus genomes (blue).
Figure 5Co-localization of gene clusters using the normalized position for each SMGC based on the length of the chromosome and the position of the middle of the SMGC. Plain lines join co-localized SMGCs, dotted lines join SMGCs that are not co-localized.