| Literature DB >> 31071993 |
Laila Ziko1, Mustafa Adel2,3, Mohamed N Malash4,5, Rania Siam6.
Abstract
The recent rise in antibiotic and chemotherapeutic resistance necessitates the search for novel drugs. Potential therapeutics can be produced by specialized metabolism gene clusters (SMGCs). We mined for SMGCs in metagenomic samples from Atlantis II Deep, Discovery Deep and Kebrit Deep Red Sea brine pools. Shotgun sequence assembly and secondary metabolite analysis shell (antiSMASH) screening unraveled 2751 Red Sea brine SMGCs, pertaining to 28 classes. Predicted categorization of the SMGC products included those (1) commonly abundant in microbes (saccharides, fatty acids, aryl polyenes, acyl-homoserine lactones), (2) with antibacterial and/or anticancer effects (terpenes, ribosomal peptides, non-ribosomal peptides, polyketides, phosphonates) and (3) with miscellaneous roles conferring adaptation to the environment/special structure/unknown function (polyunsaturated fatty acids, ectoine, ladderane, others). Saccharide (80.49%) and putative (7.46%) SMGCs were the most abundant. Selected Red Sea brine pool sites had distinct SMGC profiles, e.g., for bacteriocins and ectoine. Top promising candidates, SMs with pharmaceutical applications, were addressed. Prolific SM-producing phyla (Proteobacteria, Actinobacteria, Cyanobacteria), were ubiquitously detected. Sites harboring the largest numbers of bacterial and archaeal phyla, had the most SMGCs. Our results suggest that the Red Sea brine niche constitutes a rich biological mine, with the predicted SMs aiding extremophile survival and adaptation.Entities:
Keywords: Red Sea brine pools; extremophiles; specialized metabolism gene clusters
Mesh:
Year: 2019 PMID: 31071993 PMCID: PMC6562949 DOI: 10.3390/md17050273
Source DB: PubMed Journal: Mar Drugs ISSN: 1660-3397 Impact factor: 5.118
Figure 1Study workflow for the analysis of specialized metabolism gene clusters in Red Sea brine pools. Water and sediment samples were earlier collected from Red Sea Atlantis II Deep (ATII), Discovery Deep (DD), Kebrit Deep (KD) brine pools and brine-influenced (NBI, NBII) sites. Metagenomic prokaryotic DNA was then extracted from each site and 454 shotgun sequencing was performed followed by read assembly. Taxonomic classification for archaeal and bacterial phyla was performed by protein-based phylogeny using the metagenomics rapid annotation using subsystems technology (MG-RAST) tool [37]. The metabolite analysis shell (AntiSMASH) tool was then used for annotation, for identifying specialized metabolism gene clusters (SMGCs) by translated amino acid sequence comparison with signature biosynthetic genes profile hidden Marcov Model (pHMMs), and for structure prediction of the specialized metabolites [9]. The Antibiotic Resistance Target Seeker (ARTS) tool was used to detect housekeeping and/or resistance genes within the SMGCs [38]. The predicted specialized metabolites were grouped by their potential functions into three major groups. Lastly, top candidate SMGCs were identified in the Red Sea brine dataset.
Assembly metrics denoted for each of the Red Sea brine pool and other hydrothermal vent samples.
| Description | Detailed Description | Sites | Reference Sampling & Read Sequences/Assembly | Number of Reads | Number of Reads after Trimming | Number of Assembled Reads | Number of Contigs > 500 bp | Average Contig Size (bp) | Largest Contig Size (bp) |
|---|---|---|---|---|---|---|---|---|---|
| Atlantis II water column (Water above brine pool) | Atlantis II 50 m water column | ATII 50 | [ | 1,461,910 | 1,461,904 | 582,768 | 36,262 | 1149 | 21,887 |
| Atlantis II 200 m water column | ATII 200 | 1,260,578 | 1,260,561 | 530,441 | 34,640 | 1131 | 25,392 | ||
| Atlantis II 700 m water column | ATII 700 | 1,128,514 | 1,128,507 | 554,335 | 32,860 | 1285 | 33,783 | ||
| Atlantis II 1500 m water column | ATII 1500 | 833,739 | 833,730 | 316,101 | 20,374 | 1331 | 51,927 | ||
| Atlantis II brine water | Atantis II brine-seawater interface | ATII INF | [ | 832,138 | 832,128 | 743,064 | 9933 | 1214 | 25,150 |
| Atantis II brine Upper Convective Layer | ATII UCL | 886,030 | 886,019 | 794,715 | 11,994 | 1454 | 103,389 | ||
| Atantis II brine Lower Convective Layer | ATII LCL | 4,104,966 | 4,104,994 | 3,901,967 | 19,165 | 2084 | 350,936 | ||
| Discovery Deep brine water | Discovery Deep brine-seawater interface | DD INF | [ | 1,095,181 | 1,095,157 | 752,025 | 14,144 | 1201 | 28,080 |
| Discovery Deep brine water | DD BR | 1,111,044 | 1,111,032 | 763,387 | 15,306 | 1216 | 22,118 | ||
| Kebrit Deep brine water | Kebrit Deep Upper brine-seawater interface | KD UINF | [ | 1,562,521 | 1,562,512 | 1,020,749 | 24,517 | 1495 | 58,542 |
| Kebrit Deep Lower brine-seawater interface | KD LINF | 1,510,272 | 1,510,262 | 926,337 | 31,983 | 1241 | 38,825 | ||
| Kebrit Deep brine water | KD BR | 1,379,832 | 1,379,814 | 913,803 | 22,280 | 945 | 14,864 | ||
| Other water metagenomes | Guaymas Basin deep-sea hydrothermal vent plume water | GB VNT | [ | 628,619 | 628,569 | 155,841 | 7654 | 1082 | 17,353 |
| Kueishantao shallow-sea hydrothermal vent (water above vent) | KSW VNT | [ | 261,446 | 261,399 | 199,237 | 411 | 4685 | 179,360 | |
| Kueishantao shallow-sea hydrothermal vent (water) | K VNT | [ | 444,655 | 444,597 | 338,480 | 2194 | 1843 | 88,498 | |
| Juan de Fuca Ridge hydrothermal vent diffuse flow seawater | JDF VNT | [ | 226,981 | 226,916 | 35,357 | 9366 | 1135 | 9366 | |
| Sediments | Atlantis II sediments | ATII SDM | [ | 1,138,406 | 1,138,381 | 478,453 | 30,352 | 1194 | 33,674 |
| Discovery Deep sediments | DD SDM | 1,258,290 | 1,258,273 | 597,552 | 38,529 | 1233 | 38,081 | ||
| Non-brine sediments | NB SDM | 253,568 | 253,564 | 92,530 | 7292 | 1177 | 1315 | ||
| Other metagenome (biofilm) | Loki’s Castle deep-sea vent biofilm (microbial mat) | LC MM | [ | 717,550 | 717,135 | 525,719 | 7897 | 1546 | 42,387 |
| Total | 22,096,240 | 22,095,454 | 14,222,861 | 377,153 | - | - |
SMGCs abundance and the counts of archaeal and bacterial phyla in each site. SMGCs are named using the same acronyms as those used by by antiSMASH tool [45]. Hyphens indicate hybrid clusters. Cf_fatty_acid: fatty acid putative cluster. Cf_saccharide: saccharide putative cluster. Hserlactone: cluster coding for homoserine lactone. NRPS: cluster coding for non-ribosomal peptide synthetase. OtherKS: cluster coding for other types of polyketide synthases. Pufa: cluster coding for poly-unsaturated fatty acids. T1pks: type I polyketide synthase. T2pks: type II polyketide synthase. T3pks: type III polyketide synthase. Acyl_amino_acids: cluster coding for N-acyl amino acid.
| Detailed Description | Assembly | Number of SMGCs | Normalized Number of SMGCs * | Types of SMGCs | Number of Phyla | SMGCs Detected Uniquely Once at a Particular Site |
|---|---|---|---|---|---|---|
| Red Sea metagenomic samples: | ||||||
| Atlantis II 1500 m water column | ATII 1500 | 168 | 531.48 | 9 | 33 | Otherks-Pufa-T1pks, T2pks-Cf_fatty_acid |
| Atlantis II 700 m water column | ATII 700 | 269 | 485.27 | 8 | 32 | Otherks-Pufa, Otherks-T1pks |
| Kebrit Deep Lower brine-seawater interface | KD LINF | 334 | 360.56 | 9 | 33 | Cf_saccharide-Bacteriocin, Hserlactone |
| Atlantis II 200 m water column | ATII 200 | 170 | 320.49 | 8 | 30 | Cf_fatty_acid-Arylpolyene |
| Atantis II brine Upper Convective Layer | ATII UCL | 210 | 264.25 | 7 | 21 | |
| Atlantis II 50 m water column | ATII 50 | 146 | 250.53 | 8 | 30 | |
| Kebrit Deep Upper brine-seawater interface | KD UINF | 252 | 246.88 | 13 | 32 | Ladderane-Cf_fatty_acid, Nrps-T1pks, T1PKS |
| Atantis II brine-seawater interface | ATII INF | 162 | 218.02 | 6 | 19 | |
| Discovery Deep sediments | DD SDM | 114 | 190.78 | 4 | 23 | |
| Non-brine sediments | NB SDM | 16 | 172.92 | 4 | 9 | |
| Kebrit Deep brine water | KD BR | 149 | 163.05 | 4 | 23 | |
| Atlantis II sediments | ATII SDM | 70 | 146.30 | 7 | 26 | |
| Atantis II brine Lower Convective Layer | ATII LCL | 524 | 134.29 | 13 | 25 | Cf_fatty_acid-Cf_saccharide, Cf_saccharide-nrps, Phosphonate, T3pks-cf_saccharide |
| Discovery Deep brine-seawater interface | DD INF | 94 | 125.00 | 1 | 20 | |
| Discovery Deep brine water | DD BR | 73 | 95.63 | 2 | 20 | |
| Other metagenomic samples: | ||||||
| Guaymas Basin deep-sea hydrothermal vent plume water | GB VNT | 11 | 70.58 | 3 | 30 | Pufa |
| Kueishantao shallow-sea hydrothermal vent (water above vent) | KSW VNT | 11 | 55.21 | 4 | 12 | Thiopeptide |
| Kueishantao shallow-sea hydrothermal vent (water) | K VNT | 10 | 29.54 | 5 | 13 | |
| Juan de Fuca Ridge hydrothermal vent diffuse flow seawater | JDF VNT | 3 | 84.85 | 3 | 14 | |
| Loki’s Castle deep-sea vent biofilm (microbial mat) | LC MM | 26 | 49.46 | 5 | 23 | Acyl_amino_acids |
* Normalized number of SMGCs are the number of SMGCs detected at each site divided by the number of assembled reads *106.
Figure 2Overview of the specialized metabolism gene clusters encoded by the Red Sea brine pool metagenomes. The detected gene clusters are named as denoted by antiSMASH [9]. Normalized SMGC values were used.
The potential functions of the array of specialized metabolites encoded by Red Sea brine SMGCs. SMGCs are named as denoted by antiSMASH tool [45]. Hyphens indicate hybrid clusters. Cf_fatty_acid: fatty acid putative cluster. Cf_putative: unknown type putative cluster. Cf_saccharide: saccharide putative cluster. Hserlactone: cluster coding for homoserine lactone. Lantipeptide: cluster coding for lanthipeptide. NRPS: cluster coding for non-ribosomal peptide synthetase. OtherKS: cluster coding for other types of polyketide synthases. Pufa: cluster coding for poly-unsaturated fatty acids. T1pks: type I polyketide synthase. T2pks: type II polyketide synthase. T3pks: type III polyketide synthase.
| General Functional Classification: | Product (Enzyme) | Gene Cluster Names | Representative Basic Structure | Potential Function/Application of Product | Percentage of Total SMGCs | |
|---|---|---|---|---|---|---|
| 1. Products of predicted functions commonly abundant in microbes | Saccharide | Cf_saccharide | - | Microbe-host interactions e.g. lipopolysaccharides. Some saccharides that are diffusible were reported to have antibacterial activities [ | 80.61% | |
| Fatty Acid | Cf_fatty_acid | - | Structural functions and reported that composition can change as an adaptation to temperature and pressure in the environment [ | 7.69% | ||
| Aryl polyenes | Arylpolyene |
| Aryl polyene SMGCs found in abundance in Gram negative bacteria. Previously reported to have a protective role from damage caused by reactive oxygen species [ | 0.52% | ||
| Acyl-homoserine lactones | Hserlactone |
| Quorum sensing [ | 0.03% | ||
| 2. Subset of products with potential antibacterial and/or anticancer effects: | Terpenes | Terpene |
| A subset of the terpenes possesses antibacterial effect [ | 1.89% | |
| Peptides | Ribosomal peptides | Bacteriocin |
| Some have antibacterial activity, and some have selective cancer cytotoxic activity [ | 0.78% | |
| Non-ribosomal peptides | Cf_saccharide-nrps | - | Many non-ribosomal peptides have antibacterial (e.g., β-lactams) and anticancer (e.g. bleomycin) effects [ | 0.25% | ||
| Polyketides | (Type I Polyketide synthase) | Nrps-T1pks | - | A subset are responsible for antibiotic synthesis e.g. the type I polyketide synthase (PKSI) producing erythromycin [ | 0.2% | |
| (Type II Polyketide synthase) | T2pks-Cf_fatty_acid | - | Some type II polyketide synthase (PKSII) enzymes produce aromatic polyketide antibiotics e.g. oxytetracycline [ | 0.09% | ||
| (Type III Polyketide synthase) | T3pks | - | Type III Polyketide synthase (PKSIII) enzymes are capable of producing an array of compounds including pyrones—a subset of pyrones were previously reported to have antibacterial and anticancer effects [ | 0.31% | ||
| Phosphonates | Phosphonate |
| Some natural phosphonates are antibacterials e.g. fosfomycin. Also have structural functions [ | 0.01% | ||
| 3. Miscellaneous: products are predicted to confer adaptation to the environment/special structure/unknown function: | Others | Cf_putative | - | Some code for biosynthetic gene clusters of types that are still unknown [ | 8.13% | |
| Polyunsaturated fatty acids | Otherks-Pufa |
| Polyunsaturated fatty acids (PUFAs) are membrane adaptations to piezophiles, thermophiles and psychrophiles to prevent membrane crystallization [ | 0.14% | ||
| Ectoine | Ectoine |
| Halophilic adaptation & effective in vitro in preventing protein misfolding characteristic in diseases e.g. Alzheimer’s [ | 0.08% | ||
| Ladderane | Ladderane-Cf_fatty_acid |
| Unique component of anammoxosome membrane in anammox (anaerobic ammonium oxidizing) bacteria and potential biofuel [ | 0.05% | ||
Figure 3Promising Red Sea brine specialized metabolite candidates. Five core structures for specialized metabolites were predicted using antiSMASH 4.0 [45]: (A) KD brine–seawater Lower Interface (KD LINF) site had two non-ribosomal peptides predicted to be synthesized by two non-ribosomal peptide synthetase (NRPS) clusters. (B) KD brine–seawater Upper Interface (KD UINF) site had a non-ribosomal peptide predicted to be synthesized by a NRPS cluster and a hybrid polyketide-non-ribosomal peptide predicted to be synthesized by T1PKS-NRPS. (C) ATII 1500 site had a polyketide predicted to be produced by a T1pks-pufa-otherks hybrid cluster. Four top promising candidate SMGCs were detected by ARTS [38]: (D) ATII brine Lower Convective Layer (ATII LCL) site had two clusters encoding Terpene and T3PKS. (E) ATII 200 site had a cluster encoding aryl polyene. (F) ATII 1500 site had a cluster encoding Otherks-PUFA-T1PKS (PUFA: polyunsaturated fatty acid). ser: serine, nrp: non-ribosomal peptide, gln: glutamine, gly: glycine, asp: aspartate, thr: threonine, pk: polyketide, ohmal: hydroxy malate.
Figure 4Heat maps representing hierarchical classification of the SMGCs detected in Red Sea brine pool metagenomes. (A) Heat map for the Red Sea water samples SMGCs. (B) Heat map for the Red Sea sediment samples SMGCs. Hierarchical clustering was based on the relative abundance of normalized numbers of SMGCs detected at each site.
Figure 5Archaeal and bacterial phyla detected in the marine metagenomic dataset. The relative abundancies (represented as % of total detected phyla) detected by MG-RAST are presented. The phylum is represented if its relative abundance is ≥ 0.5% in at least one of the assemblies.