| Literature DB >> 35323947 |
Mindong Liang1, Leshi Liu1, Fei Xu2, Xiaoqian Zeng1, Runjun Wang3, Jinling Yang2, Weishan Wang4, Loganathan Karthik1, Jiakun Liu5, Zhiheng Yang1, Guoliang Zhu1, Shuliu Wang1, Linquan Bai6, Yaojun Tong6, Xueting Liu1, Min Wu7, Li-Xin Zhang1, Gao-Yi Tan1.
Abstract
Direct cloning of biosynthetic gene clusters (BGCs) from microbial genomes facilitates natural product-based drug discovery. Here, by combining Cas12a and the advanced features of bacterial artificial chromosome library construction, we developed a fast yet efficient in vitro platform for directly capturing large BGCs, named CAT-FISHING (CRISPR/Cas12a-mediated fast direct biosynthetic gene cluster cloning). As demonstrations, several large BGCs from different actinomycetal genomic DNA samples were efficiently captured by CAT-FISHING, the largest of which was 145 kb with 75% GC content. Furthermore, the directly cloned, 110 kb long, cryptic polyketide encoding BGC from Micromonospora sp. 181 was then heterologously expressed in a Streptomyces chassis. It turned out to be a new macrolactam compound, marinolactam A, which showed promising anticancer activity. Our results indicate that CAT-FISHING is a powerful method for complicated BGC cloning, and we believe that it would be an important asset to the entire community of natural product-based drug discovery.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35323947 PMCID: PMC8989516 DOI: 10.1093/nar/gkac181
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Figure 1.Schematic diagram of CAT-FISHING to capture cryptic BGCs for NP discovery. LHA: left homology arm; RHA: right homology arm; BGC: biosynthetic gene cluster; BAC: bacterial artificial chromosome; oriS and oriV are two origins of replication in the BAC plasmid; cmr: chloramphenicol resistance gene; crRNA: CRISPR RNA; PAM: protospacer adjacent motif; LC/MS: liquid chromatography–tandem mass spectrometry; NMR: nuclear magnetic resonance.
Figure 2.Direct cloning of a 50-kb and an 80-kb DNA fragment from pBAC-ZL through a Cas12a-based cloning strategy. (A) Two different target segments with different lengths (50 and 80 kb) in the BAC plasmid. (B) Analysis of Cas12a-digested BAC plasmid (pBAC-ZL) by PFGE. PFGE was performed with 0.5% agarose at 6 V/cm with a 1–25 s switching pulse time for 16–18 h in 0.5× TBE buffer. MidRange PFG Marker (NEB Biolabs) was used as a molecular size marker. (C, D) Determination of the clone numbers and positive rates for the DNA fragments of different lengths in three independent experiments/plates. The data shown are mean ± SD.
Figure 3.Direct cloning of large BGCs from genomic DNA by CAT-FISHING. (A) Workflow of cloning large BGCs from genomic DNA by CAT-FISHING. (B) Three different target BGCs of different lengths (49, 87 and 145 kb) in the chromosome DNA of S. albus J1074. (C) Analysis of CRISPR/Cas12a-digested genomic DNA of S. albus J1074 by PFGE. Three pairs of crRNAs were used for genomic DNA digestion to obtain the resulting target DNA fragment. Bands of target BGC are indicated by red arrows. (D) Validation of positive clones containing a paulomycin (pBAC2015-49kb-J1074), surugamide (pBAC2015-87kb-J1074) or candicidin (pBAC2015-145kb-J1074) gene cluster, on the basis of PFGE. PFGE was performed with 0.5% agarose at 6 V/cm with a 1–25 s switching pulse time for 16–18 h in 0.5× TBE buffer. MidRange PFG Marker (NEB Biolabs) was used as a molecular size marker.
The cloning efficiencies of target BGCs from different actinomycetal genomic DNA samples
| Cloning performance | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Genome features | Target BGC features | Route 1 (simple and fast) | Route 2 (high efficiency)a | |||||||
| Strains and target NPs | Size | (G + C)% | Type | Length | (G + C)% | Correct/ checkedb | Efficiency | Correct/ checkedc | Efficiency | BGC integrityd |
|
| 7.1 Mb | 73% | ||||||||
| Demeclocycline | PKS | 41 kb | 73% | 11/20 | ∼55% | NA | NA | 100% | ||
|
| 8.9 Mb | 69% | ||||||||
| Spinosad | PKS | 80 kb | 67% | 6/20 | ∼30% | NA | NA | 100% | ||
|
| 9.6 Mb | 68% | ||||||||
| Butenyl-spinosyn | PKS | 79 kb | 66% | 5/20 | ∼25% | NA | NA | 100% | ||
|
| 10.5 Mb | 71% | ||||||||
| Avermectin | PKS | 80 kb | 69% | 5/20 | ∼25% | NA | NA | 100% | ||
|
| 6.8 Mb | 73% | ||||||||
| Paulomycin | Quinone glycoside | 49 kb | 71% | 8/20, 5/20, 7/20 | 32 ± 7% | 7/10 | ∼70% | 100% | ||
| Surugamide | NRPS | 87 kb | 76% | 4/20, 4/20, 5/20 | 21 ± 3% | 7/10 | ∼70% | 100% | ||
| Candicidin | PKS | 145 kb | 75% | 2/20, 1/20, 2/24 | 8 ± 2% | 5/10 | ∼50% | 100% | ||
|
| 7.6 Mb | 73% | ||||||||
| Marinolactam A | PKS | 110 kb | 72% | 5/20 | ∼25% | NA | NA | 100% | ||
aRoute 2: target DNA fragment isolation and purification with PFGE (Supplementary Figure S10).
bA total of 20 or 24 clones were randomly selected from each agar plate for PCR screening.
cA total of 10 clones were randomly selected from agar plate for PCR screening.
dConfirmed by digestion or PFGE of three to five PCR-validated clones.
Figure 4.Heterologous biosynthesis of surugamide, spinosad and candicidin in S. albus Del14. Identification of surugamide, spinosad and candicidin in the fermentation broth of S. albus Del14-87kb (A), S. albus Del14-81kb (B) and S. albus Del14-145kb (C), respectively, by HR-MS. All the accurate mass measurements for impurities were within <5 ppm of the theoretical m/z value.
Figure 5.Cloning and heterologous expression of a 110-kb polyketide BGC from Micromonospora sp. 181. (A) The predicted 110-kb marinolactam gene cluster, determined by antiSMASH analysis. (B) PCR verification of S. albus Del14-110kb (Δ14-110kb). Five transformants were randomly selected. F13 (733 bp), F14 (744 bp) and F15 (745 bp) are the PCR products amplified with 110-scr-up-F/R, 110-scr-middle-F/R and 110-scr-down-F/R, respectively. ‘−’ represents control genomic DNA of S. albus Del14, used as a PCR template. ‘+’ represents positive control genomic DNA of Micromonospora sp. 181, used as a PCR template. (C) Detection of NPs by LC–MS in S. albus Del14-110kb ([M + H]+ = 500.3027 Da). (D) The structure of marinolactam A. A unique quaternary carbon chiral center at the C20 position was formed; the extra C28 methylation is highlighted in red (the detailed structure information can be found in Supplementary Table S8). (E) NMR correlations of marinolactams. (F) Calculated (red) and experimental ECD (blue) spectra for marinolactam A in MeOH.
Comparison of selected methods for large DNA fragment (>100 kb) cloning
| Methods | ULCC (high GC)a | CE | TC | Cost | Reference |
|---|---|---|---|---|---|
| BAC library | >120 kb | ∼1/1000 | >1 month | >$5000b | ( |
| CATCH | 36 kbc | >90% | 3–4 days | NA | ( |
| ExoCET | 106 kb | 1/24 | 1 week | NA | ( |
| CAPTURE | 113 kb | 100% | 3–4 days | NA | ( |
| CAT-FISHING | 145 kb | 50% | 3–4 days | ∼$40d | This study |
CE: cloning efficiency; TC: time consuming.
aUpper limit of cloning capacity (ULCC) for DNA sample with high GC content (>70%).
bPrice information for BAC library construction was provided by Wuhan Eightstars Bio-Technology Co. Ltd.
c36-kb jadomycin BGC from Streptomyces venezuelae (GC content: 72.3%) and 150-kb DNA fragment from E. coli (GC content: 50.6%) have been cloned by CATCH.
dSee Supplementary Tables S11 and S12.