Literature DB >> 26580858

Genome mining of astaxanthin biosynthetic genes from Sphingomonas sp. ATCC 55669 for heterologous overproduction in Escherichia coli.

Tian Ma1, Yuanjie Zhou1, Xiaowei Li1, Fayin Zhu1, Yongbo Cheng1, Yi Liu1, Zixin Deng1, Tiangang Liu2,3,4.   

Abstract

As a highly valued keto-carotenoid, astaxanthin is widely used in nutritional supplements and pharmaceuticals. Therefore, the demand for biosynthetic astaxanthin and improved efficiency of astaxanthin biosynthesis has driven the investigation of metabolic engineering of native astaxanthin producers and heterologous hosts. However, microbial resources for astaxanthin are limited. In this study, we found that the α-Proteobacterium Sphingomonas sp. ATCC 55669 could produce astaxanthin naturally. We used whole-genome sequencing to identify the astaxanthin biosynthetic pathway using a combined PacBio-Illumina approach. The putative astaxanthin biosynthetic pathway in Sphingomonas sp. ATCC 55669 was predicted. For further confirmation, a high-efficiency targeted engineering carotenoid synthesis platform was constructed in E. coli for identifying the functional roles of candidate genes. All genes involved in astaxanthin biosynthesis showed discrete distributions on the chromosome. Moreover, the overexpression of exogenous E. coli idi in Sphingomonas sp. ATCC 55669 increased astaxanthin production by 5.4-fold. This study described a new astaxanthin producer and provided more biosynthesis components for bioengineering of astaxanthin in the future.
© 2015 The Authors. Biotechnology Journal published by WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Entities:  

Keywords:  Astaxanthin; Sphingomonas; biosynthesis; complete genome; discrete distribution

Mesh:

Substances:

Year:  2015        PMID: 26580858      PMCID: PMC5064606          DOI: 10.1002/biot.201400827

Source DB:  PubMed          Journal:  Biotechnol J        ISSN: 1860-6768            Impact factor:   4.677


phytoene synthase geranylgeranyl pyrophosphate synthase phytoene dehydrogenase lycopene cyclase β‐carotene hydroxylase β‐carotene ketolase dimethylallyl pyrophosphate enhanced green fluorescent protein farnesyl pyrophosphate genomic DNA geranylgeranyl diphosphate isopentenyl‐diphosphate delta‐isomerase isopentenyl pyrophosphate 2‐C‐Methyl‐D‐erythritol 4‐phosphate mevalonate PacBio corrected read single‐molecule, real‐time

Introduction

Astaxanthin (3,3′‐dihydroxy‐β‐carotene‐4,4′‐dione) belongs to a large class of phytochemicals known as terpenes. Studies have shown that astaxanthin has beneficial effects on human health, including cancer prevention, immune response enhancement, cardiovascular disease prevention, radiation resistance, and biological antioxidant protection 1. Recently, astaxanthin has been used in many applications, such as nutritional supplements, pharmaceuticals, cosmetics, food colorants, and animal feed additives 2, 3, 4, 5, 6, and the worldwide market of astaxanthin is expected to increase dramatically. Currently, astaxanthin is obtained by chemical synthesis or solvent‐based extraction from natural sources, such as the green alga Haematococcus pluvialis 7 and the yeast Xanthophyllomyces dendrorhous (formerly known as Phaffia rhodozyma) 8. A few studies have also evaluated astaxanthin production in bacteria such as Paracoccus 9 and Brevundimonas genera 10. However, astaxanthin production is limited by low yields and limited natural sources. Therefore, researchers have focused on discovering novel microbial producers of astaxanthin to meet the growing demand for astaxanthin. In biological systems, astaxanthin contains a C40 methyl‐branched hydrocarbon backbone derived from successive condensation of eight C5 isoprene units. The C5 isoprene unit IPP, can be generated from an acetyl‐CoA precursor via the MVA pathway 11 or from pyruvate and glyceraldehyde‐3‐phosphate precursors via the MEP pathway 12. In these two pathways, the extended unit IPP is isomerized to DMAPP by IPP isomerase, which is encoded by the idi gene. DMAPP is then condensed with the extended unit IPP to form astaxanthin. Six proteins participate in this condensation process: CrtE, CrtB, CrtI, CrtY, CrtW, and CrtZ (Fig. 1).
Figure 1

Proposed astaxanthin biosynthetic pathway in Sphingomonas sp. ATCC 55669. (A) In the strain Sphingomonas sp. ATCC 55669, upstream events via the MEP pathway to get the precursors IPP and DMAPP. (B) The downstream carotenoid pathway involved the conversion of intermediates IPP and DMAPP to astaxanthin. DXS, 1‐deoxy‐D‐xylulose‐5‐phosphate synthase; DXR, 1‐deoxy‐D‐xylulose‐5‐phosphate reductoisomerase; IspD, 2‐C‐methyl‐D‐erythritol 4‐phosphate cytidylyltransferase; IspE, 4‐diphosphocytidyl‐2‐C‐methyl‐D‐erythritol kinase; IspF, 2‐C‐methyl‐D‐erythritol 2,4‐cyclodiphosphate synthase; IspG, (E)‐4‐hydroxy‐3‐methylbut‐2‐enyl‐diphosphate synthase; IspH, 4‐hydroxy‐3‐methylbut‐2‐enyl diphosphate reductase. G‐3‐P,D‐glyceraldehyde 3‐phosphate; DXP, 1‐deoxy‐D‐xylulose 5‐phosphate; CDP‐ME, 4‐(cytidine 5'‐diphospho)‐2‐C‐methyl‐D‐erythritol; CDP‐ME2P, 2‐phospho‐4‐(cytidine 5'‐diphospho)‐2‐C‐methyl‐D‐erythritol; ME‐cPP, 2‐C‐methyl‐D‐erythritol 2,4‐cyclodiphosphate; HMBPP, 1‐hydroxy‐2‐methyl‐2‐butenyl 4‐diphosphate.

Proposed astaxanthin biosynthetic pathway in Sphingomonas sp. ATCC 55669. (A) In the strain Sphingomonas sp. ATCC 55669, upstream events via the MEP pathway to get the precursors IPP and DMAPP. (B) The downstream carotenoid pathway involved the conversion of intermediates IPP and DMAPP to astaxanthin. DXS, 1‐deoxy‐D‐xylulose‐5‐phosphate synthase; DXR, 1‐deoxy‐D‐xylulose‐5‐phosphate reductoisomerase; IspD, 2‐C‐methyl‐D‐erythritol 4‐phosphate cytidylyltransferase; IspE, 4‐diphosphocytidyl‐2‐C‐methyl‐D‐erythritol kinase; IspF, 2‐C‐methyl‐D‐erythritol 2,4‐cyclodiphosphate synthase; IspG, (E)‐4‐hydroxy‐3‐methylbut‐2‐enyl‐diphosphate synthase; IspH, 4‐hydroxy‐3‐methylbut‐2‐enyl diphosphate reductase. G‐3‐P,D‐glyceraldehyde 3‐phosphate; DXP, 1‐deoxy‐D‐xylulose 5‐phosphate; CDP‐ME, 4‐(cytidine 5'‐diphospho)‐2‐C‐methyl‐D‐erythritol; CDP‐ME2P, 2‐phospho‐4‐(cytidine 5'‐diphospho)‐2‐C‐methyl‐D‐erythritol; ME‐cPP, 2‐C‐methyl‐D‐erythritol 2,4‐cyclodiphosphate; HMBPP, 1‐hydroxy‐2‐methyl‐2‐butenyl 4‐diphosphate. Sphingomonas spp. are strictly aerobic, rod‐shaped, chemoheterotrophic gram‐negative bacteria that harbor glycosphingolipids, rather than lipopolysaccharides, in their cell membranes, which distinguishes them from traditional gram‐negative bacteria. Sphingomonas is widely distributed in many different aquatic and terrestrial habitats 13, and has been shown to be able to survive in low‐nutrient conditions and metabolize a broad range of organic compounds, including refractory environmental pollutants 5, 14, 16. In addition, some Sphingomonas can produce useful high value‐added products, such as carotenoids 17. Whole‐genome sequencing has been widely applied to decode the functional activities of microbes, and a complete, highly accurate genome sequence is a critical starting point for mining the genetic features. High‐throughput sequencing technologies play an important role in obtaining such genome sequences 18. Specifically, third‐generation sequencing (TGS) PacBio SMRT technology provides an improved microbial genome assembly method with extended read lengths and a lack of GC bias. The majority of errors during PacBio sequencing occur because of random indels 19, which can be mostly corrected by SMRT dozens‐fold coverage sequencing reads. However, a small number of indel errors can persist in PacBio‐assembled genomes, limiting the accuracy of gene prediction. In contrast, Illumina sequencing, a traditional next‐generation sequencing technology (TNGS), generates highly accurate short‐length reads with very few indel errors but with many gaps during de novo genome assembly 20, 21. Hybrid error correction takes advantage of the strengths of both sequencing approaches. In this study, we identified and characterized the astaxanthin‐producing ability of strain Sphingomonas sp. ATCC 55669 by complete genome sequencing using a combined PacBio‐Illumina approach and gene annotation. Our results described this new astaxanthin producer and provided important insights into astaxanthin biosynthesis in Sphingomonas, revealing novel biosynthesis components for bioengineering.

Materials and methods

Strains, media, and reagents

The strain Sphingomonas sp. ATCC 55669 was cultured in #272 nutrient glucose medium (8.0 g dehydrated nutrient broth [Difco‐BD, Corpus Christi, TX, USA] with 5.0 g glucose/L distilled H2O) at 26.0°C in the dark with shaking (220 rpm). E. coli K‐12 MG1655 (DE3) was cultured in Luria broth (LB; containing 10.0 g/L NaCl, 10.0 g/L tryptone, and 5.0 g/L yeast extract) at 30.0°C in the dark with shaking (220 rpm). E. coli XL1‐Blue was used to replicate plasmids and construct clones. All media were autoclaved at 115°C for 30 min. The strains used in this study are listed in Table 1. The astaxanthin standard was purchased from Aladdin (Seattle, WA, USA).
Table 1

Strains and plasmids used in this study.

StrainDescriptionSource
Sphingomonas sp. ATCC 55669Isolated from Quebec CanadaAmerican Type Culture Collection
E. coli K‐12 MG1655 (DE3)F λ ilvG‐ rfb‐50 rph‐1 (DE3)Prof. Kristala Prather from MIT
A1 E. coli K‐12 MG1655 (DE3): pMH1, pFZ81, pFZ153This study
A2 E. coli K‐12 MG1655 (DE3): pMH1, pFZ81, pTM3518This study
A3 E. coli K‐12 MG1655 (DE3): pMH1, pFZ81, pTM2108This study
A4 E. coli K‐12 MG1655 (DE3): pMH1, pFZ81, pTM0906This study
A5 E. coli K‐12 MG1655 (DE3): pMH1, pFZ81, pTM2930This study
A6 E. coli K‐12 MG1655 (DE3): pMH1, pFZ81, pTM1181This study
Plasmid Backbone Relevant genotype Source
pMH42pET28a(+)pT7: N‐terminal His‐tagged EGFP, Kan+ This study
pTMB2EpBBR1MCS‐2plac: EGFP, Kan+ This study
pTMB2IpBBR1MCS‐2plac: Idi, Kan+ This study
pGZIpET28a(+)pT7: N‐terminal His‐tagged Idi, Kan+ 26
pFZ87pETDuet‐1pT7: Idi, Amp+ This study
pFZ152pETDuet‐1pT7: CrtY, CrtZ, CrtW, Amp+ This study
pFZ21pET28a(+)pT7: N‐terminal His‐tagged CrtE, Kan+ 31
pFZ22pET28a(+)pT7: N‐terminal His‐tagged CrtB, Kan+ 31
pFZ23pET28a(+)pT7: N‐terminal His‐tagged CrtI, Kan+ 31
pFZ112pETDuet‐1pT7: CrtE, CrtI, CrtB, Amp+ This study
pFZ153pETDuet‐1pT7: CrtE, CrtI, CrtB, pT7: Idi, CrtY, CrtZ, CrtW, Amp+ This study
pTM3518pETDuet‐1pT7: CrtEsp1, CrtI, CrtB, pT7: Idi, CrtY, CrtZ, CrtW, Amp+ This study
pTM2108pETDuet‐1pT7: CrtEsp2, CrtI, CrtB, pT7: Idi, CrtY, CrtZ, CrtW, Amp+ This study
pTM0906pETDuet‐1pT7: CrtEsp3, CrtI, CrtB, pT7: Idi, CrtY, CrtZ, CrtW, Amp+ This study
pTM2930pETDuet‐1pT7: CrtE, CrtI, CrtB, pT7: Idi, CrtY, CrtZsp1, CrtW, Amp+ This study
pTM1181pETDuet‐1pT7: CrtE, CrtI, CrtB, pT7: Idi, CrtY, CrtZsp2, CrtW, Amp+ This study
pMH1pBBR1MCS‐1plac: AtoB, ERG13 and N‐terminal His‐tagged tHMG1, Cam+ 26
pFZ81pBBR1MCS‐2plac: ERG12, ERG8, MVD1 and N‐terminal His‐tagged Idi, Kan+ 26
Strains and plasmids used in this study.

Preparation of genomic DNA of Sphingomonas sp. ATCC 55669 for genome sequencing

Sphingomonas sp. ATCC 55669 was cultured overnight. 50 mL of the culture was collected and centrifuged for 5 min at 5000 rpm. The pellet was resuspended in 10 mL distilled water and SET buffer (75 mM NaCl, 25 mM EDTA, 20 mM Tris‐Cl), centrifuged again, and resuspended with 5 mL SET buffer. Next, 450 µL of lysozyme (100 mg/mL) was added, and the cell suspension was incubated in a 37°C water bath for 2 h. 10 µL of RNase A (10 mg/mL) was added, followed by incubation for additional 20 min. Proteinase K (300 µL, 20 mg/mL) was added, and samples were incubated at 37°C in a water bath for 40 min. Finally, 5 mL of 10% SDS was added, followed by incubation at 55°C for 2 h. 2 mL of 5 M NaCl was added, and samples were mixed gently. The DNA mixture was then transferred to a round‐bottomed centrifuge tube, and 10 mL chloroform was added. After thorough, gentle mixing, samples were centrifuged at 12 000 rpm for 10 min. The upper aqueous phase was collected, and 10 mL chloroform was added; the mixing and centrifugation steps were repeated twice. DNA was then precipitated with 0.7 volumes of isopropanol and spooled into a 1.5‐mL centrifuge tube with a glass rod. Finally, DNA was washed with 1 mL of cold 70% ethanol for 1 min, dried, and redissolved in at least 200 µL nuclease‐free water at room temperature.

Genome sequencing, genome assembly, and gene prediction

Both Illumina and PacBio SMRT sequencing technology were used to obtain the genome of strain Sphingomonas sp. ATCC 55669. For Illumina sequencing, gDNA was fragmented to an average size of 350 bp without pre‐PCR amplification, and HiSeq2000 was used for sequencing. For PacBio reads, gDNA was fragmented to an average size of 10 kb, and genome sequencing was performed by PacBio RS with 8 SMRT cells in a 90‐min sequencing reaction. An additional SMRT library was built with gDNA fragmented to an average size of 900 bp. Sequencing of this library was performed by PacBio RS with 14 SMRT cells in two 45‐min sequencing reactions. Both strategies of PacBio‐only consensus calling and hybrid error correction with Illumina reads were performed for PacBio long reads, and BLASR 22 alignment was used with an error rate of 0.25 and minimum overlap length of 40 bp. A SMRT 900 bp library was constructed to help correct PacBio long reads, and final PBcRs were obtained from all SMRT 900 bp library data. Genome assembly was carried out with Celera Assembler 23, and the longest 25‐fold PBcR data were assembled with an utgErrorRate set to 0.05 and minimum overlap length of 40 bp. Genome indel detection was performed with the Burrows‐Wheeler transform algorithm based on short‐read alignment. The SOAPaligner 24 was used with a required identity of more than 95% for Illumina paired‐read alignment. The continuous gap size was limited to 3 bp. Reads Alignment Showing Tools software (RASTools: https://sourceforge.net/projects/rastools) was used for alignment with match‐level and gap flag functions for consensus sequences. Whole‐genome gene predictions were performed using the interpolated Markov model 25. Long ORFs were scanned by GLIMMER 3.0 25 to generate the training set. Start codon usage was then analyzed with the Estimated Locations of Pattern Hits software from http://cbcb.umd.edu/software/ELPH/. Genetic table 11 was used for gene prediction. Figure 2 summarizes the steps taken to integrate Illumina and PacBio SMRT sequencing technology in this study.
Figure 2

Assembly strategy used in this study. Shorter PacBio reads were used to correct indel errors of longer PacBio reads and followed by contig assembly. Illumina reads were used to correct persistent indel errors in contigs.

Assembly strategy used in this study. Shorter PacBio reads were used to correct indel errors of longer PacBio reads and followed by contig assembly. Illumina reads were used to correct persistent indel errors in contigs.

Plasmid construction for astaxanthin functional gene identification

The plasmids used in this study are listed in Table 1, and the primers used are listed in Supporting information, Table S1. The idi gene from E. coli, obtained from pGZI 26, was cloned into the plasmid pETDuet‐1 digested with NdeI and XhoI to yield pFZ87. The crtY and crtZ genes were amplified individually from Pantoea agglomerans 27, 28 with primers Idi‐PagCrtY‐F/CrtZ‐PagCrtY‐R and CrtY‐PagCrtZ‐F/CrtW‐PagCrtZ‐R. The crtW gene from Brevundimonas sp. SD212 29, codon‐optimized and synthesized by Genescript (Torrens Park, South Australia), was amplified with primers CrtZ‐BreCrtW‐F and BreCrtW‐R. The four fragments containing genes idi, crtY, crtZ, and crtW were assembled to yield plasmid pFZ152 via the Gibson method 30. The crtE, crtI, and crtB genes were separately amplified from pFZ21, pFZ22, and pFZ23 31 with primers Duet‐PanCrtE‐F/PanCrtI‐CrtE‐R, PanCrtE‐CrtI‐F/PanCrtB‐CrtI‐R, and PanCrtI‐CrtB‐F/Duet‐EcoRI‐PanCrtB‐R, respectively. Then, these three fragments with genes crtE, crtI, and crtB were assembled via the Gibson method to yield plasmid pFZ112. Finally, the fragment from pFZ112 including genes crtE, crtI, and crtB was digested with NdeI‐EcoRI and inserted into pFZ152 to yield plasmid pFZ153. Plasmid pFZ153, which contained crtEIB‐idi‐crtYZW, served as a positive control to confer astaxanthin production ability in E. coli. For identifying genes involved in astaxanthin biosynthesis from Sphingomonas sp. ATCC 55669, the homologous gene in plasmid pFZ153 was replaced. Specifically, crtE (gene locus_tag in NCBI: MC45_16265), crtE (gene locus_tag in NCBI: MC45_09750), and crtE (gene locus_tag in NCBI: MC45_04155) were amplified from the Sphingomonas sp. ATCC 55669 genome with primers Duet‐Pan3518‐F2/PanCrtI‐3518‐R, Duet‐Pan2108‐F2/PanCrtI‐2108‐R, and Duet‐Pan0906‐F2/PanCrtI‐0906‐R, respectively. The three fragments replaced crtE in plasmid pFZ153 to obtain pTM3518, pTM2108, and pTM0906. crtZ (gene locus_tag in NCBI: MC45_13565) and crtZ (gene locus_tag in NCBI: MC45_05470) were amplified with primers CrtY‐Pag2930‐F/CrtW‐Pag2930‐R and CrtY‐Pag1181‐F2/CrtW‐Pag1181‐R. The two fragments replaced crtZ in pFZ153 to obtain pTM2930 and pTM1181.

Overexpression of exogenous genes in Sphingomonas sp. ATCC 55669

The gene encoding EGFP was amplified from a commercial plasmid using primers EGFP‐A and EGFP‐S; the fragment was inserted into pET28a(+) digested with NdeI and EcoRI to yield pMH42 and then re‐amplified with primers EGFP‐pBBR‐S and EGFP‐pBBR‐A using pMH42 as a template. The amplified fragment was inserted into pBBR1MCS2 digested with KpnI and SacI to yield pTMB2E. EGFP expression was controlled by the IPTG‐inducible lacZ promoter. To transfer pTMB2E to Sphingomonas sp. ATCC 55669, electrocompetent Sphingomonas sp. ATCC 55669 cells were prepared. Cells were grown to an OD600 of 0.2–0.4, collected on ice, washed with ice‐cold 10% glycerol, and collected again by centrifugation. Cells were then washed with ice‐cold 10% glycerol, kept on ice for 30 min, and pelleted by centrifugation. After washing with 10% glycerol, cells were resuspended with 0.04 volumes of ice‐cold 10% glycerol. For each reaction, 120 µL of cells was mixed with 2 µL (about 100 ng) of pTMB2E. Cells were then transferred to a 2‐mm precooled electroporation cuvette, electroporated at 3 kV for 4.5–5 ms, and recovered with 800 µL of #272 medium. Recovered cells were incubated with shaking for 2 h at 26°C, spread on #272 agar plates supplemented with 60 mg/L kanamycin, and cultured at 26°C for at least three days. The resulting clones were picked up and cultured in #272 broth with 60 mg/L kanamycin. When OD600 of the cultures reached 0.6–0.8, IPTG was added to a final concentration of 0.1 mM. After 24 h of cultivation, fluorescence microscopy was used to detect EGFP expression. Sphingomonas sp. ATCC 55669 with pBBR1MCS2 served as the negative control. The target IPP isomerase gene idi was derived from plasmid pGZI 26. The gene was amplified using primers idi‐pET28a‐A and idi‐pET28a‐S and inserted into pBBR1MCS2 digested with BamHI and SacI to yield pTMB2I. With the above‐mentioned electrotransformation method, pTMB2I was transformed into Sphingomonas sp. ATCC 55669.

Astaxanthin identification

To detect astaxanthin from Sphingomonas sp. ATCC 55669, the pellet was collected by centrifugation and extracted with 1:4 (v:v) methanol and acetone by vortexing in the dark until residues were colorless. The supernatant was dried and redissolved in acetone, centrifuged, and transferred to sample vials for LC‐MS detection. The LC‐MS system (LTQ Orbitrap XL [Thermo Scientific, Waltham, MA, USA]) was equipped with a DIONEX Acclaim 120 C18 (4.6 mm × 250 mm × 5 µm) column. The chromatographic conditions were as follows: 95% buffer A (H2O with 0.1% formic acid) and 5% buffer B (acetonitrile with 0.1% formic acid) at 1 mL/min for 2 min; buffer A was decreased to 50% over 13 min and then to 5% over 25 min; maintained at 95% B and 5% A for 15 min; buffer A was increased to 95% over 1 min and maintained until 60 min. MS was carried out using the full scan method to detect the parent ion (597.394).

Shake‐flask fermentation and quantification of astaxanthin produced by Sphingomonas sp. ATCC 55669

Sphingomonas sp. ATCC 55669 and the mutant carrying recombinant pTMB2I or pBBR1MCS2 were cultured in #272 medium at 26.0°C in the dark with shaking (220 rpm). When OD600 of the cultures reached 0.6–0.8, IPTG was added to a final concentration of 0.1 mM. After 60 h, cells were collected for product detection. Astaxanthin was extracted and quantified using LC‐MS (Triple Stage Quadrupole Quantum [Thermo Scientific]) with a Thermo Hypersil Gold C18 (2.1 mm × 100 mm × 5 µm) column. The mobile phases and gradient used were as follows: isocratic 50% buffer A (H2O with 0.1% formic acid) and 50% buffer B (acetonitrile with 0.1% formic acid) at 0.35 mL/min for 2 min; buffer A was decreased to 2% over 8 min and maintained for 8 min; buffer A was increased to 50% over 0.1 min and maintained until 20 min. MS was then carried out using the multiple reaction monitoring (MRM) transition 32 to detect the parent ion (597.394) and product ion (147.200).

Shake‐flask fermentation and quantification of astaxanthin produced by E. coli

During E. coli K‐12 MG1655 (DE3) shake‐flask fermentation, the E. coli strain carrying pMH1 and pFZ81, which overexpressed upstream MVA pathway components provided adequate precursors of IPP and DMAPP 26. Mutants A1–A6, carrying pMH1, pFZ81 and pFZ153, pTM3518, pTM2108, pTM0906, pTM2930, or pTM1181, were cultured in 500‐mL flasks with 200 mL LB supplemented with 34 mg/L chloramphenicol, 50 mg/L kanamycin, and 100 mg/L ampicillin. The flasks were incubated at 30°C with shaking (200 rpm). When OD600 reached 0.7–0.9, IPTG was added to a final concentration of 0.1 mM. 8 h later, 2 mL of each culture was harvested, and astaxanthin was extracted as described above. The Dionex UltiMate 3000 HPLC system (Thermo Scientific) equipped with a Dionex Acclaim 120 C18 (4.6 mm × 250 mm × 5 µm) column was used for astaxanthin quantification. Samples were separated with mobile phases A (H2O with 0.1% trifluoroacetic acid) and B (acetonitrile with 0.1% trifluoroacetic acid) at a flow rate of 1.0 mL/min. The column temperature was 25°C, and detection occurred at 478 nm. Mobile phase compositions varied as follows: 0 min, 50% B; 5 min, 100% B; 20 min, 100% B; 25 min, 50% B; 27 min, 50% B.

Results

Confirmation of astaxanthin production in Sphingomonas sp. ATCC 55669

The product isolated from Sphingomonas sp. ATCC 55669 cells cultured in #272 nutrient glucose medium was analyzed by LC‐MS, and a molecule at m/z (597.392) was obtained at the same retention time as the astaxanthin standard (Supporting information, Fig. S1), suggesting that this strain naturally produced astaxanthin. Quantitative analysis indicated that Sphingomonas sp. ATCC 55669 produced 1.2 µg/g cdw (cells dry weight) astaxanthin. Sphingomonas astaxanthinifaciens was the first astaxanthin‐producing bacterium identified in this genus 33, and thus, we explored Sphingomonas sp. ATCC 55669 as another host for astaxanthin production.

Combined PacBio‐Illumina sequencing strategy for the complete genome of Sphingomonas sp. ATCC 55669

The SMRT 10 kbp library was sequenced with PacBio RS, yielding 393 Mbp raw reads (read N50 length: 4028 bp; mean read quality: 0.856). After removal of the adapter sequence, 378 Mbp reads remained (N50 length: 3220 bp; Supporting information, Fig. S2). A SMRT 900 bp library was built to generate more reads to correct the PacBio long reads, and 1.69 Gbp raw reads were achieved by PacBio RS (N50 length: 3824 bp; mean quality: 0.862). After removal of the adapter sequence, 1.3 Gbp reads remained (N50 length: 450 bp). Both 100‐fold coverage Illumina reads and SMRT 900 bp PacBio reads were used to correct SMRT 10 kbp long reads. With 100‐fold coverage from Illumina sequence data, hybrid error correction resulted in 300 Mbp PBcRs (N50 length: 1402 bp), whereas 100‐fold coverage from PacBio data yielded 327 Mbp PBcRs (N50 length: 2647 bp). Thus, the two strategies yielded similar data size, but the hybrid error correction strategy generated half the N50 length (Supporting information, Fig. S3). Final PBcRs were corrected with all data from the SMRT 900 bp library, yielding 327 Mbp reads (N50 length: 2756 bp). Genome assembly was performed with PacBio‐only corrected PBcRs, and three contigs remained with a total size of 4.07 Mbp and an N90 length greater than 3.86 Mbp. Using SOAPaligner, more than 96.85% of Illumina reads aligned to these three contigs with over 95% identity. Each contig showed more than 4 kbp self‐repeat of two edges by dot plot analysis, and PCR verification revealed that the self‐repeat was false; the contigs represented a single copy of the circular genome. Further genome coverage analysis of the Illumina reads showed that the poorly covered genomic area was less than 1000 bp. Forty‐eight single nucleotide insertions were detected with RASTools, and each insertion had at least 14 supported Illumina reads. Twenty‐three low‐coverage regions were verified by Sanger sequencing, and the results indicated that the low coverage in these regions may be explained by the randomly distributed Illumina short read sequencing strategy, whereas some indel errors may persist with the PacBio‐only assembly. With gene prediction, most of insertion sites after correction showed higher similarity with the existing reference sequences (Supporting information, Fig. S4). The final genome contained a 3.86‐Mbp circular chromosome and two circular plasmids with 163 and 39 kbp, respectively, at a GC content of 68.48% (Fig. 3; GenBank CP009571‐CP009573).
Figure 3

The complete genome of Sphingomonas sp. ATCC 55669 contained three circular DNAs: one chromosome (A) and two megaplasmids (B, C). From innermost to outermost circles: GC content; GC‐skew ([G – C] / [G + C]); methylation information of forward sequences (red, m4C; blue, 6mA); methylation information of reverse sequences (pink, m4C; cyan, 6mA); predicted genes color‐coded by COG categories of forward and reverse strands, respectively; and genes related to the non‐mevalonate pathway and carotenoid biosynthesis. All genes putatively involved in astaxanthin biosynthesis “ (gene names noted in the outer circle) showed a discrete distribution throughout the chromosome.

Bioinformatics analysis and engineering of astaxanthin biosynthetic pathway of Sphingomonas sp. ATCC 55669 by expressing an exogenous idi gene

From the complete genome sequence, 4023 putative ORFs were identified. All the predicted genes involved in astaxanthin biosynthesis could be found uniquely and with high homology. The homology of the predicted genes with annotations in the NCBI nr database is listed in Supporting information, Table S2. The upstream biosynthetic pathway belonged to the MEP pathway, and the downstream astaxanthin biosynthetic pathway was the same as that found in most bacteria. Among the predicted genes, ispDF had the highest homology with strain Sphingomonas wittichii RW1, which had both 2‐C‐methyl‐D‐erythritol 4‐phosphate cytidylyltransferase and 2‐C‐methyl‐D‐erythritol 2,4‐cyclodiphosphate synthase domains. In addition, all genes related to astaxanthin biosynthesis were unique, except idi, crtE, and crtZ. The idi gene, which encodes a key enzyme in the isoprenoid pathway, was absent from this system. In order to exploit the astaxanthin‐producing strain Sphingomonas sp. ATCC 55669, we attempted to establish a genetic manuscript method. Detection of EGFP as a reporter for gene expression indicated that the exogenous gene could be transformed into this strain and expressed with the pBBR1MCS ori and promoter Plac (Supporting information, Fig. S5A and S5B). Overexpression of exogenous idi in Sphingomonas sp. ATCC 55669 caused a 5.4‐fold increase in astaxanthin production compared to the strain without exogenous idi (Supporting information, Fig. S5C). The mutant harboring pTMB2I produced 6.91 µg/g cdw astaxanthin, whereas that harboring pBBR1MCS2 only produced 1.29 µg/g cdw. The complete genome of Sphingomonas sp. ATCC 55669 contained three circular DNAs: one chromosome (A) and two megaplasmids (B, C). From innermost to outermost circles: GC content; GC‐skew ([G – C] / [G + C]); methylation information of forward sequences (red, m4C; blue, 6mA); methylation information of reverse sequences (pink, m4C; cyan, 6mA); predicted genes color‐coded by COG categories of forward and reverse strands, respectively; and genes related to the non‐mevalonate pathway and carotenoid biosynthesis. All genes putatively involved in astaxanthin biosynthesis “ (gene names noted in the outer circle) showed a discrete distribution throughout the chromosome. Two crtZ candidates (crtZ and crtZ ) were homologous to strains Pantoea ananatis and Paracoccus sp. N81106 at about 60% identity. For crtE, no candidates with at least 30% identity were found. Candidates with between 20 and 30% identity were compared in terms of catalytic function domains. Finally, three crtE candidates (crtE , crtE , and crtE ) were chosen (Supporting information, Table S2). These candidates all catalyzed IPP condensation into the substrate. In the carotenoid biosynthetic pathway, FPP synthase, with high homology to crtE , could catalyze the condensation of IPP and DMAPP to FPP, a precursor for GGPP synthesis. In addition, octaprenyl diphosphate synthase, with high homology to crtE , and isoprenyl transferase, with high homology to crtE , could also catalyze the formation of FPP from IPP.

Heterologous overproduction for critical genes involved in the astaxanthin biosynthesis of Sphingomonas sp. ATCC 55669 in E. coli

In order to identify the predicted function of genes involved in astaxanthin biosynthesis, a carotenoid synthesis platform was constructed in E. coli. According to bioinformatic analysis of astaxanthin biosynthetic genes, two crtZ candidates (crtZ and crtZ ) and three crtE candidates (crtE , crtE , and crtE ) were identified. The result indicated that only one candidate (crtE ) had crtE function, whereas both crtZ and crtZ had crtZ function. Importantly, the nonfunctional crt genes in the system did not have corresponding roles, illustrating that the functional genes carried out the catalytic functions of the known genes in the proposed pathway. With the high‐efficiency targeted engineering carotenoid synthesis platform, four astaxanthin‐producing strains, i.e. A1, A2, A5, and A6 (harboring pFZ153, pTM3518, pTM2930, and pTM1181, respectively) produce 6.6, 4.44, 1.12, and 2.79 mg/g cdw astaxanthin, respectively, at 8 h after induction (Fig. 4).
Figure 4

Targeted engineering E. coli for astaxanthin overproduction. (A) Over‐production of the intermediates IPP and DMAPP was achieved by endogenous acetyl‐CoA via the MVA pathway. AtoB (acetyl‐CoA acetyltransferase) and Idi were from E. coli. Erg13 (3‐hydroxy‐3‐methylglutaryl‐CoA synthase), tHMG1 (a truncated version of 3‐hydroxy‐3‐methylglutaryl‐CoA reductase), ERG12 (mevalonate kinase), ERG8 (phosphomevalonate kinase), and MVD1 (mevalonate pyrophosphate decarboxylase) were from Saccharomyces cerevisiae. Plasmids pMH1 and pFZ81 were controlled by the Plac promoter. (B) Overproduction of astaxanthin from intermediates IPP and DMAPP via the carotenoid pathway. CrtEIB was from Pantoea ananatis; crtYZ was from Pantoea agglomerans; crtW was from Brevundimonas sp. SD212; idi was from E. coli; and crtEs , crtE , crtE , crtZ , and crtZ were from Sphingomonas sp. ATCC 55669. Gene expression was controlled by the PT7 promoter. (C) Production of astaxanthin in engineered strains from A1 to A6 detected by HPLC.

Targeted engineering E. coli for astaxanthin overproduction. (A) Over‐production of the intermediates IPP and DMAPP was achieved by endogenous acetyl‐CoA via the MVA pathway. AtoB (acetyl‐CoA acetyltransferase) and Idi were from E. coli. Erg13 (3‐hydroxy‐3‐methylglutaryl‐CoA synthase), tHMG1 (a truncated version of 3‐hydroxy‐3‐methylglutaryl‐CoA reductase), ERG12 (mevalonate kinase), ERG8 (phosphomevalonate kinase), and MVD1 (mevalonate pyrophosphate decarboxylase) were from Saccharomyces cerevisiae. Plasmids pMH1 and pFZ81 were controlled by the Plac promoter. (B) Overproduction of astaxanthin from intermediates IPP and DMAPP via the carotenoid pathway. CrtEIB was from Pantoea ananatis; crtYZ was from Pantoea agglomerans; crtW was from Brevundimonas sp. SD212; idi was from E. coli; and crtEs , crtE , crtE , crtZ , and crtZ were from Sphingomonas sp. ATCC 55669. Gene expression was controlled by the PT7 promoter. (C) Production of astaxanthin in engineered strains from A1 to A6 detected by HPLC. From this analysis, we clarified the entire astaxanthin biosynthesis pathway and found that the genes involved in astaxanthin biosynthesis were distributed throughout the chromosome of Sphingomonas sp. ATCC 55669 (Fig. 3).

Discussion

In this study, we found that Sphingomonas sp. ATCC 55669 naturally produced astaxanthin. Astaxanthin is beneficial to human health, enhancing immune system function and providing protection against lipid‐membrane oxidation and DNA damage. The demand for natural astaxanthin is expected to grow rapidly, with an annual worldwide market estimated at US$200 million 34. However, natural bioresources of astaxanthin are limited. Therefore, exploration of novel resources for production of natural astaxanthin may help meet this demand.

A combined PacBio‐Illumina approach for highly accurate complete genome sequence of Sphingomonas sp. ATCC 55669

A combined PacBio‐Illumina approach, which provides an error correction for the draft genome produced by PacBio with Illumina reads, generated a complete, highly accurate genome sequence. Illumina sequencing technology contains the least indel error and has multi‐fold coverage, Indel errors could be cut down to zero, as could base substitutions, yielding accurate result for many comparison purposes. However, there were still many gaps, preventing detailed analysis. PacBio sequencing technology produced long reads, but with 15.4–17.9% randomly distributed indel errors 35, 36. Most error correction strategies have used TNGS technologies to produce high‐accuracy reads to correct SMRT long reads 19. However, optimized traditional sequence alignment strategies are generally used to detect substitution errors. The interference of indel errors created inconsistencies between PacBio long and short reads. We used BLASR to align indel‐rich PacBio long reads, and PBcR accuracy significantly increased, allowing overlap extension after error correction. RASTools reduced the indel errors of PacBio long reads, but we preferred to use error correction for the draft genome produced by PacBio with Illumina reads. This sequencing technology combination and assembly method allowed us to obtain the complete genome sequence.

Astaxanthin biosynthetic genes and genes distribution of Sphingomonas sp. ATCC 55669

All genes involved in astaxanthin biosynthesis of Sphingomonas sp. ATCC 55669 were described herein. In the carotenoid biosynthetic pathway, the MEP pathway is upstream of the precursors IPP and DMAPP for astaxanthin production. Bacteria harboring the MEP pathway have been shown to lack the idi gene 37. In carotenoid biosynthesis, idi, which balances IPP and DMAPP levels, has been shown to control carbon flow into downstream pathways 26. However, idi was absent in Sphingomonas sp. ATCC 55669. Interestingly, overexpression of idi in this train increased astaxanthin production by nearly 5.4‐fold, supporting the important role of idi on carotenoid production in Sphingomonas sp. ATCC 55669. With the functional identification platform in E. coli, crtE , crtZ , and crtZ were shown to have corresponding functions in astaxanthin biosynthesis. CrtE had about 43% protein identity with FPP synthase, which catalyzes IPP and DMAPP to form FPP; FPP then absorbs IPP to generate GGPP. In this case, we found that crtE functioned to condense IPP and DMAPP to GGPP. CrtZ and crtZ had low specificity, consistent with a previous study 38. Thus, this first study of the astaxanthin biosynthetic pathway in Sphingomonas revealed additional astaxanthin biosynthetic components for bioengineering. Astaxanthin biosynthetic genes were distributed throughout the chromosome in Sphingomonas sp. ATCC 55669. In general, most genes involved in natural product production exist in the form of gene clusters, as has been described for astaxanthin production 39. However, the genome of Sphingomonas sp. ATCC 55669 was unique, with discrete distribution of the genes.

Production of astaxanthin in engineered E. coli and engineered Sphingomonas sp. ATCC 55669

In our targeted engineering carotenoid system constructed in E. coli, we produced up to 6.6 mg/g cdw astaxanthin, which was lower than the yield reported for the engineered microbe Xanthophyllomyces dendrorhous (up to 9 mg/g cdw) 40, but higher than the production in E. coli (2 mg/g cdw) 41. Thus, this targeted engineering method effectively increased astaxanthin production; we were even able to achieve production of 8.64 mg/g cdw astaxanthin in the A1 mutant at 4 h after induction. In order to express exogenous genes in Sphingomonas sp. ATCC 55669, a genetic manipulation method was successfully established using a simple method that shared some elements, such as plasmids and promoters, with E. coli. This may facilitate future attempts at genetic engineering in this genus. Indeed, Sphingomonas has recently attracted attention in the field of environmental microbiology as a potential metabolizer for a broad range of organic compounds, particularly refractory environmental pollutants, and for production of useful exopolysaccharides and carotenoids. Thus, this genetic method could be not only used in degradative engineering but also for carotenoid overproduction. In conclusion, we demonstrated that Sphingomonas sp. ATCC 55669 naturally produced astaxanthin. The complete genome was sequenced using PacBio‐Illumina sequencing. Subsequently, the entire astaxanthin biosynthetic pathway, including some new astaxanthin biosynthesis components, was identified. Genes were discretely distributed throughout the chromosome. We believe this work provides important insights into the identification of natural product biosynthetic pathways based on sequencing technology, particularly for cases in which genes are discretely distributed or even absent. Moreover, our study described an additional genetic source for astaxanthin production. As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors. Supporting Information Click here for additional data file.
  35 in total

1.  Performance comparison of benchtop high-throughput sequencing platforms.

Authors:  Nicholas J Loman; Raju V Misra; Timothy J Dallman; Chrystala Constantinidou; Saheer E Gharbia; John Wain; Mark J Pallen
Journal:  Nat Biotechnol       Date:  2012-05       Impact factor: 54.908

2.  Mapping of a carotenogenic gene cluster from Erwinia herbicola and functional identification of six genes.

Authors:  G Schnurr; A Schmidt; G Sandmann
Journal:  FEMS Microbiol Lett       Date:  1991-03-01       Impact factor: 2.742

3.  Production of metabolites in the biodegradation of phenanthrene, fluoranthene and pyrene by the mixed culture of Mycobacterium sp. and Sphingomonas sp.

Authors:  Yin Zhong; Tiangang Luan; Li Lin; Hong Liu; Nora F Y Tam
Journal:  Bioresour Technol       Date:  2010-10-30       Impact factor: 9.642

4.  SOAP2: an improved ultrafast tool for short read alignment.

Authors:  Ruiqiang Li; Chang Yu; Yingrui Li; Tak-Wah Lam; Siu-Ming Yiu; Karsten Kristiansen; Jun Wang
Journal:  Bioinformatics       Date:  2009-06-03       Impact factor: 6.937

5.  Genetic engineering of the complete carotenoid pathway towards enhanced astaxanthin formation in Xanthophyllomyces dendrorhous starting from a high-yield mutant.

Authors:  Sören Gassel; Jürgen Breitenbach; Gerhard Sandmann
Journal:  Appl Microbiol Biotechnol       Date:  2013-11-17       Impact factor: 4.813

6.  Aerobic degradation of diethyl phthalate by Sphingomonas sp.

Authors:  Herbert H P Fang; Dawei Liang; Tong Zhang
Journal:  Bioresour Technol       Date:  2006-03-23       Impact factor: 9.642

7.  Proposal of the genus Sphingomonas sensu stricto and three new genera, Sphingobium, Novosphingobium and Sphingopyxis, on the basis of phylogenetic and chemotaxonomic analyses.

Authors:  M Takeuchi; K Hamana; A Hiraishi
Journal:  Int J Syst Evol Microbiol       Date:  2001-07       Impact factor: 2.747

8.  The origin of the Haitian cholera outbreak strain.

Authors:  Chen-Shan Chin; Jon Sorenson; Jason B Harris; William P Robins; Richelle C Charles; Roger R Jean-Charles; James Bullard; Dale R Webster; Andrew Kasarskis; Paul Peluso; Ellen E Paxinos; Yoshiharu Yamaichi; Stephen B Calderwood; John J Mekalanos; Eric E Schadt; Matthew K Waldor
Journal:  N Engl J Med       Date:  2010-12-09       Impact factor: 91.245

9.  Isolation of a beta-carotene over-producing soil bacterium, Sphingomonas sp.

Authors:  C Silva; J M S Cabral; F van Keulen
Journal:  Biotechnol Lett       Date:  2004-02       Impact factor: 2.461

10.  Aggressive assembly of pyrosequencing reads with mates.

Authors:  Jason R Miller; Arthur L Delcher; Sergey Koren; Eli Venter; Brian P Walenz; Anushka Brownley; Justin Johnson; Kelvin Li; Clark Mobarry; Granger Sutton
Journal:  Bioinformatics       Date:  2008-10-24       Impact factor: 6.937

View more
  16 in total

1.  Genomics-driven discovery of the biosynthetic gene cluster of maduramicin and its overproduction in Actinomadura sp. J1-007.

Authors:  Ran Liu; Fang Fang; Ziheng An; Renqiong Huang; Yong Wang; Xiao Sun; Shuai Fu; Aisi Fu; Zixin Deng; Tiangang Liu
Journal:  J Ind Microbiol Biotechnol       Date:  2019-12-18       Impact factor: 3.346

2.  Combinatorial expression of different β-carotene hydroxylases and ketolases in Escherichia coli for increased astaxanthin production.

Authors:  Yuanqing Wu; Panpan Yan; Xuewei Liu; Zhiwen Wang; Ya-Jie Tang; Tao Chen; Xueming Zhao
Journal:  J Ind Microbiol Biotechnol       Date:  2019-07-11       Impact factor: 3.346

Review 3.  Marine Microbial-Derived Resource Exploration: Uncovering the Hidden Potential of Marine Carotenoids.

Authors:  Ray Steven; Zalfa Humaira; Yosua Natanael; Fenny M Dwivany; Joko P Trinugroho; Ari Dwijayanti; Tati Kristianti; Trina Ekawati Tallei; Talha Bin Emran; Heewon Jeon; Fahad A Alhumaydhi; Ocky Karna Radjasa; Bonglee Kim
Journal:  Mar Drugs       Date:  2022-05-26       Impact factor: 6.085

4.  A regulated synthetic operon facilitates stable overexpression of multigene terpenoid pathway in Bacillus subtilis.

Authors:  Ingy I Abdallah; Dan Xue; Hegar Pramastya; Ronald van Merkerk; Rita Setroikromo; Wim J Quax
Journal:  J Ind Microbiol Biotechnol       Date:  2020-01-01       Impact factor: 3.346

Review 5.  In vitro reconstitution guide for targeted synthetic metabolism of chemicals, nutraceuticals and drug precursors.

Authors:  Gao-Yi Tan; Faying Zhu; Zixin Deng; Tiangang Liu
Journal:  Synth Syst Biotechnol       Date:  2016-04-01

Review 6.  Metabolic engineering for the microbial production of isoprenoids: Carotenoids and isoprenoid-based biofuels.

Authors:  Fu-Xing Niu; Qian Lu; Yi-Fan Bu; Jian-Zhong Liu
Journal:  Synth Syst Biotechnol       Date:  2017-08-30

7.  Metabolic Engineering of Escherichia coli for Producing Astaxanthin as the Predominant Carotenoid.

Authors:  Qian Lu; Yi-Fan Bu; Jian-Zhong Liu
Journal:  Mar Drugs       Date:  2017-09-22       Impact factor: 5.118

8.  Assessment of Expression Cassettes and Culture Media for Different Escherichia coli Strains to Produce Astaxanthin.

Authors:  Shun Li; Jun-Chao Huang
Journal:  Nat Prod Bioprospect       Date:  2018-06-06

9.  Astaxanthin overproduction in yeast by strain engineering and new gene target uncovering.

Authors:  Jin Jin; Ying Wang; Mingdong Yao; Xiaoli Gu; Bo Li; Hong Liu; Mingzhu Ding; Wenhai Xiao; Yingjin Yuan
Journal:  Biotechnol Biofuels       Date:  2018-08-23       Impact factor: 6.040

10.  Enhancement of Astaxanthin Biosynthesis in Oleaginous Yeast Yarrowia lipolytica via Microalgal Pathway.

Authors:  Larissa Ribeiro Ramos Tramontin; Kanchana Rueksomtawin Kildegaard; Suresh Sudarsan; Irina Borodina
Journal:  Microorganisms       Date:  2019-10-19
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.