| Literature DB >> 29704042 |
Ewelina Celińska1, Monika Borkowska2, Wojciech Białas2, Paulina Korpys2, Jean-Marc Nicaud3.
Abstract
Upon expression of a given protein in an <span class="Species">expression host, its secretion into the culture medium or cell-surface display is frequently advantageous in both research and industrial contexts. Hence, engineering strategies targeting folding, trafficking, and secretion of the proteins gain considerable interest. Yarrowia lipolytica has emerged as an efficient protein expression platform, repeatedly proved to be a competitive secretor of proteins. Although the key role of signal peptides (SPs) in secretory overexpression of proteins and their direct effect on the final protein titers are widely known, the number of reports on manipulation with SPs in Y. lipolytica is rather scattered. In this study, we assessed the potential of ten different SPs for secretion of two heterologous proteins in Y. lipolytica. Genomic and transcriptomic data mining allowed us to select five novel, previously undescribed SPs for recombinant protein secretion in Y. lipolytica. Their secretory potential was assessed in comparison with known, widely exploited SPs. We took advantage of Golden Gate approach, for construction of expression cassettes, and micro-volume enzymatic assays, for functional screening of large libraries of recombinant strains. Based on the adopted strategy, we identified novel secretory tags, characterized their secretory capacity, indicated the most potent SPs, and suggested a consensus sequence of a potentially robust synthetic SP to expand the molecular toolbox for engineering Y. lipolytica.Entities:
Keywords: Leader sequences; Modular cloning; Protein expression; Protein secretion; Signal peptides; Yarrowia lipolytica
Mesh:
Substances:
Year: 2018 PMID: 29704042 PMCID: PMC5959983 DOI: 10.1007/s00253-018-8966-9
Source DB: PubMed Journal: Appl Microbiol Biotechnol ISSN: 0175-7598 Impact factor: 4.813
Fig. 1Signal peptides of AEP and LIP2—the major proteins of Y. lipolytica secretome. Color code of the scheme is explained in legend: blue—pre-leader sequence, red—XA/XP dipeptide stretch, yellow—pro-leader, green—dimotif KR, black—mature protein. Arrows indicate sites recognized by specific aminopeptidase (red) operating in the endoplasmic reticulum (ER) or by endoprotease XPR6 (KEX2 homolog) (green), operating in the Golgi apparatus
Signal peptide consensus sequence logos, secondary structure prediction, hydrophobicity assessment, and D score values for (a) the 38 proteins identified through Y. lipolytica CLIB122 genomic sequence scan for ORFs bearing MK x{7–15} (x,A/P){2–10} x{10–120} KR motif and (b) SPs having the highest secretory capacity in Y. lipolytica cells with SoAMY and TlGAMY polypeptides
Amino acid sequences were aligned using MEGA 7.0.14 package and ClustalW algorithm; dashes indicate gaps; ambiguous sites are indicated as (X:Y) X and Y—alternative amino acid residues occurring with the same frequency. The logos were created using Web Logo tool. Color code: black—hydrophobic amino acid residue, blue—positively charged amino acid residue, green—polar uncharged amino acid residue. Size of a letter represents frequency of a corresponding amino acid residue occurrence in a respective position. Hydrophobicity evaluation was done using GRAVY calculator tool and secondary structure prediction—using SOPMA tool. (A) Due to relatively high sequence degeneration SOPMA and GRAVY analyses were done for one possible variant; (B) Both analyses were done for the two possible variants of the consensus sequence
Signal peptides under study accompanied by the results of computational analyses
aSOPMA: prediction of secondary structure; : alpha-helix,: beta turn, : random coil, : extended strand
bD score is used to discriminate signal peptides from non-signal peptides based on probability of the presence of a signal peptidase cleavage site; SignalP
cGRAVY value: grand average hydropathy calculated for 12 residues after the last positively charged residue of the n-region (HB12) or complete SP
dSignalP calculates signal peptide of 27 AA (MKLSTILFTACATLAAALPSPITPSEA-A V); PrediSi tool predicts the pre region of 17 AA (as reported in primary scientific papers (Pigniede et al., 2000), and this pre-sequence was analyzed in this study
Fig. 2Golden Gate Assembly Scaffold. The scheme illustrates a scaffold of GGVAs designed and constructed in this study (Golden Gate Vectors bearing complete Assemblies; pSB1A3 backbone) alongside with Golden Gate donor Vectors (GGVs; TOPO backbone) bearing individual elements (Golden Gate Fragments (GGFs)) to be ultimately assembled into different GGVA variants, depending on the GGFs type. The complete GGVA bears seven elements: InsUP/InsDOWN—zeta elements serving as the target regions for non-homologous recombination events flanked with NotI recognition sites; M—selection marker URA3; P1—promoter pTEF1; SP—signal peptide, one of the ten SPs under study; G1—gene of interest, one of the two SoAMY or TlGAMY devoid of their native SP; T3—terminator tLip2. Symbols corresponding to GGFs follow Synthetic Biology Open Language (SBOL) standard. Each GGF is flanked with a 4-nt overhang assigned a respective letter and a 4-nt sequence indicated in the scheme (A, B, C, D, X, K, L, and M). Altogether 20 complete GGVAs were obtained. Prior to transformation GGA was released from GGVA through digestion with NotI endonuclease
Fig. 3Relative amylolytic activity of SoAMY and TlGAMY in the batch culture medium supernatants of recombinant Y. lipolytica strains, transformed with GGAs bearing different SP-encoding sequences. The values constitute a mean from three independent runs of five subclone cultures, representing a corresponding GGA variant. The amylolytic activity was expressed as a SQRT% value in relation to the positive control strain (derivative of Po1g bearing pYLSC-SoAMY/TlGAMY; see Online Resource ESM_2). Grouping of the SPs under study depending on their secretory capacity towards SoAMY and TlGAMY proteins was conducted using Scheffe test. Error bars represent ±SD of three independent runs of five subclones representing a given GGA variant