| Literature DB >> 27495311 |
Kerry R Love1, Kartik A Shah1, Charles A Whittaker2, Jie Wu2, M Catherine Bartlett1, Duanduan Ma2, Rachel L Leeson1, Margaret Priest3, Jonathan Borowsky2, Sarah K Young3, J Christopher Love4,5.
Abstract
BACKGROUND: Pichia pastoris has emerged as an important alternative host for producing recombinant biopharmaceuticals, owing to its high cultivation density, low host cell protein burden, and the development of strains with humanized glycosylation. Despite its demonstrated utility, relatively little strain engineering has been performed to improve Pichia, due in part to the limited number and inconsistent frameworks of reported genomes and transcriptomes. Furthermore, the co-mingling of genomic, transcriptomic and fermentation data collected about Komagataella pastoris and Komagataella phaffii, the two strains co-branded as Pichia, has generated confusion about host performance for these genetically distinct species. Generation of comparative high-quality genomes and transcriptomes will enable meaningful comparisons between the organisms, and potentially inform distinct biotechnological utilies for each species.Entities:
Keywords: Cultivation dependent expression; Gene Set Enrichment Analysis (GSEA); Komagataella pastoris; Komagataella phaffii; Pichia pastoris; RNA-Seq; Secretome; Self-Organizing Maps (SOMs); Transcriptome
Mesh:
Substances:
Year: 2016 PMID: 27495311 PMCID: PMC4974788 DOI: 10.1186/s12864-016-2876-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Genome assembly and annotation statistics for major chromosomes
|
|
| ||
|---|---|---|---|
| WT | GS115 | ||
| Genome Size (Mb) | 9.6 | 9.4 | 9.4 |
| Chromosomes | 4 | 4 | 4 |
| Contigs | 11 | 7 | 9 |
| Pacbio Coverage | 168x | 118x | 207x |
| Illumina Coverage | 312x | 1869x | 1498x |
| Coding (%) | 78.6 | 79.9 | 79.5 |
| Coding Genes | 5241 | 5167 | 5183 |
| tRNA Genes | 122 | 123 | 123 |
| 5S rRNA Genes | 23 | 21 | 21 |
| GC% | 41.5 % | 41.3 % | 41.3 % |
Fig. 1Comparative genome structure of K. pastoris and K. phaffii. Circos plot indicating the sequence alignment between K. pastoris and K. phaffii marked with methanol utilization pathway (MUT) genes. Functional genetic elements marked on the plots include: small rDNA subunits (white circles), large rDNA subunits (black circles), and telomeric repeats (orange triangles)
Fig. 2Linear plasmid annotation and expression in K. phaffii. a) Schematic representation of the 11 kb linear plasmid annotated with seven genes homologous to the K. lactis killer plasmid. b) Comparison of gene expression between genes located on the killer plasmid in i) wild type K. phaffii or ii) K. phaffii GS115 and the average gene expression among chromosomally-located genes in each species during cultivation on three different carbon sources
Fig. 3Gene expression as a function of chromosomal location. Map of chromosomal location (base pair identity) for the most highly expressed genes (top 10 % expression) in a) K. pastoris and b) K. phaffii. Black lines indicate gene expression level at 24 h time point during batch cultivation in methanol. Red lines indicate locations of GC-rich autonomously replicating sequence (GC-ARS) motifs identified by BLAST. c) Box and whisker plot of the relative distance to the nearest GC-ARS motif in K. pastoris and K. phaffii for genes expressed at average levels genome-wide (45–55 % of max expression) and for the most highly expressed genes (top 10 % expression). Histograms of relative distances to GC-ARS motifs are shown alongside box plots for each gene set analyzed
Fig. 4Gene expression in K. pastoris and K. phaffii as a function of cultivation conditions. a) Heat map of gene expression (log2 fpkm) for the most highly expressed genes at in both strains. Gene expression is shown as a function of batch growth in glycerol, glucose or methanol during a 48 h cultivation period. b) Venn diagrams depicting the intersection between K. pastoris (orange) and K. phaffii (green) for genes that are highly (top 10 % expression) and differentially expressed (log2-fold change > 2, p < 0.05) during fermentation on a particular carbon source. Circle size is proportional to the total number of genes present for a given condition
Fig. 5Biological process enrichment as a function of cultivation in methanol. Heat map representation of the enrichment of GO biological process terms for expression phenotypes observed in K. pastoris and K. phaffii during a 48 h batch cultivation in methanol as characterized by self-organizing maps (SOMs). Representative temporal trajectories of gene expression were generated for each SOM by averaging expression data at each time point for genes present within a given map. Color density relates to the number of genes assigned to a particular process as a percentage of the total number genes present in a particular expression phenotype or map
Fig. 6Locations of mutations found in GS115 and phenotypic differences observed between GS115 and wildtype K. phaffii. a) The chromosomal locations of the 35 single nucleotide polymorphisms (SNPs) found in GS115 relative to wildtype K. phaffii. b) Growth curve of Komagatella strains on glucose media. Wildtype K. phaffii growth data is indicated with squares and GS115 growth data is indicated with triangles. Data shown for each strain is the mean from triplicate measurements. Error bars indicate 95 % confidence intervals. c Kill curve of Komagatella strains following exposure to UV light. Data shown for each strain is the mean from two experiments, each run in triplicate. Error bars are the standard deviation across all data collected for both experiments