| Literature DB >> 22174787 |
Lei Wang1, Wei Chen, Yun Feng, Yan Ren, Zhennan Gu, Haiqin Chen, Hongchao Wang, Michael J Thomas, Baixi Zhang, Isabelle M Berquin, Yang Li, Jiansheng Wu, Huanxin Zhang, Yuanda Song, Xiang Liu, James S Norris, Suriguga Wang, Peng Du, Junguo Shen, Na Wang, Yanlin Yang, Wei Wang, Lu Feng, Colin Ratledge, Hao Zhang, Yong Q Chen.
Abstract
Mortierella alpina is an oleaginous fungus which can produce lipids accounting for up to 50% of its dry weight in the form of triacylglycerols. It is used commercially for the production of arachidonic acid. Using a combination of high throughput sequencing and lipid profiling, we have assembled the M. alpina genome, mapped its lipogenesis pathway and determined its major lipid species. The 38.38 Mb M. alpina genome shows a high degree of gene duplications. Approximately 50% of its 12,796 gene models, and 60% of genes in the predicted lipogenesis pathway, belong to multigene families. Notably, M. alpina has 18 lipase genes, of which 11 contain the class 2 lipase domain and may share a similar function. M. alpina's fatty acid synthase is a single polypeptide containing all of the catalytic domains required for fatty acid synthesis from acetyl-CoA and malonyl-CoA, whereas in many fungi this enzyme is comprised of two polypeptides. Major lipids were profiled to confirm the products predicted in the lipogenesis pathway. M. alpina produces a complex mixture of glycerolipids, glycerophospholipids and sphingolipids. In contrast, only two major sterol lipids, desmosterol and 24(28)-methylene-cholesterol, were detected. Phylogenetic analysis based on genes involved in lipid metabolism suggests that oleaginous fungi may have acquired their lipogenic capacity during evolution after the divergence of Ascomycota, Basidiomycota, Chytridiomycota and Mucoromycota. Our study provides the first draft genome and comprehensive lipid profile for M. alpina, and lays the foundation for possible genetic engineering of M. alpina to produce higher levels and diverse contents of dietary lipids.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22174787 PMCID: PMC3234268 DOI: 10.1371/journal.pone.0028319
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Asexual lifecycle and lipid staining of M. alpina.
(A) Asexual lifecycle of the fungus. Haploid cells form sporangiophores, and sporangiospores germinate to hypha. (B) Fungal culture grown on PDA plate stained with 0.5% triphenoltetrazolium chloride. Lipid droplets are stained brown.
Features of the M. alpina genome.
|
| |
|
|
|
| Number of scaffolds larger than 2 kb | 476 |
| GC content (%) | 51.72 |
| Length of classified repeats (%) | 1.76 |
|
|
|
| Average gene length (bp) | 1847 |
| Average transcript length (bp) | 1504 |
| Number of single-exon genes | 3311 |
| Average number of introns per multi-exon gene | 3.32 |
| Average exon size (bp) | 435 |
| Average intron size (bp) | 140 |
| Number of transposase-related genes | 135 |
| Number of tRNA genes | 228 |
|
|
|
| NR alignment | 8552 |
| KEGG alignment | 8382 |
| KOG/COG assignment | 7066 |
| EC assignment | 3290 |
| KO assignment | 6415 |
| GO assignment | 5800 |
| InterPro signature | 8740 |
| Signal peptide | 2041 |
| Transmembrane domain | 2286 |
Figure 2Distribution of predicted M. alpina proteins among functional groups.
Gene prediction and annotation were performed as described in the Materials and Methods. KOG category assignment is shown. X-axis represents percent of predicted gene models.
Comparative analysis of multi-gene families.
| Species | Genes in the genome | Genes in multigene family (%) |
|
| ||
|
| 11,719 | 72.00 |
|
| 17,459 | 63.66 |
|
| 12,796 | 50.34 |
|
| ||
|
| 15,707 | 62.46 |
|
| 17,817 | 62.35 |
|
| 10,950 | 55.25 |
|
| 11,643 | 52.57 |
|
| 14,169 | 49.79 |
|
| 11,100 | 49.18 |
|
| 10,148 | 47.88 |
|
| 9,121 | 43.57 |
|
| 12,074 | 43.56 |
|
| 10,046 | 43.36 |
|
| 9,116 | 42.63 |
|
| 5,178 | 40.71 |
|
| 14,010 | 40.19 |
|
| 9,541 | 39.89 |
|
| 11,656 | 38.98 |
|
| 10,406 | 38.75 |
|
| 5,880 | 36.87 |
|
| 14,102 | 35.37 |
|
| 9,630 | 35.28 |
|
| 5,336 | 34.50 |
|
| 5,816 | 31.31 |
|
| 5,003 | 30.68 |
|
| 5,191 | 30.65 |
|
| 16,360 | 30.62 |
|
| 6,472 | 30.05 |
|
| 6,324 | 29.90 |
|
| 10,272 | 29.51 |
|
| 6,017 | 28.04 |
|
| 5,335 | 23.41 |
|
| 14,446 | 22.48 |
|
| 16,389 | 21.58 |
|
| 9,824 | 21.23 |
|
| 4,725 | 20.53 |
|
| ||
|
| 18,215 | 53.82 |
|
| 10,048 | 46.20 |
|
| 6,273 | 27.02 |
|
| 6,538 | 23.05 |
|
| 4,286 | 19.18 |
|
| ||
|
| 8,700 | 49.33 |
Figure 3Map of M. alpina mitochondrial genome.
The outer circle represents the scale in kb. From the outside in, circles 1 and 2 indicate the location of ORF transcribed in a clockwise and counter-clockwise direction, respectively. Transcripts were predicted according to KOG categories. Circle 3 depicts tRNA and noncoding RNA.
Figure 4Phylogenetic analysis.
Cladogram based on genes involved in fatty acid metabolic pathway. Orthologous proteins were defined as reciprocal best hit proteins with a minimum of 50% identity and 70% of the length of the query protein as calculated by the BLAST algorithm with a threshold value of E≤1×10−5. Phylogenies were analyzed based on 12 orthologous proteins identified among 43 genera and inferred from the resulting 98,765 amino character alignment using the Neighbor-Joining method. Numbers on the left are percent of bootstrapping, performed on 1000 replicates with alpha parameters and the fraction of invariant sites estimated once from the original data. Phytophthora infestans T30-4 in the Oomycetes was used as the outgroup. Scale bar shows evolutionary distances in replacements per site. The orthologous proteins used for this analysis were carnitine O-acetyltransferases, fatty acid desaturase 9, phospholipid∶diacylglycerol acyltransferase, lysophosphatidate acyltransferase, acetyl-CoA carboxylase, 3-oxoacyl-[acyl-carrier-protein] synthase II, fatty acid desaturase 6I and 6II, fatty acid synthase, and 5-aminolevulinate synthase. Numbers in parentheses are reported lipid production in the corresponding fungus. Question marks indicate species in which lipid production is uncertain. Species in red are oleaginous, with lipid production greater than 20%.
Figure 5Predicted lipogenesis pathway in M. alpina.
The glycolysis, pentose phosphate pathway, fatty acid synthesis, tricarboxylic acid cycle, malate/pyruvate cycle, sterol, glycerophospholipid, sphingolipid and glycerolipid synthesis pathways are outlined. Lipids detected by lipidomics are highlighted in yellow. Enzyme names are indicated and followed in parentheses by the number of genes encoding them. Enzymes conventionally thought to occur in a given pathway but for which homologs were not found in M. alpina are followed by a question mark. Hexokinase (HK, EC2.7.1.1), glucose-6-phosphate 1-dehydrogenase (G6PD, EC1.1.1.49), 6-phosphogluconolactonase (PGLS, EC3.1.1.31), phosphogluconate dehydrogenase (PGD, EC1.1.1.44), ribose 5-phosphate isomerase (RPIA, EC5.3.1.6), ribulose-5-phosphate-3-epimerase (RPE, EC5.1.3.1), transketolase (TKT, EC2.2.1.1), transaldolase (TALDO, EC2.2.1.2), glucose-6-phosphate isomerase (GPI, EC5.3.1.9), phosphofructokinase (PFK, EC2.7.1.11), fructose-bisphosphatase (FBP, EC3.1.3.11), aldolase A fructose-bisphosphate (ALDOA, EC4.1.2.13), glyceraldehyde 3-phosphate dehydrogenase (GAPDH, EC1.2.1.12), phosphoglycerate kinase (PGK, EC2.7.2.3), triose-phosphate isomerase (TPI, EC5.3.1.1), phosphoglycerate mutase (PGAM, EC5.4.2.1), enolase (ENO, EC4.2.1.11), pyruvate kinase (PK, EC2.7.1.40), lactate dehydrogenase (LDH, EC1.1.1.27), phosphoenolpyruvate carboxykinase (PCK, EC4.1.1.32 and EC4.1.1.49), acetyl-CoA carboxylase (ACC, EC6.4.1.2), malonyl-CoA decarboxylase (MLYCD, EC4.1.1.9), fatty acid synthase (FASN, EC2.3.1.86), acyl-CoA synthetase (ACSL, EC6.2.1.3), acyl-CoA thioesterase (ACOT, EC3.1.2.27), fatty acid elongase (ELOVL, EC2.3.1.-), fatty acid delta 9 desaturase (FADS9, EC1.14.19.1), fatty acid delta 12 desaturase (FADS12, EC1.14.19.6), fatty acid delta 15 desaturase (FADS15, EC1.14.19.-), fatty acid delta 6 desaturase (FADS6, EC1.14.19.3), fatty acid delta 5 desaturase (FADS5, EC1.14.19.-), malate dehydrogenase (MDH, EC1.1.1.37), malic enzyme (ME, EC1.1.1.40), pyruvate carboxylase (PC, EC6.4.1.1), ATP-citrate lyase (ACLY, EC2.3.3.8), pyruvate dehydrogenase (PDH, EC1.2.4.1), dihydrolipoamide acetyltransferase (DLAT, EC2.3.1.12), citrate synthase (CS, EC2.3.3.1), aconitase (ACO, EC4.2.1.3), isocitrate dehydrogenase (IDH, EC1.1.1.41 and EC1.1.1.42), glutamate dehydrogenase (GLUD, EC1.4.1.2 and EC1.4.1.3), 2-oxoglutarate dehydrogenase (OGDH, EC1.2.4.2), dihydrolipoamide succinyltransferase (DLST, EC2.3.1.61), succinyl-CoA ligase (SUCLG, EC6.2.1.4 and EC6.2.1.5), succinate dehydrogenase (SDH, EC1.3.5.1), fumarase (FH, EC4.2.1.2), acetyl-CoA acetyltransferase (ACAT, EC2.3.1.9), hydroxymethylglutaryl-CoA synthase (HMGCS, EC2.3.3.10), 3-hydroxy-3-methylglutaryl-CoA reductase (HMGCR, EC1.1.1.34), mevalonate kinase (MVK, EC2.7.1.36), phosphomevalonate kinase (PMVK, EC2.7.4.2), mevalonate pyrophosphate decarboxylase (MVD, EC4.1.1.33), isopentenyl diphosphate isomerase (IDI, EC5.3.3.2), farnesyl diphosphate synthase (FDPS, EC2.5.1.10), farnesyl-diphosphate farnesyltransferase (FDFT, EC2.5.1.21), squalene epoxidase (SQLE, EC1.14.99.7), cycloartenol synthase (CAS, EC5.4.99.8), delta24-sterol methyltransferase (SMT, EC2.1.1.41), cycloeucalenol cycloisomerase (CEI, EC5.5.1.9), sterol 14-demethylase/cytochrome p450 family 51 (CYP51, EC1.14.13.70), delta14-sterol reductase/(ERG24, EC1.3.1.70), cholestenol isomerase/emopamil binding protein (EBP, EC5.3.3.5), sterol C5 desaturase (SC5DL, EC1.14.21.6), 7-dehydrocholesterol reductase (DHCR7, EC1.3.1.21), lanosterol synthase (LSS, EC5.4.99.7), C4 sterol methyl oxidase (SMO, EC1.14.13.72), hydroxysteroid (3-beta) dehydrogenase (HSD3B, EC1.1.1.170), hydroxysteroid (17-beta) dehydrogenase (HSD17B, EC1.1.1.270), serine palmitoyltransferase (SPTLC, EC2.3.1.50), 3-ketodihydrosphingosine reductase (KDSR, EC1.1.1.102), alkaline ceramidase (ACER, EC3.5.1.23), ceramide synthetase (CERS, EC2.3.1.24), sphingolipid delta-4 desaturase/degenerative spermatocyte homolog (DEGS, EC1.14.-.-), phosphatic acid phosphatase (PPAP, EC3.1.3.4), ceramide kinase (CERK, EC2.7.1.138), sphingomyelin phosphodiesterase (SMPD, EC3.1.4.12), sphingomyelin synthase (SGMS, EC2.7.8.27), UDP-glucose ceramide glucosyltransferase (UGCG, EC2.4.1.80), glucosylceramidase (GLCM, EC3.2.1.45), sphingosine kinase (SPHK, EC2.7.1.91), sphingosine-1-phosphate lyase (SGPL, EC4.1.2.27), CDP-diacylglycerol synthase (CDS, EC2.7.7.41), CDP-diacylglycerol inositol 3-phosphatidyltransferase (CDIPT, EC2.7.8.11), phosphatidylserine synthase (PTDSS, EC2.7.8.8), phosphatidylethanolamine methyltransferase (PEMT, EC2.1.1.17), phosphatidylserine decarboxylase (PISD, EC4.1.1.65), choline/ethanolamine phosphotransferase (CEPT, EC2.7.8.1), phospholipase C (PLC, EC3.1.4.3), glycerate kinase (GLYCTK, EC2.7.1.31), aldehyde dehydrogenase (ALDH, EC1.2.1.3), alcohol dehydrogenase (ADH, EC1.1.1.2), glycerol kinase (GK, EC2.7.1.30), glycerol-3-phosphate dehydrogenase (GPD, EC1.1.1.8), glycero-3-phosphate acyltransferase (GPAT, EC2.3.1.15), 1-acylglycerol-3-phosphate acyltransferase (AGPAT, EC2.3.1.51), phospholipase D (PLD, EC3.1.4.4), diacylglycerol kinase (DGK, EC2.7.1.107), diacylglycerol acyltransferase (DGAT, EC2.3.1.20), phospholipid diacylglycerol acyltransferase (PDAT, EC2.3.1.158), triglyceride lipase (GL, EC3.1.1.3).
Figure 6FASN domain comparison.
Enzymatic domain organization of mammalian, S. cerevisiae and M. alpina Fasn is shown. Note: S. cerevisiae Fasn comprises two genes, Fas1 and Fas2, encoding two peptides. AT: acetyltransferase, ER: β-enoyl reductase, DH: dehydratase, MAT: malonyl-CoA-/acetyl-CoA-ACP transacylase, MPT: malonyl/palmitoyl transferase, ACP: acyl carrier protein, KR: β-ketoacyl reductase, KS: β-ketoacyl synthase, PPT: phosphopantetheine transferase, TE: thioesterase. Domains are not drawn to scale.
Total fatty acid distribution*.
| Glucose (g/L) | Yest extract (g/L) | KNO3 (g/L) | Temperature (°C) | SAFA | MUFA | ω6 PUFA | ω3 PUFA | |||||||||||||||||||||
| 14:0 | 16:0 | 18:0 | 20:0 | 22:0 | 24:0 | 14:1 | 16:1 | total 18:1 | oleate | vaccenate | 20:1 | 22:1 | 24:1 | 18:2 | 20:2 | 18:3 n-6 | 20:3 | 20:4 | 18:3 n-3 | 20:5 | 22:5 | 22:6 | 24:5 | 24:6 | ||||
| 20 | 5 | 10 | 25 | 0.4±0.1 |
|
| 0.8±0.1 | 1.5±0.5 | 0.8±0.1 | 0.1±0 |
| 7.7±0.7 | 0.4±0 | 0.3±0.1 | 0.1±0.1 |
| 0.5±0.1 |
| 0.1±0 | |||||||||
| 0 | 0.3±0.2 |
|
| 0.7±0 | 1.8±0.6 | 1.5±0.6 |
| 4.8±1.4 | 0.4±0.1 | 0.2±0 | 0.3±0.2 |
| 0.5±0.1 |
| ||||||||||||||
| 20 | 5 | 10 | 12 | 1.4±1.0 |
|
| 0.8±0.2 | 1.6±0.5 | 1.2±0.4 | 0.2±0.1 |
| 12.6±4.8 | 0.8±0.4 | 0.7±0.1 | 1.0±0.6 |
| 0.5±0.1 | 0.1±0 |
| 0.1±0.1 |
| |||||||
| 0 | 0.8±0.4 |
|
| 0.7±0 | 1.2±0.1 | 0.7±0.1 | 0.1±0.1 |
| 13.2±5.7 | 0.5±0.2 | 0.7±0.1 | 1.1±0.2 |
| 0.5±0.1 |
| 0.1±0.1 |
| |||||||||||
*Molar percent of each fatty acid in day 6 mycelia. Averages ± standard deviations are shown. Blank: undetectable. 14:0 myristic acid (tetradecanoic acid), 14:1 myristoleic acid (tetradecenoic acid), 16:0 palmitic acid (hexadecanoic acid), 16:1 palmitoleic acid (hexadecenoic acid), 18:0 stearic acid (octadecanoic acid), 18:1 oleic acid (9Z-octadecenoic acid) and vaccenic acid (11Z-octadecenoic acid), 18:2 linoleic acid (octadecadienoic acid), 18:3 α-linolenic acid (9Z,12Z,15Z-octadecatrienoic acid) and γ-linolenic acid (6Z,9Z,12Z-octadecatrienoic acid), 20:0 arachidic acid (eicosanoic acid), 20:1 gondoic acid (eicosenoic acid), 20:2 (Eicosadienoic acid), 20:3 dihomo-γ-linolenic acid (eicosatrienoic acid), 20:4 arachidonic acid (eicosatetraenoic acid), 20:5 EPA (eicosapentaenoic acid), 22:0 behenic acid (docosanoic acid), 22:1 erucic acid (docosenoic acid), 22:5 DPA (docosapentaenoic acid), 22:6 DHA (docosahexaenoic acid), 24:0 lignoceric acid (tetracosanoic acid), 24:1 nervonic acid (tetracosenoic acid), 24:5 (tetracosapentaenoic acid), 24:6 (tetracosahexaenoic acid).
Figure 7Glycerolipid distribution.
Major glycerolipid (GL) species detected in M. alpina cultures grown at 12°C and 25°C (A) and percent distribution (B) are shown. The number before the colon indicates the total number of carbons in the three fatty acyl chains, and the number after the colon represents the total number of double bonds (e.g. 56:8 represents several species with a combined number of fatty acyl carbon atoms of 56 and 8 double-bonds).
Figure 8Glycerophospholipid distribution.
Phosphatidylcholine (PC), phosphatidylethanolamine (PE), phosphatidylserine (PS), phosphatidylinositol (PI), phosphatidic acid (PA), and lysophosphatidylcholine (LysoPC) species distributions are shown. Arrows indicate the major differences in glycerophospholipid between cultures at 12°C and 25°C.
Figure 9Sphingolipid distribution.
Sphingolipid species (A) and percent distribution (B) are shown. Cer: ceramide; Cer-1P: ceramide-1-phosphate; GluCer: glucosyl ceramide; Sph: sphingosine; Sph-1P: sphingosine-1-phosphate; dhSph: dihydrosphingosine; dhSph-1P: dihydrosphingosine-1-phosphate. Numbers in parentheses are percent of total sphingolipids.