| Literature DB >> 28056772 |
Olaa Motwalli1, Magbubah Essack1, Boris R Jankovic1, Boyang Ji2, Xinyao Liu3, Hifzur Rahman Ansari4, Robert Hoehndorf1, Xin Gao1, Stefan T Arold1, Katsuhiko Mineta1, John A C Archer1, Takashi Gojobori1, Ivan Mijakovic2, Vladimir B Bajic5.
Abstract
BACKGROUND: Finding a source from which high-energy-density biofuels can be derived at an industrial scale has become an urgent challenge for renewable energy production. Some microorganisms can produce free fatty acids (FFA) as precursors towards such high-energy-density biofuels. In particular, photosynthetic cyanobacteria are capable of directly converting carbon dioxide into FFA. However, current engineered strains need several rounds of engineering to reach the level of production of FFA to be commercially viable; thus new chassis strains that require less engineering are needed. Although more than 120 cyanobacterial genomes are sequenced, the natural potential of these strains for FFA production and excretion has not been systematically estimated.Entities:
Keywords: Biofuel; Bioinformatics; Cell factories; Computer science; Cyanobacteria; Free fatty acids; Optimization; Screening method
Mesh:
Substances:
Year: 2017 PMID: 28056772 PMCID: PMC5217662 DOI: 10.1186/s12864-016-3389-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
List of 49 OGs relevant for FFA production
| KEGG Orthology | Definition | Effects | Method | Organism | Ref. |
|---|---|---|---|---|---|
| rOGs | |||||
| K00873 | pyruvate kinase | Carbohydrate metabolism | cyan. | [ | |
| K01007 | pyruvate, water dikinase | ||||
| K00161 | pyruvate dehydrogenase E1 component alpha subunit | ||||
| K00162 | pyruvate dehydrogenase E1 component beta subunit | ||||
| K00627 | pyruvate dehydrogenase E2 component (dihydrolipoamide acetyltransferase) | ||||
| K00382 | dihydrolipoamide dehydrogenase | ||||
| K00648 | 3-oxoacyl-[acyl-carrier-protein] synthase III | Lipid metabolism | |||
| K00645 | [acyl-carrier-protein] S-malonyltransferase | ||||
| K09458 | 3-oxoacyl-[acyl-carrier-protein] synthase II | ||||
| K02372 | 3-hydroxyacyl-[acyl-carrier-protein] dehydratase | ||||
| K00208 | enoyl-[acyl-carrier protein] reductase I | ||||
| K01046 | triacylglycerol lipase | Increase chance of strain to secrete FA | secretion & extraction | [ | |
| pOGs | |||||
| K01962 | acetyl-CoA carboxylase carboxyl transferase subunit alpha | Enhance FFA production (Increase supply of desired substrate) | secretion | cyan. | [ |
| K01963 | acetyl-CoA carboxylase carboxyl transferase subunit beta | ||||
| K01961 | acetyl-CoA carboxylase, biotin carboxylase subunit | ||||
| K02160 | acetyl-CoA carboxylase biotin carboxyl carrier protein | ||||
| K00432 | glutathione peroxidase | Reduce the toxic effect of FFA production and improve cell growth, physiology and FFA production | secretion | cyan. | [ |
| K04564 | superoxide dismutase, Fe-Mn family | ||||
| K06198 | competence protein CoiA | ||||
| K03782 | catalase-peroxidase | ||||
| K03621 | glycerol-3-phosphate acyltransferase PlsX | Lead to higher lipid levels | plant | [ | |
| K08591 | glycerol-3-phosphate acyltransferase PlsY | ||||
| K00655 | 1-acyl-sn-glycerol-3-phosphate acyltransferase | ||||
| virNOG10454 PDAT1 | IQ-domain | Enhancing FA synthesis and diverting FA from membrane lipid to Triacylglycerol | accu. | [ | |
| virNOG19439 | oleosin 1 | ||||
| K14457 | 2-acylglycerol O-acyltransferase 2 | Enhance acyl-CoA-dependent triacylglycerol TAG | [ | ||
| virNOG24576 LCIA | Anion transporter | Help regulate CO2 intake and increase biomass | algae | [ | |
| virNOG22763 | Low-CO2 inducible protein | ||||
| K00006 | glycerol-3-phosphate dehydrogenase (NAD+) | Increase glycerol and neutral lipid content (16- and 18-carbon monounsaturated FA significantly increased) | diatom | [ | |
| K01601 | ribulose-bisphosphate carboxylase large chain | Improve FFA production | cyan. | [ | |
| K01602 | ribulose-bisphosphate carboxylase small chain | ||||
| K01648 | ATP citrate (pro-S)-lyase | Enhance biofuel precursor production | yeast | [ | |
| K10804 | acyl-CoA thioesterase I | Remove feedback inhibition and increase production of FFA | secretion | cyan. | [ |
| K10781 | fatty acyl-ACP thioesterase B (Plant thioesterase) | Modify the chain length of FFAs for better fuel quality | [ | ||
| K10782 | fatty acyl-ACP thioesterase A | Release FFA | [ | ||
| K14075 | pancreatic lipase-related protein 2 | Degrade the membrane lipids into FFA with collapse of cell | extraction | [ | |
| nOGs | |||||
| K01595 | phosphoenolpyruvate carboxylase | Increase the lipid content | cyan. | [ | |
| K01897 | long-chain acyl-CoA synthetase | Channel needed substrates for synthesis of FFA into divergent or reverse pathways and preventing degradation of desired product | secretion | cyan. | [ |
| K00059 | 3-oxoacyl-[acyl-carrier protein] reductase | Divert energy into production of substantial by-products that would compete with production of FFA | [ | ||
| K00626 | acetyl-CoA C-acetyltransferase | ||||
| K11003 | hemolysin D | Enhance secretion of FFA by weakening cell walls | |||
| cyaNOG01264 | penicillin-binding protein | Enhance secretion of FFA by weakening peptidoglycan layer | |||
| K13788 | phosphate acetyltransferase | “Channel needed substrates for synthesis of FFA into divergent or reverse pathways and preventing degradation of desired product” | |||
| K13282 | cyanophycinase | “Divert energy into production of substantial by-products that would compete with production of FFA” | |||
| K03802 | cyanophycin synthetase | ||||
| cyaNOG01069 | Carbohydrate-selective porin OprB | Enhanced extracellular FFA concentration | [ | ||
| K13535 | cardiolipin-specific phospholipase | Increase lipid yields without affecting growth or biomass | accu. | diatom | [ |
| K00030 | isocitrate dehydrogenase (NAD+) | Increase intracellular citrate level which enhance biofuel precursor production | yeast | [ | |
| K03603 | GntR family transcriptional regulator, negative regulator for fad regulon and positive regulator of fabA | Fatty acid biosynthesis is feadback-inhibited at the transcriptional level by fadR | bacterium | [ | |
Abbreviations: rOGs required OGs, pOGs, OGs that positively impact FFA production, nOGs, OGs that negatively impact FFA production, FFA Free Fatty Acid, accu. Accumulation, cyan. Cyanobactia
Classification: nOG (based on reported knockout or knockdown) and pOGs (based on reported inserted or overexpressed) during genetic engineering experiments on that organism in order to secretion, extraction, or accumulation fatty acid
Fig. 1Metabolic map depicting FFA biosynthesis and associated pathways, detailing where 64 proteins impact this process (see Table 1 or Additional file 1: Table S2). Abbreviations: 3-PGA/3PG, 3-phosphoglycerate/3-phosphoglyceric acid; 2PG, 2-phosphoglyceric acid; PEP, phosphoenolpyruvic acid; F6P, fructose 6-phosphate; RuBP, ribulose-1,5-bisphosphate; CO2, carbon dioxide; G3P, glyceraldehyde 3-phosphate; ROS, reactive oxygen species; TCA, tricarboxylic acid; CoA, coenzyme A; ACP, acyl carrier protein; FAS II, type II fatty acid synthases; ATP, Adenosine triphosphate; ADP, adenosine diphosphate
Ranked list of cyanobacterial strains based on their FFA production potential score
| Ranking position | Ranked species | Values |
|---|---|---|
| 1 |
| 1.000000 |
| 2 |
| 0.999132 |
| 3 |
| 0.986870 |
| 4 |
| 0.986697 |
| 5 |
| 0.985005 |
| 6 |
| 0.979893 |
| 7 |
| 0.978688 |
| 8 |
| 0.978592 |
| 9 |
| 0.978368 |
| 10 |
| 0.975490 |
| 11 |
| 0.974863 |
| 12 |
| 0.973976 |
| 13 |
| 0.973275 |
| 14 |
| 0.968580 |
| 15 |
| 0.966391 |
| 16 |
| 0.965687 |
| 17 |
| 0.964991 |
| 18 |
| 0.962108 |
| 19 |
| 0.957499 |
| 20 |
| 0.956602 |
| 21 |
| 0.951221 |
| 22 |
| 0.947124 |
| 23 |
| 0.938174 |
| 24 |
| 0.933825 |
| 25 |
| 0.931812 |
| 26 |
| 0.931077 |
| 27 |
| 0.929529 |
| 28 |
| 0.929529 |
| 29 |
| 0.928199 |
| 30 |
| 0.922843 |
| 31 |
| 0.921061 |
| 32 |
| 0.916500 |
| 33 |
| 0.916340 |
| 34 |
| 0.887757 |
| 35 |
| 0.885889 |
| 36 |
| 0.883513 |
| 37 |
| 0.883513 |
| 101 |
| 0.432198 |
| 123 |
| 0.006115 |
The list includes all cyanobacterial strain that rank above S. elongates PCC 7942 and all reference strains (for the full set see Additional file 1: Table S8). Positive reference strains are marked with superscript + and negative reference strains with *
Weights assigned to rules after optimization that reflect the impact of these rules in the overall scoring
| Importance of features | |
|---|---|
| Features | Weight |
| overexpression_K00432_Synpcc7942_1214 | 0.999999981 |
| overexpression_K04564_Synpcc7942_0801 | 0.999999101 |
| overexpression_K03782_Synpcc7942_1656 | 0.999998942 |
| overexpression_K02160_accB | 0.999998794 |
| present_K00873_pykf | 0.999998794 |
| knockout_K11003_hemolysin | 0.999998724 |
| underexpression_K13535_Thaps3_264297 | 0.997856841 |
| knockout_cyaNOG01069_porin | 0.946931624 |
| knockout_K01897_fadD | 0.921718924 |
| present_K09458_fabF | 0.456133041 |
| overexpression_K00006_GPDH | 0.396694273 |
| present_K00208_fabI | 0.387646822 |
| present_K00161_pdhA | 0.314952182 |
| present_K02372_fabZ | 0.288150995 |
| overexpression_virNOG24576_LCIA | 0.228675187 |
| present_K00648_fabH | 0.17462096 |
| present_K00627_odhB | 0.168613677 |
| present_K00645_fabD | 0.160058392 |
| insert_K01602_rbcS | 0.150753918 |
| present_K01046_lipase | 0.14966023 |
| overexpression_K06198_Synpcc7942_0437 | 0.119438174 |
| present_K01007_pps | 0.020105541 |
| insert_K14075_gpl | 0.013465425 |
| overexpression_K00655_plsC | 0.008613511 |
| overexpression_virNOG10454_PDAT1 | 0.00833575 |
| overexpression_K01963_accD | 0.008246186 |
| overexpression_K01961_accC | 0.008089475 |
| knockout_K00059_fabG | 0.007999865 |
| overexpression_K08591_plsY | 0.007682015 |
| overexpression_K01962_accA | 0.007630664 |
| insert_K10804_tesA | 0.005907112 |
| knockout_K00626_thi | 0.004833629 |
| knockout_cyaNOG01264_PBP2 | 0.004590024 |
| knockout_K03802_slr2002 | 0.004303632 |
| knockout_K03603_fadR | 0.004102976 |
| insert_K01601_rbcL | 0.003963175 |
| knockout_K00030_idh | 0.003153309 |
| knockout_K13788_pta | 0.001763091 |
| overexpression_K03621_plsX | 0.001763091 |
| overexpression_virNOG22763_LCIB | 0.001763091 |
| present_K00162_pdhB | 0.001763091 |
| present_K00382_phdD | 0.001763091 |
| underexpression_K01595_ppc | 0.001763091 |
| overexpression_virNOG19439_oleosins | 0.001299491 |
| knockout_K13282_slr2001 | 0.00115274 |
| insert_K01648_acl | 0.001045169 |
| insert_K14457_DGTT2 | 0.001001378 |
| insert_K10781_fatB | 0.001000657 |
| insert_K10782_fat1 | 0.001000152 |
Fig. 2Heatmap visualization of the cyanobacteria screened against the 49 OGs. Clades that contain top ranked strains are represented in green in dendrogram, while the clade that contain the diatoms are represented in black and the clade that contain the negative reference strains are represented in red. Also, positive reference strains names on the x-axis are encircled with green, top ranked strains with maroon and negative reference strains with red
Fig. 3A comparison of the binary (presence/absence) output for the 41 OGs produced by both Model SEED and FFASC. The length of the bar indicates the number of strains with the predicted OG. The absence of bar means the OGs presence/absence for all 25 strains are identical in both methods
Fig. 4Silhouette plot for clustering quality shows the average silhouette value for clustering 128 species into 6 clusters. A silhouette index ranges from -1 to 1 and a value greater than 0 and closer to 1 indicates that points are in the appropriate cluster
Fig. 5Visualization results of the k-means clustering for the 128 species. The data is projected onto 2D spaces to be able to visualize results using the first two components of the principal component analysis as the axis
The analyzed strains classified under their associated order names allocated to the six clusters
| Cluster 1 | Cluster 2 | Cluster 3 | Cluster 4 | Cluster 5 | Cluster 6 |
|---|---|---|---|---|---|
| Chroococcales | Chroococcales | Bacillariales | Chroococcales | Chroococcales | Chroococcales |
| Gloeobacterales | Nostocales | Thalassiosirales | Nostocales | Nostocales | Prochlorales |
| Nostocales | Oscillatoriales | Naviculales | Oscillatoriales | Oscillatoriales | |
| Oscillatoriales | Pleurocapsales | Prochlorales | Pleurocapsales | ||
| Pleurocapsales | Stigonematales | ||||
| Stigonematales |
Fig. 6Maximum-likelihood based phylogenetic tree of 124 cyanobacteria and the outgroup using 16S rRNA with bootstrap support. The branches and taxa name for positive reference strains are colored in green and for negative reference strains are colored in red, while the top predicted ranked strains are colored in blue (Table 2)
Fig. 7An example to illustrate homologues protein and domains presence and absence. As shown in the figure, if protein A has three homology hits (proteins x, y, and z), the homologous hit of protein A would only be considered if both of its domains (PFdomain1 and PFdomain2) are present in the hit. Hence, only protein x will be used in the analyses (both proteins y and z will be discarded). This stringent rule is applied to filter out weak homology hits obtained by BLAST
Fig. 8Flowchart of the ranking method employed. It defines the quantification rules based on nOG and pOG (quantification rules for rOG is not illustrated in this method, as rOG receive the values of “hitN”)
Fig. 9Pseudocode of Algorithm 1
Fig. 10Pseudocode of Algorithm 2