| Literature DB >> 35596177 |
Mateus Bernabe Fiamenghi1,2,3, João Gabriel Ribeiro Bueno2, Antônio Pedro Camargo1,2, Guilherme Borelli1,2, Marcelo Falsarella Carazzolle1,2, Gonçalo Amarante Guimarães Pereira4,5, Leandro Vieira Dos Santos2,6, Juliana José1,2.
Abstract
BACKGROUND: The need to mitigate and substitute the use of fossil fuels as the main energy matrix has led to the study and development of biofuels as an alternative. Second-generation (2G) ethanol arises as one biofuel with great potential, due to not only maintaining food security, but also as a product from economically interesting crops such as energy-cane. One of the main challenges of 2G ethanol is the inefficient uptake of pentose sugars by industrial yeast Saccharomyces cerevisiae, the main organism used for ethanol production. Understanding the main drivers for xylose assimilation and identify novel and efficient transporters is a key step to make the 2G process economically viable.Entities:
Keywords: Feature selection; Industrial biotechnology; Machine learning; Pentose metabolism; Xylose; Xylose transporter
Year: 2022 PMID: 35596177 PMCID: PMC9123741 DOI: 10.1186/s13068-022-02153-7
Source DB: PubMed Journal: Biotechnol Biofuels Bioprod ISSN: 2731-3654
Fig. 1Graphical representations of machine learning model against the dataset. a Force-plot of most important features as calculated by Recursive Feature Elimination by Cross-Validation with XGBoost. Features highlighted in red are responsible for driving the final prediction of a sample into the positive category (A probable xylose transporter) while features in blue drive the prediction into the negative category (A non-xylose transporter). The base value represents the average prediction for the samples, while the size of the feature represents its impact (higher or lower importance). b Common metrics used to evaluate a model, the grey values correspond to the base threshold model and blue to the altered threshold. c Confusion matrix showing the results of predictions against the test data
Fig. 2Snippet of fam10 phylogeny transformed into a cladogram for visualization purposes, coupled with the alignment around the site found under positive selection by MEME. In red are the transporters chosen for further characterization. Bootstraps are not shown as all of them on these clades were over 80
Fig. 3Spot-assay of EBY_Xyl1 carrying each of the indicated transporters and growing in a different sugars and b different concentrations of xylose. Initial OD600 was settled at 1 before the tenfold serial dilution. Plates were incubated in 30 °C. All experiments were performed in triplicate
Fig. 4Comparative fermentation assays of EBY_Xyl1 expressing different transporters in xylose (full lines) or glucose (dashed lines). a Growth of EBY_Xyl1 during xylose fermentation. b Xylose consumption of EBY_Xyl1 cells expressing the transporters over time. Note that SpX does not appear clearly as it overlaps with SpG. c Growth of EBY_Xyl1 expressing SuL, GXF1 as positive control and pRS426 (empty vector) as negative control during xylose/glucose co-fermentation and d sugar consumption of EBY_Xyl1 expressing SuL, GXF1 as positive control and pRS426 (empty vector) as negative control during xylose/glucose co-fermentation (note that SuL glucose fermentation overlaps with GXF1)
Fig. 5Superimposed structures of xylE coupled with xylose (blue) and predicted structures for the four xylose transporters and GXF1 (pink tones). The 2D representations show the probable interactions between xylose and amino acids in the binding site for each transporter
Docking results for the four candidate transporters, xylE (self-docking) and Gxf1
| Protein name | Xylose | Glucose | ||
|---|---|---|---|---|
| Affinity | Δ RMSD from crystal | Affinity | Δ RMSD from crystal | |
| xylE | − 5.8 | 1.776 | − 6 | 0.619 |
| SuL | − 5.0 | 0.948 | − 5.8 | 1.166 |
| Gxf1 | − 5.3 | 1.085 | − 5.7 | 1.296 |
| SpH | − 5.4 | 2.274 | − 5.5 | 2.252 |
| SpX | − 5.5 | 2.454 | − 6 | 2.668 |
| SpG | − 5.5 | 2.300 | − 5.6 | 1.223 |
Affinity represents the stability of the ligand in the binding site (the more negative the better), and ΔRMSD represents the difference in pose between docked prediction and xylE crystal position