Literature DB >> 33974036

Multimodal regularised linear models with flux balance analysis for mechanistic integration of omics data.

Giuseppe Magazzù1, Guido Zampieri1,2, Claudio Angione1,3,4.   

Abstract

MOTIVATION: High-throughput biological data, thanks to technological advances, have become cheaper to collect, leading to the availability of vast amounts of omic data of different types. In parallel, the in silico reconstruction and modelling of metabolic systems is now acknowledged as a key tool to complement experimental data on a large scale. The integration of these model- and data-driven information is therefore emerging as a new challenge in systems biology, with no clear guidance on how to better take advantage of the inherent multi-source and multi-omic nature of these data types while preserving mechanistic interpretation.
RESULTS: Here we investigate different regularisation techniques for high-dimensional data derived from the integration of gene expression profiles with metabolic flux data, extracted from strain-specific metabolic models, to improve cellular growth rate predictions. To this end, we propose ad-hoc extensions of previous regularisation frameworks including group, view-specific and principal component regularisation, and experimentally compare them using data from 1,143 Saccharomyces cerevisiae strains. We observe a divergence between methods in terms of regression accuracy and integration effectiveness based on the type of regularisation employed. In multi-omic regression tasks, when learning from experimental and model-generated omic data, our results demonstrate the competitiveness and ease of interpretation of multimodal regularised linear models compared to data-hungry methods based on neural networks. AVAILABILITY: All data, models, and code produced in this work are available on GitHub at https://github.com/Angione-Lab/HybridGroupIPFLasso_pc2Lasso. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) (2021). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  Lasso; Machine learning; flux balance analysis; multi-omics; regression; regularisation

Year:  2021        PMID: 33974036     DOI: 10.1093/bioinformatics/btab324

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  4 in total

1.  Loss of full-length dystrophin expression results in major cell-autonomous abnormalities in proliferating myoblasts.

Authors:  Maxime R F Gosselin; Virginie Mournetas; Malgorzata Borczyk; Suraj Verma; Annalisa Occhipinti; Justyna Róg; Lukasz Bozycki; Michal Korostynski; Samuel C Robson; Claudio Angione; Christian Pinset; Dariusz C Gorecki
Journal:  Elife       Date:  2022-09-27       Impact factor: 8.713

2.  Using machine learning as a surrogate model for agent-based simulations.

Authors:  Claudio Angione; Eric Silverman; Elisabeth Yaneske
Journal:  PLoS One       Date:  2022-02-10       Impact factor: 3.752

3.  Quantifying cumulative phenotypic and genomic evidence for procedural generation of metabolic network reconstructions.

Authors:  Thomas J Moutinho; Benjamin C Neubert; Matthew L Jenior; Jason A Papin
Journal:  PLoS Comput Biol       Date:  2022-02-07       Impact factor: 4.475

Review 4.  Exploring synergies between plant metabolic modelling and machine learning.

Authors:  Marta Sampaio; Miguel Rocha; Oscar Dias
Journal:  Comput Struct Biotechnol J       Date:  2022-04-16       Impact factor: 6.155

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.