Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Machine learning: its challenges and opportunities in plant system biology.

Literature DB >> 35575915

Machine learning: its challenges and opportunities in plant system biology.

Mohsen Hesami¹, Milad Alizadeh², Andrew Maxwell Phineas Jones¹, Davoud Torkamaneh^3,4.

Abstract

Sequencing technologies are evolving at a rapid pace, enabling the generation of massive amounts of data in multiple dimensions (e.g., genomics, epigenomics, transcriptomic, metabolomics, proteomics, and single-cell omics) in plants. To provide comprehensive insights into the complexity of plant biological systems, it is important to integrate different omics datasets. Although recent advances in computational analytical pipelines have enabled efficient and high-quality exploration and exploitation of single omics data, the integration of multidimensional, heterogenous, and large datasets (i.e., multi-omics) remains a challenge. In this regard, machine learning (ML) offers promising approaches to integrate large datasets and to recognize fine-grained patterns and relationships. Nevertheless, they require rigorous optimizations to process multi-omics-derived datasets. In this review, we discuss the main concepts of machine learning as well as the key challenges and solutions related to the big data derived from plant system biology. We also provide in-depth insight into the principles of data integration using ML, as well as challenges and opportunities in different contexts including multi-omics, single-cell omics, protein function, and protein-protein interaction. KEY POINTS: • The key challenges and solutions related to the big data derived from plant system biology have been highlighted. • Different methods of data integration have been discussed. • Challenges and opportunities of the application of machine learning in plant system biology have been highlighted and discussed.

Entities: Chemical

Keywords: Big data; Data integration; Epigenomics; Multi-omics; Plant molecular biology; Prediction; Protein function; Transcription factor

Mesh：

Year: 2022 PMID： 35575915 DOI： 10.1007/s00253-022-11963-6

Source DB: PubMed Journal: Appl Microbiol Biotechnol ISSN： 0175-7598 Impact factor: 4.813

189 in total

1. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors: M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal: Nat Genet Date: 2000-05 Impact factor: 38.330

2. Proteomics: The interaction map.

Authors: Monya Baker
Journal: Nature Date: 2012-04-11 Impact factor: 49.962

3. Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.

Authors: Babak Alipanahi; Andrew Delong; Matthew T Weirauch; Brendan J Frey
Journal: Nat Biotechnol Date: 2015-07-27 Impact factor: 54.908

Review 4. Proteomics: Technologies and Their Applications.

Authors: Bilal Aslam; Madiha Basit; Muhammad Atif Nisar; Mohsin Khurshid; Muhammad Hidayat Rasool
Journal: J Chromatogr Sci Date: 2016-10-18 Impact factor: 1.618

Review 5. Team effort: Combinatorial control of seed maturation by transcription factors.

Authors: Milad Alizadeh; Ryan Hoy; Bailan Lu; Liang Song
Journal: Curr Opin Plant Biol Date: 2021-07-31 Impact factor: 7.834

6. Exploring single-cell data with deep multitasking neural networks.

Authors: Matthew Amodio; David van Dijk; Krishnan Srinivasan; Guy Wolf; Smita Krishnaswamy; William S Chen; Hussein Mohsen; Kevin R Moon; Allison Campbell; Yujiao Zhao; Xiaomei Wang; Manjunatha Venkataswamy; Anita Desai; V Ravi; Priti Kumar; Ruth Montgomery
Journal: Nat Methods Date: 2019-10-07 Impact factor: 28.547

7. Prediction of plant promoters based on hexamers and random triplet pair analysis.

Authors: A K M Azad; Saima Shahid; Nasimul Noman; Hyunju Lee
Journal: Algorithms Mol Biol Date: 2011-06-28 Impact factor: 1.405

8. Multi-Omics Factor Analysis-a framework for unsupervised integration of multi-omics data sets.

Authors: Ricard Argelaguet; Britta Velten; Damien Arnol; Sascha Dietrich; Thorsten Zenz; John C Marioni; Florian Buettner; Wolfgang Huber; Oliver Stegle
Journal: Mol Syst Biol Date: 2018-06-20 Impact factor: 11.429

9. Accurate prediction of protein structures and interactions using a three-track neural network.

Authors: Minkyung Baek; Frank DiMaio; Ivan Anishchenko; Justas Dauparas; Sergey Ovchinnikov; Gyu Rie Lee; Jue Wang; Qian Cong; Lisa N Kinch; R Dustin Schaeffer; Claudia Millán; Hahnbeom Park; Carson Adams; Caleb R Glassman; Andy DeGiovanni; Jose H Pereira; Andria V Rodrigues; Alberdina A van Dijk; Ana C Ebrecht; Diederik J Opperman; Theo Sagmeister; Christoph Buhlheller; Tea Pavkov-Keller; Manoj K Rathinaswamy; Udit Dalwadi; Calvin K Yip; John E Burke; K Christopher Garcia; Nick V Grishin; Paul D Adams; Randy J Read; David Baker
Journal: Science Date: 2021-07-15 Impact factor: 47.728

3. Mathematical modeling and optimizing the in vitro shoot proliferation of wallflower using multilayer perceptron non-dominated sorting genetic algorithm-II (MLP-NSGAII).

Authors: Fazilat Fakhrzad; Abolfazl Jowkar; Javad Hosseinzadeh
Journal: PLoS One Date: 2022-09-09 Impact factor: 3.752

3 in total