Literature DB >> 35743052

Editorial of Special Issue "Deep Learning and Machine Learning in Bioinformatics".

Mingon Kang1, Jung Hun Oh2.   

Abstract

In recent years, deep learning has emerged as a highly active research field, achieving great success in various machine learning areas, including image processing, speech recognition, and natural language processing, and now rapidly becoming a dominant tool in biomedicine [...].

Entities:  

Mesh:

Year:  2022        PMID: 35743052      PMCID: PMC9224509          DOI: 10.3390/ijms23126610

Source DB:  PubMed          Journal:  Int J Mol Sci        ISSN: 1422-0067            Impact factor:   6.208


In recent years, deep learning has emerged as a highly active research field, achieving great success in various machine learning areas, including image processing, speech recognition, and natural language processing, and now rapidly becoming a dominant tool in biomedicine [1]. In particular, a dramatically increasing number of deep learning-based approaches have been proposed in biomedical image analysis and biosignal processing, as well as medical prediction modeling. However, the application of deep learning to genomics and bioinformatics has been rather limited, perhaps due to the combined difficulties of interpretation as well as steep data requirements. One of the major challenges is that many approaches in deep learning and traditional machine learning are based on the assumption that the number of samples is huge in order to train models with a vast number of features. The situation in medicine is often reversed by necessity: the number of features desired to be analyzed is often one or two orders of magnitude greater than the number of samples. Researchers must contend with this fundamental issue, and in the end must be content with models that are consistent with the data. In this Special Issue entitled “Deep Learning and Machine Learning in Bioinformatics”, submissions address the application of deep learning and novel machine learning methods to diverse bioinformatic problems and provide practical guidance. These methods include useful approaches that may improve predictive performance and separately enhance our understanding of biological mechanisms of target diseases. Among the 55 submissions reviewed, 21 were accepted, including 17 research articles and 4 reviews, with 124 contributors. The contributions were global, for the accepted papers originating from 12 countries, including Australia (2), China, France, Italy (3), Japan (2), Poland, South Korea (2), Spain, Sweden, Taiwan, Thailand, and the United States (5). Figure 1 shows the map of countries with the symbol ★ for the first or corresponding authors of the accepted papers.
Figure 1

A map of countries with the symbol ★ for the first or corresponding authors of the accepted papers.

Ten research papers demonstrated the application of deep learning to various kinds of biological data. Le et al. proposed an ensemble neural network to identify essential genes via word embedding features from genomic sequences [2]. Persson Hodén et al. developed a convolutional neural network (CNN) model capable of efficiently identifying true mRNA cleavage sites, which was implemented as an R package called smartPARE [3]. Nosi et al. proposed a neural network method to detect MET exon 14 skipping events using RNAseq data from The Cancer Genome Atlas (TCGA) archive for lung cancer [4]. Alessandri et al. developed a new autoencoder model, called Sparsely Connected Autoencoders, to improve the traditional decoder model for better identifying biological features from single cell data [5]. Al Mamun et al. developed a multi-run concrete autoencoder to identify a stable set of features which was applied to TCGA genome-wide lncRNA expression profiles in 12 cancers, resulting in the identification of key lncRNAs [6]. Lee et al. introduced a peptide data augmentation method, which was employed to predict spider neurotoxic peptides, showing improved predictive power when coupled with a CNN model [7]. Madani et al. developed a novel deep learning sequence-based solubility predictor, called DSResSol, for fast, reliable, and inexpensive prediction of protein solubility [8]. Zulfiqar et al. developed a 1D CNN-based model, named Deep-4mCGP, to identify 4mC sites in Geobacter pickeringii [9]. Roethel et al. developed a deep learning architecture for a holistic sequential and structural analysis of biomolecules [10]. Hazra et al. employed generative adversarial networks (GAN) to create synthetic nucleic acid sequences of the cat genome [11]. Seven research papers used traditional (non-deep learning) machine learning approaches to analyze biological data. Two computational methods were introduced, PUP-Fuse [12] and PredNTS [13], for the prediction of pupylation sites and nitrotyrosine sites, respectively, by integrating multiple sequence representations coupled with a random forest approach. Rodin et al. proposed a novel computational pipeline to dissect the response to cancer immunotherapy, employing systems biology and Bayesian network techniques on flow cytometry data [14]. Campos et al. employed machine learning approaches to identify essential genes common to both Caenorhabditis elegans and Drosophila melanogaster [15]. Charoenkwan et al. developed a sequence-based predictor, named iBitter-Fuse, to identify bitter peptides by fusing multi-view features [16]. Jabeen et al. adopted a random forest model to identify novel high activity agonists of human ectopic olfactory receptors [17]. Pouryahya et al. proposed a network-based clustering method coupled with optimal mass transport theory to predict cell line-drug sensitivity, and showed that random forest modeling conducted on the resulting cell line-drug clusters outperformed alternative computational methods in predicting in vitro drug responses [18]. Four papers reviewed the use of deep learning or machine learning approaches to biological data analysis. Auslander et al. reviewed machine learning/deep learning approaches incorporated to establish bioinformatics and computational biology frameworks in the areas of molecular evolution, protein structure analysis, systems biology, and disease genomics [19]. Del Giudice et al. comprehensively reviewed machine learning/deep learning solutions for computational problems in bulk and single-cell RNA-sequencing data analysis [20]. Banegas-Luna et al. discussed the interpretability of machine learning/deep learning methods in cancer research [21]. Defresne et al. reviewed deep learning methods used for protein design [22]. In summary, the articles in this Special Issue provide a range of reviews and updates to the use of deep learning and machine learning in bioinformatics.
  22 in total

Review 1.  Deep learning in bioinformatics.

Authors:  Seonwoo Min; Byunghan Lee; Sungroh Yoon
Journal:  Brief Bioinform       Date:  2017-09-01       Impact factor: 11.622

Review 2.  Artificial Intelligence in Bulk and Single-Cell RNA-Sequencing Data to Foster Precision Oncology.

Authors:  Marco Del Giudice; Serena Peirone; Sarah Perrone; Francesca Priante; Fabiola Varese; Elisa Tirtei; Franca Fagioli; Matteo Cereda
Journal:  Int J Mol Sci       Date:  2021-04-27       Impact factor: 5.923

3.  A Computational Framework Based on Ensemble Deep Neural Networks for Essential Genes Identification.

Authors:  Nguyen Quoc Khanh Le; Duyen Thi Do; Truong Nguyen Khanh Hung; Luu Ho Thanh Lam; Tuan-Tu Huynh; Ngan Thi Kim Nguyen
Journal:  Int J Mol Sci       Date:  2020-11-28       Impact factor: 5.923

4.  MET Exon 14 Skipping: A Case Study for the Detection of Genetic Variants in Cancer Driver Genes by Deep Learning.

Authors:  Vladimir Nosi; Alessandrì Luca; Melissa Milan; Maddalena Arigoni; Silvia Benvenuti; Davide Cacchiarelli; Marcella Cesana; Sara Riccardo; Lucio Di Filippo; Francesca Cordero; Marco Beccuti; Paolo M Comoglio; Raffaele A Calogero
Journal:  Int J Mol Sci       Date:  2021-04-19       Impact factor: 5.923

5.  Dissecting Response to Cancer Immunotherapy by Applying Bayesian Network Analysis to Flow Cytometry Data.

Authors:  Andrei S Rodin; Grigoriy Gogoshin; Seth Hilliard; Lei Wang; Colt Egelston; Russell C Rockne; Joseph Chao; Peter P Lee
Journal:  Int J Mol Sci       Date:  2021-02-26       Impact factor: 5.923

6.  Deep-4mCGP: A Deep Learning Approach to Predict 4mC Sites in Geobacter pickeringii by Using Correlation-Based Feature Selection Technique.

Authors:  Hasan Zulfiqar; Qin-Lai Huang; Hao Lv; Zi-Jie Sun; Fu-Ying Dao; Hao Lin
Journal:  Int J Mol Sci       Date:  2022-01-23       Impact factor: 5.923

7.  Pan-Cancer Prediction of Cell-Line Drug Sensitivity Using Network-Based Methods.

Authors:  Maryam Pouryahya; Jung Hun Oh; James C Mathews; Zehor Belkhatir; Caroline Moosmüller; Joseph O Deasy; Allen R Tannenbaum
Journal:  Int J Mol Sci       Date:  2022-01-19       Impact factor: 6.208

8.  Generative Adversarial Networks for Creating Synthetic Nucleic Acid Sequences of Cat Genome.

Authors:  Debapriya Hazra; Mi-Ryung Kim; Yung-Cheol Byun
Journal:  Int J Mol Sci       Date:  2022-03-28       Impact factor: 5.923

Review 9.  Incorporating Machine Learning into Established Bioinformatics Frameworks.

Authors:  Noam Auslander; Ayal B Gussow; Eugene V Koonin
Journal:  Int J Mol Sci       Date:  2021-03-12       Impact factor: 5.923

View more
  1 in total

1.  Federated Learning-Based Detection of Invasive Carcinoma of No Special Type with Histopathological Images.

Authors:  Bless Lord Y Agbley; Jianping Li; Md Altab Hossin; Grace Ugochi Nneji; Jehoiada Jackson; Happy Nkanta Monday; Edidiong Christopher James
Journal:  Diagnostics (Basel)       Date:  2022-07-09
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.