| Literature DB >> 33525807 |
William Jones1, Kaur Alasoo1, Dmytro Fishman2,3, Leopold Parts1,2.
Abstract
Deep learning is the trendiest tool in a computational biologist's toolbox. This exciting class of methods, based on artificial neural networks, quickly became popular due to its competitive performance in prediction problems. In pioneering early work, applying simple network architectures to abundant data already provided gains over traditional counterparts in functional genomics, image analysis, and medical diagnostics. Now, ideas for constructing and training networks and even off-the-shelf models have been adapted from the rapidly developing machine learning subfield to improve performance in a range of computational biology tasks. Here, we review some of these advances in the last 2 years.Entities:
Keywords: bioinformatics; computational biology; deep learning
Year: 2017 PMID: 33525807 PMCID: PMC7289034 DOI: 10.1042/ETLS20160025
Source DB: PubMed Journal: Emerg Top Life Sci ISSN: 2397-8554
Short overview of computational biology deep learning papers published until the first quarter of 2017
| Name | Title | Architecture | Input | Output | Highlight | Category |
|---|---|---|---|---|---|---|
| DeepBind | Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning [ | CNN | DNA sequence | TF binding | Arbitrary length sequences | DNA binding |
| DeeperBind | DeeperBind: enhancing prediction of sequence specificities of DNA binding proteins [ | CNN-RNN | DNA sequence | TF binding | Sequences of arbitrary length. Adds LSTM to DeepBind model. | DNA binding |
| DeepSEA | Predicting effects of noncoding variants with deep learning-based sequence model [ | CNN | DNA sequence | TF binding | 3-layer CNN | DNA binding |
| DanQ | DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences [ | CNN-RNN | DNA sequence | TF binding | Adds LSTM layer to DeepSEA model | DNA binding |
| TFImpute | Imputation for transcription factor binding predictions based on deep learning [ | CNN | DNA sequence; ChIP-seq | TF binding | Impute TF binding in unmeasured cell types | DNA binding |
| Basset | Basset: learning the regulatory code of the accessible genome with deep convolutional neural networks [ | CNN | DNA sequence | Chromatin accessibility | Uses DNAse-seq data from 164 cell types | DNA binding |
| OrbWeaver | Impact of regulatory variation across human iPSCs and differentiated cells [ | CNN | DNA sequence | Chromatin accessibility | Uses known TF motifs as fixed filters in the CNN | DNA binding |
| CODA | Denoising genome-wide histone ChIP-seq with convolutional neural networks [ | CNN | ChIP-seq | ChIP-seq | Denoise ChiP-seq data | DNA binding |
| DeepEnhancer | DeepEnhancer: predicting enhancers by convolutional neural networks [ | CNN | DNA sequence | Enhancer prediction | Convert convolutional filters to PWMs, compare to motif databases | DNA binding |
| TIDE | TIDE: predicting translation initiation sites by deep learning [ | CNN-RNN | RNA sequence | Translation initiation sites (QTI-seq) | DanQ model | RNA binding |
| ROSE | ROSE: a deep learning based framework for predicting ribosome stalling [ | CNN | RNA sequence | Ribosome stalling (ribosome profiling) | Parallel convolutions | RNA binding |
| iDeep | RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach [ | CNN-DBN | RNA sequence; | RNA binding proteins (CLiP-seq) | Integrate multiple diverse data sources | RNA binding |
| Known motifs | ||||||
| Secondary structure | ||||||
| co-binding | ||||||
| transcript region | ||||||
| Deepnet-rbp | A deep learning framework for modeling structural features of RNA-binding protein targets [ | DBN | RNA sequence | RNA binding proteins (CLiP-seq) | Uses k-mer counts instead of a CNN to capture RNA sequence features | RNA binding |
| secondary structure | ||||||
| tertiary structure | ||||||
| SPEID | Predicting enhancer-promoter interaction from genomic sequence with deep neural networks [ | CNN-RNN | DNA sequence | Promoter-enhancer interactions | Inspired by DanQ | 3D interactions |
| Rambutan | Nucleotide sequence and DNaseI sensitivity are predictive of 3D chromatin architecture [ | CNN | DNA sequence | Hi-C interactions | Binarised input signal | 3D interactions |
| DNAse-seq | ||||||
| Genomic distance | ||||||
| DeepChrome | A deep learning framework for modeling structural features of RNA-binding protein targets [ | CNN | Histone modification (ChIP-seq) | Gene expression | Binary decision: expressed or not expressed | Transcription |
| FIDDLE | FIDDLE: An integrative deep learning framework for functional genomic data inference [ | CNN | DNA sequence | Transcription start sites (TSS-seq) | DNA sequences alone not sufficient for prediction, other data helps | Transcription |
| RNA-seq | ||||||
| NET-seq | ||||||
| MNAse-seq | ||||||
| ChIP-seq | ||||||
| CNNProm | Recognition of prokaryotic and eukaryotic promoters using convolutional deep learning neural networks [ | CNN | DNA sequence | Promoter predictions | Predicts promoters from DNA sequnce features | Transcription |
| DeepCpG | DeepCpG: accurate prediction of single-cell DNA methylation states using deep learning [ | CNN-GRU | DNA sequence | DNA methylation state (binary) | Predict DNA methylation state in single cells based on sequence content (CNN) and noisy measurement (GRU) | DNA methylation |
| scRRBS-seq | ||||||
| CpGenie | Predicting the impact of non-coding variants on DNA methylation [ | CNN | DNA sequence | DNA methylation state (binary) | Predict genetic variants that regulate DNA methyaltion | DNA methylation |
| DNN-HMM | De novo identification of replication-timing domains in the human genome by deep learning [ | Hidden markov model (HMM) combinded with deep belief network (DBN) | Replicated DNA sequencing (Repli-seq) | Replication timing | Predict replication timing domains from Repli-seq data | Other |
| DeepCons | Understanding sequence conservation with deep learning [ | CNN | DNA sequence | Sequence conservation | Works on noncoding sequences only | Other |
| GMFR-CNN | GMFR-CNN: an integration of gapped motif feature representation and deep learning approach for enhancer prediction [ | CNN | DNA sequence | TF binding | Uses data from the DeepBind paper. Integrates gapped DNA motifs (as introduced by gkm-SVM) with a convolutional neural network | DNA binding |
| DeepVariant | Creating a universal SNP and small indel variant caller with deep neural networks [ | CNN | Image | Assignment of low confidence variant call (Illumina sequencing) | Turns sequence, base quality, and strand information into image | Basecalling |
| Goby | Compression of structured high-throughput sequencing data [ | Dense | Features | Base call (Illumina sequencing) | Part of wider variant calling framework | Basecalling |
| DeepNano | DeepNano: Deep Recurrent Neural Networks for Base Calling in MinION Nanopore Reads [ | RNN | Current | Base call (nanopore sequencing) | Uses raw nanopore sequencing signal | Basecalling |
| - | Deep learning for population genetic inference [ | Dense | Features | Effective population size; selection coefficient | Estimate multiple population genetic parameters in one model | Population genetics |
| Leveraging uncertainty information from deep neural networks for disease detection [ | BCNN | Image (retina) | Disease probability | For each image estimates an uncertainty of the network, if this uncertainty is too high, discards image | Medical diagnostics | |
| DRIU | Deep retinal image understanding [ | CNN | Image (retina) | Segmentation | Super-human performance, task customised layers | Retinal segmentation |
| IDx-DR X2.1 | Improved automated detection of diabetic retinopathy on a publicly available dataset through integration of deep learning [ | CNN | Image (retina) | DR stages | Added DL component into the algorithm and reported its superior performance | DR detection |
| Deep learning is effective for classifying normal versus age-related macular degeneration OCT images [ | CNN (VGG16) | Image (OCT) | Normal versus Age-related macular degeneration | Visualised salience maps to confirm that areas of high interest for the network match pathology areas | Age-related macular degeneration classification | |
| Medical image synthesis with context-aware generative adversarial networks [ | GAN | Image (MR patch) | CT patch | Predicts CT image from 3D MRI, could also be used for super-resolution, image denoising etc | Medical image synthesis | |
| DeepAD | DeepAD: Alzheimer's disease classification via deep convolutional neural networks using MRI and fMRI [ | CNN | Image (fMRI and MRI) | AD vs NC | 99.9% accuracy for LeNet architecture, fishy | Alzheimer's disease classification |
| Brain tumor segmentation with deep neural networks [ | CNN | Image (MRI) | Segmentation of the brain | Stacked CNNs, fast implementation | Glioblastoma | |
| Brain tumor segmentation using convolutional neural networks in MRI images [ | CNN | Image (MRI) | Segmentation of the brain | |||
| A deep learning-based segmentation method for brain tumor in MR images [ | SDAE + DNN | Image (MRI) | Segmentation of the brain | |||
| Classification of schizophrenia versus normal subjects using deep learning [ | SAE + SVM | Image (3D fMRI volume) | Disease probability | Works on directly on active voxel time series without conversion | Schizophrenia classification | |
| Predicting brain age with deep learning from raw imaging data results in a reliable and heritable biomarker [ | 3D CNN | Image (minimally preprocessed raw T1-weighted MRI data) | Age | Almost no preprocessing, brain age was shown to be heritable | Age prediction | |
| Mass detection in digital breast tomosynthesis: deep convolutional neural network with transfer learning from mammography [ | CNN | Image (mammography + DBT) | Disease probability | Network was first trained on mammography images, then first three conv. layers were fixed while other layers were initialised and trained again on DBT (Transfer Learning) | Medical diagnostics + Transfer Learning | |
| Large scale deep learning for computer aided detection of mammographic lesions [ | CNN + RF | Image (mammography patch) | Disease probability | Combines handcrafted features with learned by CNN to train RF | Mammography lesions classification | |
| DeepMammo | Breast mass classification from mammograms using deep convolutional neural networks [ | CNN | Image (mammography patch) | Disease probability | Transfer learning from pre-trained CNNs | Mammography lesions classification |
| Unsupervised deep learning applied to breast density segmentation and mammographic risk scoring [ | CSAE | Image (mammogram) | Segmentation and classification of lesions | Developed a novel regularisor | Mammography segmentation and classification | |
| A deep learning approach for the analysis of masses in mammograms with minimal user intervention [ | CNN + DBN | Image (mammogram) | Benign vs malignant class | End to end approach with minimal user intervention, some small tech innovation at each stage | Mammography segmentation and classification | |
| Detecting cardiovascular disease from mammograms with deep learning [ | CNN | Image (mammogram patch) | BAC vs normal | Using mammograms for cardiovascular disease diagnosis | Breast arterial calcifications detection | |
| Lung pattern classification for interstitial lung disease using a deep convolutional neural network [ | CNN | Image (CT patch) | 7 ILD classes | Maybe the first attempt to characterize lung tissue with deep CNN tailored for the problem | Medical diagnostics | |
| Multi-source transfer learning with convolutional neural networks for lung pattern analysis [ | CNN | Image (CT patch) | 7 ILD classes | Transfer learning + ensemble | ||
| Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning [ | CNN | Image (CT) | ILD classes and Lung Node detection | Transfer learning, many architectures, IDL and LN detection | ||
| Computer-aided diagnosis with deep learning architecture: applications to breast lesions in us images and pulmonary nodules in CT scans [ | SDAE | Image (US and CT ROI) | Benign vs malignant class | Used the same SDAE for both breast lesions in US images and pulmonary nodules in CT scans, concatenated handcrafted features to original ROI pixels | CAD | |
| Dermatologist-level classification of skin cancer with deep neural networks [ | CNN | Image (Skin) | Disease classes | Could be potentially used on a server side to power self-diagnosis of skin cancer | Medical diagnostics | |
| Early-stage atherosclerosis detection using deep learning over carotid ultrasound images [ | AE | Image (US) | Segmentation and classification of arterial layers | Fully automatic US segmentation | Intima-media thickness measurement | |
| Fusing deep learned and hand-crafted features of appearance, shape, and dynamics for automatic pain estimation [ | CNN + LR | Image (Face) | Pain intensity | Combines handcrafted features with learned by CNN to train Linear regressor | Pain intensity estimation | |
| Recurrent convolutional neural network regression for continuous pain intensity estimation in video [ | RCNN | Video frames | Pain intensity | Pain intensity estimation | ||
| Efficient diagnosis system for Parkinson's disease using deep belief network [ | DBN | Sound (Speech) | Parkinson vs normal | Parkinson diagnosis | ||
| Application of semi-supervised deep learning to lung sound analysis [ | DA + 2 SVM | Sound (Lung sounds) | Sound scores | Handling small data sets with DA + potential application | Pulmonary disease diagnosis | |
| Application of deep learning for recognizing infant cries [ | CNN | Sound (Infant cry) | Class scores | Sound classification | ||
| Deep learning framework for detection of hypoglycemic episodes in children with type 1 diabetes [ | DBN | ECG | Hypoglycemic episode onset | Real-time episodes detection | Hypoglycemic episodes detection | |
| Deep learning approach for active classification of electrocardiogram signals [ | SDAE | ECG | AAMI classes | Uses raw ECG | Classification of electrocardiogram signals | |
| AgingAI | Deep biomarkers of human aging: application of deep neural networks to biomarker development [ | 21 DNN | Blood test measurements | Age | Online tool which could be used to collect training data, 5 biomarkers for aging | Age prediction |
| DeepCell | Deep learning automates the quantitative analysis of individual cells in live-cell imaging experiments [ | CNN | Microscopy images | Cell segmentations | Able to segment both mammalian and bacterial cells | Segmentation |
| U-Net | U-Net: convolutional networks for biomedical image segmentation [ | CNN | Biomedical images | Segmentations | Won the ISBI 2015 EM segmentation challenge | Segmentation |
| 3D U-Net | 3D U-Net: learning dense volumetric segmentation from sparse annotation [ | CNN | Volumetic images | 3D Segmentations | Able to quickly volumetric images | Segmentation |
| V-Net | V-Net: Fully convolutional neural networks for volumetric medical image segmentation [ | CNN | Volumetic images | 3D Segmentations | Performs 3D convolutions | Segmentation |
| DeepYeast | Accurate classification of protein subcellular localization from high throughput microscopy images using deep learning [ | CNN | Microscopy images | Yeast protein localisation classification | Automatic Phenotyping | |
| Deep machine learning provides state-of-the-art performance in image-based plant phenotyping [ | CNN | Plant images | Plant section phenotyping | Automatic Phenotyping | ||
| Classifying and segmenting microscopy images with deep multiple instance learning [ | CNN | Microscopy images | Yeast protein localisation classification | Performs multi-instance localisation | Automatic Phenotyping | |
| DeadNet | DeadNet: identifying phototoxicity from label-free microscopy images of cells using Deep ConvNets [ | CNN | Microscopy images | Phototoxicity identification | Automatic Phenotyping | |
| Deep learning for imaging flow cytometry: cell cycle analysis of Jurkat cells [ | CNN | Single cell microscopy images | Cell-cycle prediction | Automatic Phenotyping | ||
| Prospective identification of hematopoietic lineage choice by deep learning [ | CNN | Brightfield time course imaging | Hematopoitic lineage choice | Lineage choice can be detected up to three generations before conventional molecular markers are observable | Automatic Phenotyping | |
| Automating morphological profiling with generic deep convolutional networks [ | CNN | Microscopy images | Feature extraction | Automatic Phenotyping | ||
While we have tried to be comprehensive, some papers may have been missed due to the rapid development of the field. Acronyms used: AE, autoencoder; BCNN, bayesian convolutional neural network; CNN, convolutional neural network; CSAE, convolutional sparse autoencoder; DA, denoising autoencoder; DBN, deep belief network; GAN, generative adversarial network; GRU, gated recurrent unit; LR, linear regression; RCNN, recurrent convolutional neural network; RF, random forest; RNN, recurrent neural network; SAE, stacked autoencoder; SDAE, stacked denoising auto-encoder; SVM, support vector machines.