Literature DB >> 29956014

A Survey of Data Mining and Deep Learning in Bioinformatics.

Kun Lan1, Dan-Tong Wang2, Simon Fong3, Lian-Sheng Liu4, Kelvin K L Wong5, Nilanjan Dey6.   

Abstract

The fields of medicine science and health informatics have made great progress recently and have led to in-depth analytics that is demanded by generation, collection and accumulation of massive data. Meanwhile, we are entering a new period where novel technologies are starting to analyze and explore knowledge from tremendous amount of data, bringing limitless potential for information growth. One fact that cannot be ignored is that the techniques of machine learning and deep learning applications play a more significant role in the success of bioinformatics exploration from biological data point of view, and a linkage is emphasized and established to bridge these two data analytics techniques and bioinformatics in both industry and academia. This survey concentrates on the review of recent researches using data mining and deep learning approaches for analyzing the specific domain knowledge of bioinformatics. The authors give a brief but pithy summarization of numerous data mining algorithms used for preprocessing, classification and clustering as well as various optimized neural network architectures in deep learning methods, and their advantages and disadvantages in the practical applications are also discussed and compared in terms of their industrial usage. It is believed that in this review paper, valuable insights are provided for those who are dedicated to start using data analytics methods in bioinformatics.

Keywords:  Bioinformatics; Biomedicine; Data mining; Deep learning; Machine learning

Mesh:

Year:  2018        PMID: 29956014     DOI: 10.1007/s10916-018-1003-9

Source DB:  PubMed          Journal:  J Med Syst        ISSN: 0148-5598            Impact factor:   4.460


  46 in total

1.  Minimum information about a microarray experiment (MIAME)-toward standards for microarray data.

Authors:  A Brazma; P Hingamp; J Quackenbush; G Sherlock; P Spellman; C Stoeckert; J Aach; W Ansorge; C A Ball; H C Causton; T Gaasterland; P Glenisson; F C Holstege; I F Kim; V Markowitz; J C Matese; H Parkinson; A Robinson; U Sarkans; S Schulze-Kremer; J Stewart; R Taylor; J Vilo; M Vingron
Journal:  Nat Genet       Date:  2001-12       Impact factor: 38.330

2.  Penalized and weighted K-means for clustering with scattered objects and prior information in high-throughput biological data.

Authors:  George C Tseng
Journal:  Bioinformatics       Date:  2007-06-27       Impact factor: 6.937

3.  Learning long-term dependencies with gradient descent is difficult.

Authors:  Y Bengio; P Simard; P Frasconi
Journal:  IEEE Trans Neural Netw       Date:  1994

4.  Model-based clustering for RNA-seq data.

Authors:  Yaqing Si; Peng Liu; Pinghua Li; Thomas P Brutnell
Journal:  Bioinformatics       Date:  2013-11-04       Impact factor: 6.937

5.  Design ensemble machine learning model for breast cancer diagnosis.

Authors:  Sheau-Ling Hsieh; Sung-Huai Hsieh; Po-Hsun Cheng; Chi-Huang Chen; Kai-Ping Hsu; I-Shun Lee; Zhenyu Wang; Feipei Lai
Journal:  J Med Syst       Date:  2011-08-03       Impact factor: 4.460

6.  Robust Density-Based Clustering To Identify Metastable Conformational States of Proteins.

Authors:  Florian Sittel; Gerhard Stock
Journal:  J Chem Theory Comput       Date:  2016-04-21       Impact factor: 6.006

Review 7.  Deep learning in bioinformatics.

Authors:  Seonwoo Min; Byunghan Lee; Sungroh Yoon
Journal:  Brief Bioinform       Date:  2017-09-01       Impact factor: 11.622

8.  Automated Chest X-Ray Screening: Can Lung Region Symmetry Help Detect Pulmonary Abnormalities?

Authors:  K C Santosh; Sameer Antani
Journal:  IEEE Trans Med Imaging       Date:  2018-05       Impact factor: 10.048

9.  Edge map analysis in chest X-rays for automatic pulmonary abnormality screening.

Authors:  K C Santosh; Szilárd Vajda; Sameer Antani; George R Thoma
Journal:  Int J Comput Assist Radiol Surg       Date:  2016-03-19       Impact factor: 2.924

10.  A new approach for interpreting Random Forest models and its application to the biology of ageing.

Authors:  Fabio Fabris; Aoife Doherty; Daniel Palmer; João Pedro de Magalhães; Alex A Freitas
Journal:  Bioinformatics       Date:  2018-07-15       Impact factor: 6.937

View more
  26 in total

1.  PreDTIs: prediction of drug-target interactions based on multiple feature information using gradient boosting framework with data balancing and feature selection techniques.

Authors:  S M Hasan Mahmud; Wenyu Chen; Yongsheng Liu; Md Abdul Awal; Kawsar Ahmed; Md Habibur Rahman; Mohammad Ali Moni
Journal:  Brief Bioinform       Date:  2021-03-12       Impact factor: 11.622

2.  Cross-registry neural domain adaptation to extract mutational test results from pathology reports.

Authors:  Anthony Rios; Eric B Durbin; Isaac Hands; Susanne M Arnold; Darshil Shah; Stephen M Schwartz; Bernardo H L Goulart; Ramakanth Kavuluru
Journal:  J Biomed Inform       Date:  2019-08-08       Impact factor: 6.317

3.  Data Mining, Quality and Management in the Life Sciences.

Authors:  Amonida Zadissa; Rolf Apweiler
Journal:  Methods Mol Biol       Date:  2022

4.  Deep learning-a first meta-survey of selected reviews across scientific disciplines, their commonalities, challenges and research impact.

Authors:  Jan Egger; Antonio Pepe; Christina Gsaxner; Yuan Jin; Jianning Li; Roman Kern
Journal:  PeerJ Comput Sci       Date:  2021-11-17

5.  GeMI: interactive interface for transformer-based Genomic Metadata Integration.

Authors:  Giuseppe Serna Garcia; Michele Leone; Anna Bernasconi; Mark J Carman
Journal:  Database (Oxford)       Date:  2022-06-03       Impact factor: 4.462

Review 6.  Potential and impact of artificial intelligence algorithms in dento-maxillofacial radiology.

Authors:  Kuo Feng Hung; Qi Yong H Ai; Yiu Yan Leung; Andy Wai Kan Yeung
Journal:  Clin Oral Investig       Date:  2022-04-19       Impact factor: 3.606

7.  Graph-based description of tertiary lymphoid organs at single-cell level.

Authors:  Nadine S Schaadt; Ralf Schönmeyer; Germain Forestier; Nicolas Brieu; Peter Braubach; Katharina Nekolla; Michael Meyer-Hermann; Friedrich Feuerhake
Journal:  PLoS Comput Biol       Date:  2020-02-21       Impact factor: 4.475

8.  Prevalence and Diagnosis of Neurological Disorders Using Different Deep Learning Techniques: A Meta-Analysis.

Authors:  Ritu Gautam; Manik Sharma
Journal:  J Med Syst       Date:  2020-01-04       Impact factor: 4.460

9.  Mining Prognosis Index of Brain Metastases Using Artificial Intelligence.

Authors:  Shigao Huang; Jie Yang; Simon Fong; Qi Zhao
Journal:  Cancers (Basel)       Date:  2019-08-09       Impact factor: 6.639

10.  Diagnosis of COVID-19 and non-COVID-19 patients by classifying only a single cough sound.

Authors:  Mesut Melek
Journal:  Neural Comput Appl       Date:  2021-07-30       Impact factor: 5.102

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.