Literature DB >> 23559637

Pattern recognition in bioinformatics.

Dick de Ridder1, Jeroen de Ridder, Marcel J T Reinders.   

Abstract

Pattern recognition is concerned with the development of systems that learn to solve a given problem using a set of example instances, each represented by a number of features. These problems include clustering, the grouping of similar instances; classification, the task of assigning a discrete label to a given instance; and dimensionality reduction, combining or selecting features to arrive at a more useful representation. The use of statistical pattern recognition algorithms in bioinformatics is pervasive. Classification and clustering are often applied to high-throughput measurement data arising from microarray, mass spectrometry and next-generation sequencing experiments for selecting markers, predicting phenotype and grouping objects or genes. Less explicitly, classification is at the core of a wide range of tools such as predictors of genes, protein function, functional or genetic interactions, etc., and used extensively in systems biology. A course on pattern recognition (or machine learning) should therefore be at the core of any bioinformatics education program. In this review, we discuss the main elements of a pattern recognition course, based on material developed for courses taught at the BSc, MSc and PhD levels to an audience of bioinformaticians, computer scientists and life scientists. We pay attention to common problems and pitfalls encountered in applications and in interpretation of the results obtained.

Entities:  

Keywords:  bioinformatics; classification; clustering; dimensionality reduction; pattern recognition

Mesh:

Year:  2013        PMID: 23559637     DOI: 10.1093/bib/bbt020

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  11 in total

1.  Analyse multiple disease subtypes and build associated gene networks using genome-wide expression profiles.

Authors:  Sara Aibar; Celia Fontanillo; Conrad Droste; Beatriz Roson-Burgo; Francisco J Campos-Laborie; Jesus M Hernandez-Rivas; Javier De Las Rivas
Journal:  BMC Genomics       Date:  2015-05-26       Impact factor: 3.969

2.  Dynamics reconstruction and classification via Koopman features.

Authors:  Wei Zhang; Yao-Chsi Yu; Jr-Shin Li
Journal:  Data Min Knowl Discov       Date:  2019-06-24       Impact factor: 3.670

3.  A multilevel approach for screening natural compounds as an antiviral agent for COVID-19.

Authors:  Mahdi Vasighi; Julia Romanova; Miroslava Nedyalkova
Journal:  Comput Biol Chem       Date:  2022-05-11       Impact factor: 3.737

Review 4.  Machine learning-enabled multiplexed microfluidic sensors.

Authors:  Sajjad Rahmani Dabbagh; Fazle Rabbi; Zafer Doğan; Ali Kemal Yetisen; Savas Tasoglu
Journal:  Biomicrofluidics       Date:  2020-12-11       Impact factor: 2.800

5.  Interpolation based consensus clustering for gene expression time series.

Authors:  Tai-Yu Chiu; Ting-Chieh Hsu; Chia-Cheng Yen; Jia-Shung Wang
Journal:  BMC Bioinformatics       Date:  2015-04-16       Impact factor: 3.169

6.  SPiCE: a web-based tool for sequence-based protein classification and exploration.

Authors:  Bastiaan A van den Berg; Marcel J T Reinders; Johannes A Roubos; Dick de Ridder
Journal:  BMC Bioinformatics       Date:  2014-03-31       Impact factor: 3.169

7.  Prediction model development of late-onset preeclampsia using machine learning-based methods.

Authors:  Jong Hyun Jhee; SungHee Lee; Yejin Park; Sang Eun Lee; Young Ah Kim; Shin-Wook Kang; Ja-Young Kwon; Jung Tak Park
Journal:  PLoS One       Date:  2019-08-23       Impact factor: 3.240

8.  A review of machine learning methods to predict the solubility of overexpressed recombinant proteins in Escherichia coli.

Authors:  Narjeskhatoon Habibi; Siti Z Mohd Hashim; Alireza Norouzi; Mohammed Razip Samian
Journal:  BMC Bioinformatics       Date:  2014-05-08       Impact factor: 3.169

9.  Visualisation of the T cell differentiation programme by Canonical Correspondence Analysis of transcriptomes.

Authors:  Masahiro Ono; Reiko J Tanaka; Manabu Kano
Journal:  BMC Genomics       Date:  2014-11-27       Impact factor: 3.969

10.  Identifying latent dynamic components in biological systems.

Authors:  Ivan Kondofersky; Christiane Fuchs; Fabian J Theis
Journal:  IET Syst Biol       Date:  2015-10       Impact factor: 1.615

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.