| Literature DB >> 35997343 |
Ricardo J Pais1,2.
Abstract
Clinical bioinformatics is a newly emerging field that applies bioinformatics techniques for facilitating the identification of diseases, discovery of biomarkers, and therapy decision. Mathematical modelling is part of bioinformatics analysis pipelines and a fundamental step to extract clinical insights from genomes, transcriptomes and proteomes of patients. Often, the chosen modelling techniques relies on either statistical, machine learning or deterministic approaches. Research that combines bioinformatics with modelling techniques have been generating innovative biomedical technology, algorithms and models with biotech applications, attracting private investment to develop new business; however, startups that emerge from these technologies have been facing difficulties to implement clinical bioinformatics pipelines, protect their technology and generate profit. In this commentary, we discuss the main concepts that startups should know for enabling a successful application of predictive modelling in clinical bioinformatics. Here we will focus on key modelling concepts, provide some successful examples and briefly discuss the modelling framework choice. We also highlight some aspects to be taken into account for a successful implementation of cost-effective bioinformatics from a business perspective.Entities:
Keywords: clinical applications; clinical bioinformatics; diagnostics; mathematical models; predictive modelling; prognostics
Year: 2022 PMID: 35997343 PMCID: PMC9397027 DOI: 10.3390/biotech11030035
Source DB: PubMed Journal: BioTech (Basel) ISSN: 2673-6284
Often used modelling techniques in clinical bioinformatics and their main characteristics.
| Modelling | Description | Application | Requirements |
|---|---|---|---|
| Statistical | Scoring and probability functions that assumes a distribution shape or behaviour. | Continuous | Data for parameter estimation. Depend on sample size. |
| Kinetic | Solving of systems of nonlinear differential equations. Do not assume any behaviour. Instead relies on rate laws of processes such as chemical reactions. | Binary | Requires reported or estimated kinetic parameter. Do not depend on sample size. |
| Logical | Solving of logical equations based on predefined rules for each component. Assumes asynchronous or synchronous update schemes. | Binary | Requires relational knowledge of its components. Do not depend on sample size. |
| Regression | Fitting of an assumed mathematical equation on data. Often are used models that describe a particular assumed data behaviour such as linear, polynomial, exponential, and logistic. | Binary | Data for model fitting. Depend on sample size. |
| Random | Supervised machine leaning algorithm based on averaging multiple generated decision trees. | Binary | Data for model training and validation. Requires large datasets |
| Support | Supervised machine leaning algorithm based on clustering algorithms such as principal component analyses. | Binary | Data for model training and validation. Requires large datasets |
| Neural | Supervised machine leaning algorithm based on defining a set of neuron and layers as model components. Assumes all possible relational interactions between neurons. | Binary | Data for model training and validation. Requires large datasets |