| Literature DB >> 33801002 |
Muhamed Wael Farouq1,2, Wadii Boulila3,4, Zain Hussain5, Asrar Rashid6, Moiz Shah7, Sajid Hussain8, Nathan Ng5, Dominic Ng9, Haris Hanif9, Mohamad Guftar Shaikh7, Aziz Sheikh5, Amir Hussain2.
Abstract
Machine learning (ML)-based algorithms are playing an important role in cancer diagnosis and are increasingly being used to aid clinical decision-making. However, these commonly operate as 'black boxes' and it is unclear how decisions are derived. Recently, techniques have been applied to help us understand how specific ML models work and explain the rational for outputs. This study aims to determine why a given type of cancer has a certain phenotypic characteristic. Cancer results in cellular dysregulation and a thorough consideration of cancer regulators is required. This would increase our understanding of the nature of the disease and help discover more effective diagnostic, prognostic, and treatment methods for a variety of cancer types and stages. Our study proposes a novel explainable analysis of potential biomarkers denoting tumorigenesis in non-small cell lung cancer. A number of these biomarkers are known to appear following various treatment pathways. An enhanced analysis is enabled through a novel mathematical formulation for the regulators of mRNA, the regulators of ncRNA, and the coupled mRNA-ncRNA regulators. Temporal gene expression profiles are approximated in a two-dimensional spatial domain for the transition states before converging to the stationary state, using a system comprised of coupled-reaction partial differential equations. Simulation experiments demonstrate that the proposed mathematical gene-expression profile represents a best fit for the population abundance of these oncogenes. In future, our proposed solution can lead to the development of alternative interpretable approaches, through the application of ML models to discover unknown dynamics in gene regulatory systems.Entities:
Keywords: coupled reaction PDE; diffusion equation; explainable machine learning; gene expression; non-small cell lung cancer
Mesh:
Year: 2021 PMID: 33801002 PMCID: PMC8003942 DOI: 10.3390/s21062190
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
List of mathematical notations.
| Notation | Description |
|---|---|
|
| Messenger RNA (mRNA) Population |
|
| Non-coding small interfering RNA (siRNA) Population |
|
| Production rate of mRNA |
|
| Self-degradation rate of mRNA |
|
| Coupled Degradation rate of mRNA due to siRNA |
|
| Production rate of siRNA |
|
| Self-degradation rate of siRNA |
|
| Diffusion coefficient of mRNA |
|
| Diffusion coefficient of siRNA |
Boundaries at the initial condition (IC) and Dirichlet boundary conditions.
| Boundaries of IC | Dirichlet B.Cs |
|---|---|
|
|
|
Figure 1Initial condition consistency correction. (a) Two-dimensional initial condition consistency correction. (b) Three-dimensional initial condition consistency correction.
Figure 2mRNA population structure at t = 0, 1, 2, 3, and 4 for each model.
Dataset notations.
| NT | Non-Treatment or Control Group |
| SE | Short-exposure non-thermal plasma treatment, measured post 1 h of treatment |
| LE post 1 h | Long-exposure non-thermal plasma treatment, measured post 1 h of treatment |
| LE post 2 h | Long-exposure non-thermal plasma treatment, measured post 2 h of treatment |
| LE post 4 h | Long-exposure non-thermal plasma treatment, measured post 4 h of treatment |
Transcription, self-degradation, and coupled degradation functions.
| Constant | Exponential | Log |
|---|---|---|
|
|
| |
| Epigenetic | ||
|
| ||
| Experimental | ||
|
|
Transcription, self-degradation, and coupled degradation functions for each model.
| Model |
|
|
|
|
|
|---|---|---|---|---|---|
| 1 |
|
| 10 | 1 | |
| 2 |
|
|
|
| |
| 3 |
|
|
|
|
|
| 4 |
|
|
|
|
|
| 5 |
|
|
|
| |
| 6 |
|
|
| 10 | 1 |
| 7 |
|
| 10 | 1 | |
| 8 |
|
|
| 1 | |
| 9 |
|
|
|
| |
| 10 |
|
|
|
|
|
| 11 |
|
|
|
|
|
| 12 |
|
|
|
|
|
| 13 |
| 1 |
|
|
|
| 14 |
|
|
| 1 | |
| 15 |
|
|
|
|
|
Figure 3Total mRNA population abundance at t = 0, 1, 2, 3, and 4 for each model.
Normalized scale of population abundance.
| Normalized Scale | ||
|---|---|---|
|
| 0 | 2.265458 |
|
| 2.267602 | 4.383277 |
|
| 4.383667 | 7.001413 |
|
| 7.002388 | 12 |
Molecular profiling of non-small lung cancer (NSLC) oncogenes.
| Fusion NT | Fusion LE Post 1 h | Fusion LE Post 2 h | Fusion LE Post 4 h | Model | |
|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|