| Literature DB >> 33817323 |
Abstract
A new logistic regression-based method to distinguish between cancerous and noncancerous RNA genomic data is developed and tested with 100% precision on 595 healthy and cancerous prostate samples. A logistic regression system is developed and trained using whole-exome sequencing data at a high-level, i.e., normalized quantification of RNAs obtained from 495 prostate cancer samples from The Cancer Genome Atlas and 100 healthy samples from the Genotype-Tissue Expression project. We could show that both sensitivity and specificity of the method in the classification of cancerous and noncancerous cells are perfectly 100%.Entities:
Keywords: RNA sequencing; classification; diagnosis; high throughput technologies; logistic regression; machine learning; prostate cancer; transcriptome
Year: 2021 PMID: 33817323 PMCID: PMC8005780 DOI: 10.1515/med-2021-0238
Source DB: PubMed Journal: Open Med (Wars)
Classification report
| Summary | Precision | Recall | f1 score | Support |
|---|---|---|---|---|
| Class 0 | 1.00 | 1.00 | 1.00 | 9 |
| Class 1 | 1.00 | 1.00 | 1.00 | 51 |
| Micro avg. | 1.00 | 1.00 | 1.00 | 60 |
| Macro avg. | 1.00 | 1.00 | 1.00 | 60 |
| Weighted avg. | 1.00 | 1.00 | 1.00 | 60 |
Figure 1ROC curve of LGR classifier performance in distinguishing cancerous and noncancerous prostate cells.