| Literature DB >> 32948183 |
Nicolas Borisov1,2, Maxim Sorokin3,4, Victor Tkachev3, Andrew Garazha3, Anton Buzdin3,5,4,6.
Abstract
BACKGROUND: Machine learning (ML) methods still have limited applicability in personalized oncology due to low numbers of available clinically annotated molecular profiles. This doesn't allow sufficient training of ML classifiers that could be used for improving molecular diagnostics.Entities:
Keywords: Biomarkers detection; Cancer; Chemotherapy; Clinical oncology; Gene expression; Machine learning; Microarrays; Molecular diagnostics; Personalized medicine; RNA sequencing; Transcriptomics
Year: 2020 PMID: 32948183 PMCID: PMC7499993 DOI: 10.1186/s12920-020-00759-0
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Overview of selected transcriptomic datasets of responders/non-responders to cancer chemotherapy, responders (R) vs non-responders (NR)
| Reference | Dataset ID | Disease type | Therapy | Experimental platform | Number | Number of |
| [ | GSE25066 | Breast cancer with different hormonal and HER2 status | Neoadjuvant taxane + anthracycline | Affymetrix Human Genome U133 Array | 508 (118 R: 389 NR: | 20 |
| [ | GSE41998 | Breast cancer with different hormonal and HER2 status | Neoadjuvant doxorubicin + cyclophosphamide, followed by paclitaxel | Affymetrix Human Genome U133 Array | 124 (90 R: 34 NR: | 11 |
| [ | GSE20271 | Breast cancer with different hormonal and HER2 status | Paclitaxel + fluorouracil + adriamycin + cyclophosphamide | Affymetrix Human Genome U133A Array | 85 (18 R: 66 NR: | 11 |
| [ | GSE50948 | Breast cancer with different hormonal and HER2 status | Paclitaxel + doxorubincin followed by cyclophos-phamide + methotrexate/ fluorouracil followed by trastuzumab | Affymetrix Human Genome U133 Plus 2.0 Array | 156 (53 R: | 19 |
| [ | GSE9782 | Multiple myeloma | Bortezomib monotherapy | Affymetrix Human Genome U133 Array | 169 (85 R: 84 NR: | 18 |
| [ | GSE39754 | Multiple myeloma | Vincristine + adriamycin + dexamethasone followed by autologous stem cell transplantation (ASCT) | Affymetrix Human Exon 1.0 ST Array | 136 (74 R: | 16 |
| [ | GSE68871 | Multiple myeloma | Bortezomib-thalidomide-dexamethasone | Affymetrix Human Genome U133 Plus | 118 (69 R: | 12 |
| [ | GSE55145 | Multiple myeloma | Bortezomib followed by ASCT | Affymetrix Human Exon 1.0 ST Array | 61 (33 R: | 14 |
| [ | TARGET-50 | Childhood kidney Wilms tumor | Vincristine sulfate + cyclosporine, cytarabine, daunorubicin + conventional surgery + radiation therapy | Illumina HiSeq 2000 | 122 (36 R: | 14 |
| [ | TARGET-10 | Childhood B acute lymphoblastic leukemia | Vincristine sulfate + carboplatin, cyclophosphamide, doxorubicin | Illumina HiSeq 2000 | 98 (30 R, 68 NR: see Fig. | 14 |
| [ | TARGET-20 | Childhood acute myeloid leukemia | Non-target drugs (asparaginase, cyclosporine, cytarabine, daunorubicin, etoposide; methotrexate, mitoxantrone) including busulfan and cyclo-phosphamide | Illumina HiSeq 2000 | 54 (31 R, 23 NR: see Fig. | 10 |
| [ | TARGET-20 | Childhood acute myeloid leukemia | Same non-target drugs, but excluding busulfan and cyclo- phosphamide | Illumina HiSeq 2000 | 142 (62 R, 80 NR: see Fig. | 16 |
| Reference | Dataset ID | Disease type | Therapy | Experimental platform | Number | Number of |
| [ | GSE18728 | Breast cancer | Docetaxel, capecitabine | Affymetrix Human Genome U133 Plus 2.0 Array | 61 (23R: | 16 |
| [ | GSE20181 | Breast cancer | Letrozole | Affymetrix Human Genome U133A Array | 52 (37 R: | 11 |
| [ | GSE20194 | Breast cancer | Paclitaxel; (tri) luoroacetyl chloride; 5-fluorouracil, epirubicin, cyclophosphamide | Affymetrix Human Genome U133A Array | 52 (11 R: | 10 |
| [ | GSE23988 | Breast cancer | Docetaxel, capecitabine | Affymetrix Human Genome U133A Array | 61 (20 R: | 18 |
| [ | GSE22358 | Breast cancer | Docetaxel, capecitabine | Agilent UNC Perou Lab | 122 (116 R: | 2 |
| [ | GSE32646 | Breast cancer | Paclitaxel, 5-fluorouracil, epirubicin, cyclophosphamide | Affymetrix Human Genome U133 Plus 2.0 Array | 115 (27 R: | 17 |
| [ | GSE37946 | Breast cancer | Trastuzumab | Affymetrix Human Genome U133A Array | 50 (27 R: | 14 |
| [ | GSE42822 | Breast cancer | Docetaxel, 5-fluorouracil, epirubicin, cyclophosphamide, capecitabine | Affymetrix Human Genome U133A Array | 91 (38 R: | 13 |
| [ | GSE5122 | Acute myeloid leukemia | Tipifarnib | Affymetrix Human Genome U133A Array | 57 (13 R: | 10 |
| [ | GSE59515 | Breast cancer | Letrozole | Illumina HumanHT-12 V4.0 expression beadchip | 75 (51 R: | 15 |
| [ | GSE76360 | Breast cancer | Trastuzumab | Illumina HumanHT-12 V3.0 expression beadchip | 48 (42 R: 6 NR: | 3 |
| [ | TCGA-LGG | Low-grade glioma | Temozolomide + (optionally) mibefradil | Illumina HiSeq 2000 | 131 (100 R: | 9 |
| [ | TCGA-LC | Lung cancer all types | Paclitaxel + (optionally), cisplatin/carboplatin, reolysin | Illumina HiSeq 2000 | 41 (24 R: | 7 |
| [ | TCGA-UC | Uterine corpus endothelial carcinoma | Paclitaxel + (optionally) carboplatin, cisplatin, doxorubicin | Illumina HiSeq 2000 | 57 (57 R: | 2 |
Fig. 1Distribution of event-free survival time for the patients with (a) childhood kidney Wilms tumor from TARGET-50 dataset, (b) childhood ALL from TARGET-10 dataset and (c) childhood AML from TARGET-20 dataset [35]. Patients on the left from vertical threshold can be considered as the non-responders, and on the right – as the responders to the treatment
Fig. 2Possible scenarios of using ML to build classifiers based on gene expression datasets. a Methods data dimensionality reduction; b approaches to merging and enlarging of gene expression datasets for ML application; c general workflow for a core marker set determination