| Literature DB >> 36171613 |
Chansub Lee1,2, Sungyoung Lee3,4, Eunchae Park1,2, Junshik Hong1,2,5, Dong-Yeop Shin1,2,5, Ja Min Byun1,2,5, Hongseok Yun6,7, Youngil Koh8,9,10, Sung-Soo Yoon11,12,13.
Abstract
BACKGROUND: Although anti-apoptotic proteins of the B-cell lymphoma-2 (BCL2) family have been utilized as therapeutic targets in acute myeloid leukaemia (AML), their complicated regulatory networks make individualized therapy difficult. This study aimed to discover the transcriptional signatures of BCL2 family genes that reflect regulatory dynamics, which can guide individualized therapeutic strategies.Entities:
Keywords: Acute myeloid leukaemia; B-cell lymphoma-2 family; Transcriptional signatures
Mesh:
Substances:
Year: 2022 PMID: 36171613 PMCID: PMC9520894 DOI: 10.1186/s13073-022-01115-w
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 15.266
Fig. 1Flowchart of the study. From AML RNA-seq datasets (BeatAML, LeuceGene, and TCGA), gene optimization is independently conducted for capturing regulation factors of the BCL2 family. Afterward, the BCL2 family signatures (BFSigs) are calculated using the selected genes. Using the signatures, three novel subtypes are identified and functionally characterized. A classifier for predicting venetoclax response is developed and validated. Additionally, drug response analysis reveals the signature-based subtype-specific drug sensitivity. Finally, the validity of our selected genes is confirmed in a custom NanoString panel. *For external validation of BeatAML results, the BFSigs were re-extracted after batch effect correction
Fig. 2Identification of BCL2 family-based acute myeloid leukaemia (AML) subtypes. A Profiles of BCL2 family signatures calculated using optimized genes in each RNA-seq dataset (BeatAML, LeuceGene, and TCGA). These datasets show three distinct clusters annotated as BCL2, MCL1/BCL2, and BFL1/MCL1 signature subtypes. Columns are clustered using hierarchical clustering with average distance. B Sample proportion of these subtypes. C Comparison of BCL2 family signatures between venetoclax response groups in BeatAML (81 sensitive and 72 resistant) and LeuceGene (20 sensitive and 3 resistant). P-values are calculated by Welch’s t-test. * < 0.05, ** < 0.01, ns > 0.10. D A profile of BCL2 family signatures in the NanoString dataset. The samples are also divided into three clusters resulting from RNA-seq datasets
Fig. 3Concordance of BCL2 family signatures between acute myeloid leukaemia (AML) datasets. Weight of optimized genes in the definition of the BCL2 family signatures (BCL2, MCL1/BCL2, and BFL1/MCL1 signature). BCL2, MCL1, and BFL1 are marked in cyan, magenta, and yellow, respectively. Some determinant components of the signatures are marked in black. Four AML datasets show the consistent weight of optimized genes. Each weight of genes is normalized to sum 1. Correlation coefficients are calculated using Spearman’s rho
Fig. 4Functional analysis of BCL2 family-based acute myeloid leukaemia (AML) subtypes. Gene set enrichment analysis (GSEA) from the comparison between one subtype and the others identifies enriched gene sets in each subtype. Enrichment patterns are consistent across three AML datasets. NES indicates a normalized enrichment score. The gene set of the MAPK pathway is from the GO database. The others are from the hallmark database
Fig. 5Association between BFSigs and drug responses. Comparison of drug responses between the subtypes in BeatAML and Tavor datasets. The lower the y-axis value, the more sensitive to the drugs. P-values above the panel and between the box plots are calculated by Kruskal-Wallis test and Wilcoxon rank-sum test, respectively. * < 0.05, ** < 0.01, *** < 0.001, ns > 0.05
Fig. 6Prediction response to venetoclax. A Probability of sensitivity to venetoclax calculated from BCL2 family signature-based logistic regression model in training set (BeatAML; 81 sensitive and 72 resistant) and external validation set (LeuceGene and Tavor; 20+32 sensitive and 3+2 resistant). The sensitivity probabilities of BeatAML represent the average probability from the 10-times repeated training-testing scheme. Those of LeuceGene and Tavor are calculated from the whole BeatAML-based classifier. P-values are calculated using Wilcoxon rank-sum test by comparing the probability rank between the response groups. B Comparison of prediction performance between venetoclax response classifiers. The black bar indicates the BCL2 family signature-based logistic regression model. The dark grey bars indicate logistic regression models using the original expression of five BCL family genes (BCL2+MCL1+BFL1+BCLXL+BCLW), three BCL2 family genes (BCL2+MCL1+BFL1), and top differentially expressed genes (DEGs), respectively. The light grey bars indicate machine learning-based models using total genes or pre-collected genes related to BCL2 family regulation. The used machine learning methods are support vector machine, Lasso, and random forest (RF). Error bar indicates 95% confidence interval (CI). P-values are calculated compared with the signature model using DeLong’s test. * < 0.05, ** < 0.01, *** < 0.001