| Literature DB >> 35664298 |
Aneta Polewko-Klim1, Sibo Zhu2,3,4, Weicheng Wu2,3, Yijing Xie2,3, Ning Cai2,3, Kexun Zhang2,3, Zhen Zhu2,3, Tao Qing3, Ziyu Yuan3, Kelin Xu2,3, Tiejun Zhang2,3, Ming Lu3,5, Weimin Ye3,6, Xingdong Chen3,4, Chen Suo2,3,7, Witold R Rudnicki1,8.
Abstract
The standard therapy administered to patients with advanced esophageal cancer remains uniform, despite its two main histological subtypes, namely esophageal squamous cell carcinoma (SCC) and esophageal adenocarcinoma (AC), are being increasingly considered to be different. The identification of potential drug target genes between SCC and AC is crucial for more effective treatment of these diseases, given the high toxicity of chemotherapy and resistance to administered medications. Herein we attempted to identify and rank differentially expressed genes (DEGs) in SCC vs. AC using ensemble feature selection methods. RNA-seq data from The Cancer Genome Atlas and the Fudan-Taizhou Institute of Health Sciences (China). Six feature filters algorithms were used to identify DEGs. We built robust predictive models for histological subtypes with the random forest (RF) classification algorithm. Pathway analysis also be performed to investigate the functional role of genes. 294 informative DEGs (87 of them are newly discovered) have been identified. The areas under receiver operator curve (AUC) were higher than 99.5% for all feature selection (FS) methods. Nine genes (i.e., ERBB3, ATP7B, ABCC3, GALNT14, CLDN18, GUCY2C, FGFR4, KCNQ5, and CACNA1B) may play a key role in the development of more directed anticancer therapy for SCC and AC patients. The first four of them are drug targets for chemotherapy and immunotherapy of esophageal cancer and involved in pharmacokinetics and pharmacodynamics pathways. Research identified novel DEGs in SCC and AC, and detected four potential drug targeted genes (ERBB3, ATP7B, ABCC3, and GALNT14) and five drug-related genes.Entities:
Keywords: Feature Selection (FS); drug target genes; ensemble learning (EL); esophageal cancer (ESCA); random forest (RF)
Year: 2022 PMID: 35664298 PMCID: PMC9161154 DOI: 10.3389/fgene.2022.844542
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Procedures involved in selecting the most informative biomarkers.
Comparison of feature selection methods.
| Metric | Ttest | MDFS1D | MDFS2D | FCBF | ReliefF | MRMR |
|---|---|---|---|---|---|---|
| AUC | 0.996 | 0.998 | 0.997 | 0.994 | 0.996 | 0.999 |
| MCC | 0.994 | 0.997 | 0.996 | 0.991 | 0.993 | 0.998 |
| ASM | 0.52 | 0.43 | 0.40 | 0.05 | 0.34 | 0.53 |
Note: The first two rows display AUC and MCC obtained for RF classifier on 100 most relevant genes selected with each feature selection method. The last row displays the stability of these, which was measured using ASM. Fifty repeats of 5-fold cross-validation were performed. Standard deviation of mean AUC and MCC was <0.001. See notation in the main text.
FIGURE 2Boxplot of Log2 normalized RNA-Seq gene expression of 4 membrane encoding genes related with SCA anti-cancer drugs. Boxplot contains the p-value of mean differential expression between AC and SCC patients groups with a two-sample t-test.
FIGURE 3Network of drugs important for chemotherapy in patients with esophageal cancer, and genes identified from the set of the most informative biomarkers predictive of the two main histological subtypes of esophageal cancer.
Top 10 membrane protein-encoding genes that were under- or overexpressed in SCC vs. AC
| Top down-expressed membrane protein encoding genes in SCC | |||
|---|---|---|---|
| No | Gene | Log2FC | Drugs |
| 1 |
| 7.57 | CLAUDIXIMAB |
| 2 |
| 7.27 | |
| 3 |
| 6.03 | |
| 4 |
| 5.87 | |
| 5 |
| 5.86 | ACARBOSE, SCOPOLAMINE, DEXAMETHASONE, MIFEPRISTONE, STREPTOZOTOCIN, FURAN, SODIUM BETA-NICOTINAMIDE ADENINE DINUCLEOTIDE PHOSPHATE, HEXAMETHYLENEBISACETAMIDE |
| 6 |
| 5.77 | |
| 7 |
| 5.49 | LINACLOTIDE, PLECANATIDE, PANITUMUMAB, PIRIBEDIL (CHEMBL1371770), PHOSPHORIC ACID, LINACLOTIDE ACETATE, GUANOSINE MONOPHOSPHATE, CYCLIC GMP |
| 8 |
| 5.29 | CALCIUM |
| 9 |
| 5.16 | PROSCILLARIDIN, BUMETANIDE, FUROSEMIDE, TRANSTORINE, PAMOIC ACID, ZAPRINAST, PYRANTEL, KYNURENIC ACID |
| 10 |
| 5.06 | |
Note: Where applicable, each gene is accompanied by a list of drugs that were associated with it in at least one of the following databases: DrugBank, PharmGKB, ClinicalTrials.gov, DGIdb, and FDA Approved Drugs. Standard deviation of the expression values at the Log2FC level was <0.002.
FIGURE 4The gene-gene interaction network of membrane protein-encoding genes obtained with GeneMANIA. Green edges correspond to the functional associations between genes (nodes), while pink edges represent the predicted gene-gene interaction. The black edges correspond to genes functionally associated with the MUC1 hub gene (sub-network 1). The solid blue edges correspond to genes functionally associated with the KCNQ5 and GJA1 hub genes (sub-network 2), while the blue dashed edges correspond to genes functionally associated with the KCNQ5, GJA1, and MUC1.