| Literature DB >> 35053601 |
Anetta Sulewska1, Jacek Niklinski1, Radoslaw Charkiewicz1,2, Piotr Karabowicz3, Przemyslaw Biecek4, Hubert Baniecki4, Oksana Kowalczuk1, Miroslaw Kozlowski5, Patrycja Modzelewska3, Piotr Majewski6, Elzbieta Tryniszewska6, Joanna Reszec3,7, Zofia Dzieciol-Anikiej3,8, Cezary Piwkowski9, Robert Gryczka10, Rodryg Ramlau10.
Abstract
LncRNAs have arisen as new players in the world of non-coding RNA. Disrupted expression of these molecules can be tightly linked to the onset, promotion and progression of cancer. The present study estimated the usefulness of 14 lncRNAs (HAGLR, ADAMTS9-AS2, LINC00261, MCM3AP-AS1, TP53TG1, C14orf132, LINC00968, LINC00312, TP73-AS1, LOC344887, LINC00673, SOX2-OT, AFAP1-AS1, LOC730101) for early detection of non-small-cell lung cancer (NSCLC). The total RNA was isolated from paired fresh-frozen cancerous and noncancerous lung tissue from 92 NSCLC patients diagnosed with either adenocarcinoma (LUAD) or lung squamous cell carcinoma (LUSC). The expression level of lncRNAs was evaluated by a quantitative real-time PCR (qPCR). Based on Ct and delta Ct values, logistic regression and gradient boosting decision tree classifiers were built. The latter is a novel, advanced machine learning algorithm with great potential in medical science. The established predictive models showed that a set of 14 lncRNAs accurately discriminates cancerous from noncancerous lung tissues (AUC value of 0.98 ± 0.01) and NSCLC subtypes (AUC value of 0.84 ± 0.09), although the expression of a few molecules was statistically insignificant (SOX2-OT, AFAP1-AS1 and LOC730101 for tumor vs. normal tissue; and TP53TG1, C14orf132, LINC00968 and LOC730101 for LUAD vs. LUSC). However for subtypes discrimination, the simplified logistic regression model based on the four variables (delta Ct AFAP1-AS1, Ct SOX2-OT, Ct LINC00261, and delta Ct LINC00673) had even stronger diagnostic potential than the original one (AUC value of 0.88 ± 0.07). Our results demonstrate that the 14 lncRNA signature can be an auxiliary tool to endorse and complement the histological diagnosis of non-small-cell lung cancer.Entities:
Keywords: biomarkers; diagnosis; epigenetics; lncRNA; lung cancer
Year: 2022 PMID: 35053601 PMCID: PMC8773641 DOI: 10.3390/cancers14020439
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
The brief characteristics of 14 lncRNA selected for the experiment.
| Symbol | Detected Transcripts | Qiagen ID | Name | Target | Mechanism of Tumorigenesis | Biological Activity | Ref. |
|---|---|---|---|---|---|---|---|
| HAGLR | NR_033979 | LPH04484A | HOXD Antisense Growth-Associated Long Non-Coding RNA | MMP9; miR-147a | cell proliferation (+); cell migration (+); cell invasion (+) | ceRNA(miR-133b); sponge | [ |
| ADAMTS9-AS2 | ENST00000460833 | LPH14614A | ADAMTS9 Antisense RNA 2 | TGFBR3 | cell proliferation (−); cell migration (−); apoptosis rate (+) | ceRNA(miR-223-3p); sponge | [ |
| LOC344887 | ENST00000306399 | LPH01284A | NmrA Like Redox Sensor 2, Pseudogene | lymph node metastasis (+); advance stage (+); poorer differentiation (+) | [ | ||
| SOX2-OT | ENST00000410534 | LPH15037A | SOX2 Overlapping Transcript | AKT/ERK, SOX2/GLI-1 | resistance to (TKI)-erlotinib (+) resistance to cisplatin-based therapy (+), clinical prognosis (−); lung malignant phenotype (+) | interact with protein | [ |
| LINC00261 | NR_001558 | LPH22443A | Long Intergenic Non-Protein Coding RNA 261 | FOXO1; SFRP2; FHL1 | growth (−); metastasis (−) | ceRNA(miR-1269a); ceRNA(miR-522-3p); ceRNA(miR-105); sponge | [ |
| LINC00673 | NR_036488 | LPH00090A | Long Intergenic Non-Protein Coding RNA 673 | EZH2, miR-150-5p, HOXA5, NCALD, KDM1A | oncogenic role (+), cell proliferation (+); cell invasion (+); cell migration (+); epithelial to mesenchymal transition (+) | histone modification; interact with protein; sponge | [ |
| MCM3AP-AS1 | NR_002776 | LPH03207A | MCM3AP Antisense RNA 1 | KPNA4, E2F3 | angiogenesis (+); proliferation (+), migration (+), invasion (+) | ceRNA(miR-340-5p); ceRNA(miR-195-5p); sponge | [ |
| TP53TG1 | NR_015381 | LPH01569A | TP53 Target 1 | PTEN | cisplatin sensitivity (+); apoptosis (+) | ceRNA(miR-18a); sponge | [ |
| C14orf132 | ENST00000556728 | LPH41601A | Chromosome 14 Open Reading Frame 132 | downregulated in NSCLC | [ | ||
| LINC00968 | ENST00000499425 | LPH15879A | Long Intergenic Non-Protein Coding RNA 968 | CPEB3; SMAD7 | cell migration (−); colony formation (−); EMT (−), metastasis (−) | ceRNA(miR-9-5p); ceRNA(miR-21-5p); sponge | [ |
| LINC00312 | NR_024065 | LPH13268A | Long Intergenic Non-Protein Coding RNA 312 | YBX1; HOXA5 | Migration (+); invasion (+); VM (+); cell proliferation (+); apoptosis (+) | interact with protein | [ |
| TP73-AS1 | NR_033708 | LPH06577A | TP73 Antisense RNA 1 | EZH2; LAPTM4B | cancer progression (+) | ceRNA(miR-449a); ceRNA(miR-27b-3p) | [ |
| AFAP1-AS1 | ENST00000608442 | LPH27887A | AFAP1 Antisense RNA 1 | CDKN1A | cell proliferation (+); prognosis (−) | epigenetic regulation | [ |
| LOC730101 | ENST00000462729 | LPH03357A | Uncharacterized LOC730101 | CCND1, CCNE1, β-catenin | progression of cell cycle (+); cell proliferation (+); growth (+) | [ |
Characteristic of patients including age, overall survival (OS), recurrence-free survival (RFS), and pack-years. Non-small cell lung cancer (NSCLC), squamous cell carcinoma (LUSC), adenocarcinoma (LUAD).
|
|
|
|
|
| |
| age | 92 | 65.04 | 7.61 | 61.00 | 70.00 |
| rfs | 92 | 16.18 | 8.73 | 9.00 | 23.50 |
| os | 92 | 17.55 | 8.24 | 11.00 | 24.00 |
| pack-years | 92 | 39.20 | 21.36 | 29.00 | 49.00 |
|
|
|
|
|
| |
| age | 54 | 65.33 | 7.28 | 61.00 | 69.00 |
| rfs | 54 | 16.37 | 8.12 | 10.25 | 22.75 |
| os | 54 | 18.20 | 7.28 | 12.00 | 24.00 |
| pack-years | 54 | 40.31 | 20.39 | 28.50 | 48.75 |
|
|
|
|
|
| |
| age | 38 | 64.63 | 8.06 | 60.25 | 70.00 |
| rfs | 38 | 15.92 | 9.54 | 9.00 | 24.00 |
| os | 38 | 16.63 | 9.38 | 9.25 | 24.00 |
| pack-years | 38 | 37.61 | 22.61 | 30.25 | 49.00 |
Expression of 14 lncRNA in tumor tissue vs. normal tissue; (A) non-small cell lung cancer (NSCLC); (B) squamous cell lung carcinoma (LUSC); (C) lung adenocarcinoma (LUAD).
|
|
|
|
| ||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| HAGLR | 92 | −0.73 | 2.04 | −2.32 | 0.54 | −3.73 | 2.06 | −4.54 | −2.68 | <0.003 | 2.99 | 2.69 | 2.43 | 3.55 | down |
| ADAMTS9-AS2 | 92 | 0.37 | 2.2 | −1.08 | 2.02 | −4.36 | 1.73 | −5.07 | −3.29 | <0.003 | 4.73 | 2.85 | 4.14 | 5.32 | down |
| LOC344887 | 92 | −0.25 | 3.11 | −2.61 | 2.06 | 1.11 | 1.59 | 0.34 | 2.12 | <0.003 | −1.37 | 3.22 | −2.03 | −0.7 | up |
| SOX2-OT | 92 | 3.14 | 3.1 | 1.65 | 5.43 | 3.72 | 1.77 | 2.68 | 4.92 | 0.1604 | −0.58 | 3.17 | −1.24 | 0.08 | up |
| LINC00261 | 92 | 4.24 | 4.11 | 0.94 | 6.97 | −1.86 | 2.17 | −2.81 | −0.46 | <0.003 | 6.11 | 4.89 | 5.09 | 7.12 | down |
| LINC00673 | 92 | −0.33 | 1.49 | −1.16 | 0.63 | 1.34 | 1.33 | 0.5 | 2.11 | <0.003 | −1.67 | 1.93 | −2.07 | −1.27 | up |
| MCM3AP-AS1 | 92 | 0.4 | 1.36 | −0.42 | 1.22 | −1.04 | 1.7 | −1.92 | 0.18 | <0.003 | 1.44 | 2.26 | 0.97 | 1.91 | down |
| TP53TG1 | 92 | −1.75 | 1.31 | −2.58 | −0.83 | −2.84 | 2.07 | −3.57 | −1.59 | <0.003 | 1.09 | 2.23 | 0.63 | 1.56 | down |
| C14orf132 | 92 | 6.66 | 2.66 | 4.96 | 7.77 | 2.01 | 2.19 | 0.92 | 3.18 | <0.003 | 4.64 | 3.25 | 3.97 | 5.32 | down |
| LINC00968 | 92 | 2.87 | 2.18 | 1.24 | 4.48 | −2.54 | 2.1 | −3.33 | −1.5 | <0.003 | 5.42 | 2.88 | 4.82 | 6.01 | down |
| LINC00312 | 92 | 0.82 | 1.56 | −0.18 | 1.88 | −2.91 | 1.77 | −3.92 | −1.75 | <0.003 | 3.73 | 2.35 | 3.24 | 4.21 | down |
| TP73-AS1 | 92 | −0.15 | 1.52 | −1.21 | 0.92 | −1.88 | 1.51 | −2.67 | −0.9 | <0.003 | 1.73 | 1.96 | 1.33 | 2.14 | down |
| AFAP1-AS1 | 92 | 1.33 | 2.37 | −0.57 | 3.07 | 1.26 | 1.51 | 0.48 | 2.43 | 0.7688 | 0.07 | 2.81 | −0.51 | 0.65 | down |
| LOC730101 | 92 | −0.38 | 1.48 | −1.28 | 0.55 | −0.37 | 1.79 | −1.37 | 0.83 | 0.9488 | −0.01 | 2.36 | −0.49 | 0.48 | up |
|
|
|
|
| ||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| HAGLR | 54 | −0.33 | 1.99 | −1.87 | 0.92 | −4.18 | 2.27 | −4.94 | −2.93 | <0.003 | 3.84 | 2.47 | 2.05 | 5.56 | down |
| ADAMTS9-AS2 | 54 | 0.97 | 2.28 | −0.48 | 2.53 | −4.72 | 1.97 | −5.39 | −3.46 | <0.003 | 5.7 | 2.9 | 4.03 | 7.27 | down |
| LOC344887 | 54 | −1.16 | 3.16 | −3.74 | 1.12 | 1.11 | 1.57 | 0.41 | 2.2 | <0.003 | −2.26 | 3.24 | −5.06 | 0.28 | up |
| SOX2-OT | 54 | 2 | 3.35 | 0.43 | 4.64 | 3.36 | 1.93 | 2.22 | 4.44 | <0.003 | −1.36 | 3.55 | −3.17 | 0.97 | up |
| LINC00261 | 54 | 5.49 | 3.82 | 3.41 | 8.02 | −2.28 | 2.43 | −3.58 | −0.6 | <0.003 | 7.78 | 4.6 | 4.34 | 10.15 | down |
| LINC00673 | 54 | 0.02 | 1.37 | −0.71 | 0.85 | 1.19 | 1.32 | 0.51 | 1.88 | <0.003 | −1.18 | 1.75 | −2.4 | 0.07 | up |
| MCM3AP-AS1 | 54 | 0.79 | 1.38 | −0.01 | 1.44 | −1.35 | 1.86 | −2.01 | −0.17 | <0.003 | 2.14 | 2.29 | 0.63 | 3.54 | down |
| TP53TG1 | 54 | −1.65 | 1.3 | −2.39 | −0.79 | −3.24 | 2.33 | −4.13 | −1.98 | <0.003 | 1.59 | 2.37 | −0.07 | 3.08 | down |
| C14orf132 | 54 | 6.69 | 2.49 | 5.05 | 7.72 | 1.56 | 2.44 | 0.41 | 2.77 | <0.003 | 5.13 | 3.05 | 3.11 | 6.63 | down |
| LINC00968 | 54 | 3.21 | 2.34 | 1.26 | 4.92 | −2.98 | 2.41 | −3.44 | −1.73 | <0.003 | 6.19 | 3.04 | 4.04 | 8.12 | down |
| LINC00312 | 54 | 1.09 | 1.55 | 0.08 | 2.33 | −3.35 | 1.96 | −4.26 | −1.98 | <0.003 | 4.44 | 2.48 | 2.78 | 5.73 | down |
| TP73-AS1 | 54 | 0.26 | 1.61 | −0.79 | 1.22 | −2.17 | 1.6 | −2.94 | −1.12 | <0.003 | 2.43 | 1.91 | 1.26 | 3.99 | down |
| AFAP1-AS1 | 54 | 2.38 | 1.96 | 1.26 | 3.6 | 0.98 | 1.64 | 0.36 | 1.87 | <0.003 | 1.39 | 2.35 | −0.35 | 2.79 | down |
| LOC730101 | 54 | −0.34 | 1.64 | −1.28 | 0.76 | −0.7 | 1.94 | −1.62 | 0.62 | 0.3175 | 0.35 | 2.55 | −1.16 | 1.79 | down |
|
|
|
|
| ||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| HAGLR | 38 | −1.3 | 1.99 | −2.66 | −0.4 | −3.08 | 1.51 | −4.09 | −2.21 | <0.003 | 1.78 | 2.54 | 0.06 | 2.34 | down |
| ADAMTS9-AS2 | 38 | −0.49 | 1.77 | −1.73 | 0.66 | −3.85 | 1.17 | −4.76 | −3.05 | <0.003 | 3.37 | 2.15 | 1.89 | 5.05 | down |
| LOC344887 | 38 | 1.03 | 2.58 | −1.09 | 2.39 | 1.12 | 1.63 | 0.33 | 1.7 | 0.7548 | −0.09 | 2.77 | −2.17 | 1.76 | up |
| SOX2-OT | 38 | 4.75 | 1.76 | 3.6 | 6.31 | 4.22 | 1.38 | 3.35 | 5.27 | 0.2541 | 0.52 | 2.12 | −0.98 | 1.67 | down |
| LINC00261 | 38 | 2.47 | 3.88 | 0.07 | 3.51 | −1.26 | 1.57 | −2.42 | −0.39 | <0.003 | 3.73 | 4.33 | 0.7 | 5.52 | down |
| LINC00673 | 38 | −0.82 | 1.52 | −1.86 | −0.22 | 1.54 | 1.33 | 0.54 | 2.41 | <0.003 | −2.36 | 1.98 | −3.55 | −0.86 | up |
| MCM3AP-AS1 | 38 | −0.15 | 1.16 | −0.84 | 0.41 | −0.6 | 1.36 | −1.83 | 0.48 | 0.1378 | 0.45 | 1.81 | −0.65 | 1.64 | down |
| TP53TG1 | 38 | −1.9 | 1.32 | −2.7 | −1.02 | −2.29 | 1.47 | −3.49 | −1.38 | 0.1249 | 0.39 | 1.82 | −0.7 | 1.62 | down |
| C14orf132 | 38 | 6.61 | 2.92 | 4.93 | 8.33 | 2.65 | 1.58 | 1.87 | 3.39 | <0.003 | 3.96 | 3.43 | 1.38 | 7.06 | down |
| LINC00968 | 38 | 2.39 | 1.87 | 1.29 | 3.58 | −1.92 | 1.35 | −2.93 | −0.84 | <0.003 | 4.31 | 2.26 | 2.9 | 5.63 | down |
| LINC00312 | 38 | 0.44 | 1.52 | −0.32 | 1.1 | −2.27 | 1.23 | −3.14 | −1.29 | <0.003 | 2.71 | 1.7 | 1.49 | 3.72 | down |
| TP73-AS1 | 38 | −0.74 | 1.15 | −1.69 | 0.08 | −1.48 | 1.27 | −2.38 | −0.67 | <0.003 | 0.75 | 1.58 | −0.08 | 1.91 | down |
| AFAP1-AS1 | 38 | −0.16 | 2.09 | −1.85 | 0.96 | 1.64 | 1.24 | 0.87 | 2.57 | <0.003 | −1.8 | 1.58 | −3.38 | 0.18 | up |
| LOC730101 | 38 | −0.42 | 1.25 | −1.18 | 0.5 | 0.09 | 1.44 | −0.96 | 1.13 | 0.0951 | −0.51 | 1.98 | −0.42 | 1.35 | up |
Figure 1Visualization of differences in the delta Ct distributions across all lncRNAs. Additional numbers indicate the IDs of the furthest outlier patients in data: (A) Differences between the cancerous and noncancerous lung tissue. Two patients with extreme delta Ct values of multiple lncRNAs are observed (60 and 71); (B) Differences between the LUSC and LUAD subtypes of the tumor tissue. No patients with multiple extreme delta Ct values are observed.
Expression of 14 lncRNA in LUAD tumor vs. LUSC tumor.
| lncRNA Delta Ct LUSC Tumor | lncRNA Delta Ct LUAD Tumor | lncRNA Delta Delta Ct | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N | Mean | SD | 0.25 | 0.75 | N | Mean | SD | 0.25 | 0.75 | Mean | 0.25 | 0.75 | Expression Profile LUSC | Expression Profile LUAD | ||
| HAGLR | 54 | −0.34 | 2.00 | −0.88 | 0.21 | 38 | −1.30 | 2.00 | −1.96 | −0.65 | <0.003 | 0.97 | 0.13 | 1.81 | down | up |
| ADAMTS9-AS2 | 54 | 0.97 | 2.28 | 0.35 | 1.60 | 38 | −0.49 | 1.77 | −1.07 | 0.97 | <0.003 | 1.46 | 0.58 | 2.34 | down | up |
| LOC344887 | 54 | −1.16 | 3.16 | −2.02 | −0.30 | 38 | 1.03 | 2.58 | 0.18 | 1.87 | <0.003 | −2.18 | −3.41 | 0.95 | up | down |
| SOX2-OT | 54 | 2.00 | 3.35 | 1.09 | 2.92 | 38 | 4.75 | 1.76 | 4.17 | 5.33 | <0.003 | −2.74 | −3.92 | −1.56 | up | down |
| LINC00261 | 54 | 5.49 | 3.82 | 4.45 | 6.54 | 38 | 2.47 | 3.88 | 1.19 | 3.74 | <0.003 | 3.03 | 1.41 | 4.64 | down | up |
| LINC00673 | 54 | 0.02 | 1.38 | −0.36 | 0.39 | 38 | −0.82 | 1.52 | −1.32 | −0.32 | <0.003 | 0.84 | 0.23 | 1.44 | down | up |
| MCM3AP-AS1 | 54 | 0.78 | 1.38 | 0.41 | 1.16 | 38 | −0.15 | 1.16 | −0.53 | 0.23 | <0.003 | 0.93 | 0.39 | 1.48 | down | up |
| TP53TG1 | 54 | −1.65 | 1.30 | −2.00 | −1.29 | 38 | 1.90 | 1.32 | −2.33 | −1.46 | 0.2105 | 0.25 | −0.30 | 0.80 | down | up |
| C14orf132 | 54 | 6.68 | 2.49 | 6.00 | 7.36 | 38 | 6.61 | 2.92 | 5.66 | 7.57 | 0.5145 | 0.07 | −1.05 | 1.20 | down | up |
| LINC00968 | 54 | 3.21 | 2.34 | 2.57 | 3.85 | 38 | 2.39 | 1.87 | 1.78 | 3.01 | 0.0916 | 0.82 | −0.91 | 1.72 | down | up |
| LINC00312 | 54 | 1.09 | 1.55 | 0.67 | 1.52 | 38 | 0.43 | 1.52 | −0.06 | 0.94 | <0.003 | 0.66 | 0.01 | 1.31 | down | up |
| TP73-AS1 | 54 | 0.26 | 1.61 | −0.18 | 0.70 | 38 | −0.74 | 1.15 | −1.12 | −0.37 | <0.003 | 1.00 | 0.40 | 1.61 | down | up |
| AFAP1-AS1 | 54 | 2.38 | 1.96 | 1.84 | 2.91 | 38 | −0.16 | 2.09 | −0.85 | 0.53 | <0.003 | 2.54 | 1.69 | 3.38 | down | up |
| LOC730101 | 54 | −0.35 | 1.64 | −0.79 | 0.10 | 38 | −0.42 | 1.25 | −0.83 | −0.01 | 0.8249 | 0.08 | −0.55 | 0.70 | down | up |
Expression of lncRNAs and clinicopathological parameters of NSCLC patients.
| Symbol | N | Sex | Histology | Recurrence | Death | Smoking | TNM |
|---|---|---|---|---|---|---|---|
| HAGLR | 92 | 0.2 | <0.003 | 0.76 | 0.2 | 0.34 | 0.07 |
| ADAMTS9-AS2 | 92 | 0.14 | <0.003 | 0.77 | 0.21 | 0.19 | 0.26 |
| LOC344887 | 92 | 0.72 | <0.003 | 0.13 | <0.003 | 0.06 | 0.18 |
| SOX2-OT | 92 | 0.34 | <0.003 | 0.33 | 0.51 | <0.003 | 0.22 |
| LINC00261 | 92 | 0.19 | <0.003 | 0.79 | 0.09 | 0.09 | 0.38 |
| LINC00673 | 92 | <0.003 | <0.003 | 0.36 | 0.09 | 0.7 | 0.25 |
| MCM3AP-AS1 | 92 | <0.003 | <0.003 | 0.15 | <0.003 | 0.49 | 0.15 |
| TP53TG1 | 92 | 0.15 | <0.003 | 0.23 | 0.06 | 0.9 | 0.08 |
| C14orf132 | 92 | <0.003 | 0.07 | 0.56 | 0.17 | 0.9 | 0.17 |
| LINC00968 | 92 | 0.08 | <0.003 | 0.29 | 0.06 | 0.47 | 1 |
| LINC00312 | 92 | 0.12 | <0.003 | 0.1 | 0.1 | 0.78 | 0.91 |
| TP73-AS1 | 92 | 0.07 | <0.003 | 0.2 | 0.05 | 0.78 | 0.1 |
| AFAP1-AS1 | 92 | 0.15 | <0.003 | 0.4 | 0.26 | 0.09 | 0.32 |
| LOC730101 | 92 | 0.19 | <0.003 | 0.06 | <0.003 | 0.11 | 0.2 |
Figure 2Metrics for logistic regression classifier for differentiating normal tissue from tumor tissue and LUSC tissue from LUAD tissue: (A) Mean ROC ± SD curve, and AUC for logistic regression classifier for differentiating normal tissue from cancer tissue; (B) Mean and 95% CI for precision, recall, f1-score, and AUC metrics for logistic regression classifier for differentiating normal tissue from cancer tissue; (C) Mean ROC ± SD curve, and mean AUC for logistic regression classifier for differentiating LUSC tissue from LUAD tissue; (D) Mean and 95% CI of precision, recall, f1-score metrics, and AUC for logistic regression classifier for differentiating LUSC tissue from LUAD tissue.
Figure 3Metrics for gradient boosting decision tree and simplified logistic regression classifiers for differentiating LUSC tissue from LUAD tissue: (A) mean ROC ± SD curve, and mean AUC for the two classifiers; (B) mean and 95% CI of precision, recall, f1-score metrics, and AUC; (C) summary table with coefficients and significance for the simplified logistic regression classifier; GBDT—gradient boosting decision tree, LR(S)—logistic regression (simplified).
Figure 4Shapley Additive Explanation for the gradient boosting decision tree classifier. The y-axis indicates a ranking of variables, Ct and delta Ct values of lncRNAs, sorted from the most important in the model (top) to the least important (bottom). The x-axis indicates an impact of a given variable on the model’s predictions; the SHAP values are sorted from the negative impact leading towards the LUAD subtype (class 0 on the left) to the positive impact leading towards the LUSC subtype (class 1 on the right). There are 92 points per row, one point per patient, where each indicates an attribution of a given variable to the probability model output. The color-axis indicates the variables’ values from low with blue to high with red. The visible distinction in colors between negative and positive SHAP values might be viewed as indicating a significant expression profile (up or down).