| Literature DB >> 33968062 |
Di Jiang1,2,3, Xue Zhang1,2,3, Man Liu1,2,3, Yulin Wang1,2,3, Tingting Wang4, Lu Pei5, Peng Wang3,6, Hua Ye3,6, Jianxiang Shi1,3, Chunhua Song3,6, Kaijuan Wang3,6, Xiao Wang1,3, Liping Dai1,2,3, Jianying Zhang1,3.
Abstract
Substantial studies indicate that autoantibodies to tumor-associated antigens (TAAbs) arise in early stage of lung cancer (LC). However, since single TAAbs as non-invasive biomarkers reveal low diagnostic performances, a panel approach is needed to provide more clues for early detection of LC. In the present research, potential TAAbs were screened in 150 serum samples by focused protein array based on 154 proteins encoded by cancer driver genes. Indirect enzyme-linked immunosorbent assay (ELISA) was used to verify and validate TAAbs in two independent datasets with 1,054 participants (310 in verification cohort, 744 in validation cohort). In both verification and validation cohorts, eight TAAbs were higher in serum of LC patients compared with normal controls. Moreover, diagnostic models were built and evaluated in the training set and the test set of validation cohort by six data mining methods. In contrast to the other five models, the decision tree (DT) model containing seven TAAbs (TP53, NPM1, FGFR2, PIK3CA, GNA11, HIST1H3B, and TSC1), built in the training set, yielded the highest diagnostic value with the area under the receiver operating characteristic curve (AUC) of 0.897, the sensitivity of 94.4% and the specificity of 84.9%. The model was further assessed in the test set and exhibited an AUC of 0.838 with the sensitivity of 89.4% and the specificity of 78.2%. Interestingly, the accuracies of this model in both early and advanced stage were close to 90%, much more effective than that of single TAAbs. Protein array based on cancer driver genes is effective in screening and discovering potential TAAbs of LC. The TAAbs panel with TP53, NPM1, FGFR2, PIK3CA, GNA11, HIST1H3B, and TSC1 is excellent in early detection of LC, and they might be new target in LC immunotherapy.Entities:
Keywords: autoantibody; diagnostic model; lung cancer; protein array; tumor-associated antigen
Year: 2021 PMID: 33968062 PMCID: PMC8102818 DOI: 10.3389/fimmu.2021.658922
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Characteristics of populations in this study.
| 100 | 50 | 155 | 155 | 300 | 144 | 300 | |
| Mean ± SD (years) | 61 ± 11 | 40 ± 13 | 61 ± 10 | 60 ± 11 | 61 ± 11 | 60 ± 10 | 57 ± 11 |
| Range (years) | 26–85 | 20–71 | 30–83 | 28–81 | 26–87 | 29–85 | 25–89 |
| Male | 66 (66.0) | 23 (46.0) | 116 (74.8) | 116 (74.8) | 185 (61.7) | 103 (71.5) | 156 (52.0) |
| Female | 34 (34.0) | 27 (54.0) | 39 (25.2) | 39 (25.2) | 115 (38.3) | 41 (28.5) | 144 (48.0) |
| Yes | 45 (45.0) | 98 (63.2) | 111 (37.0) | 78 (54.2) | |||
| No | 55 (55.0) | 57 (36.8) | 178 (59.3) | 66 (45.8) | |||
| Unknown | 0 (0.0) | 0 (0.0) | 11 (3.7) | 0 (0.0) | |||
| Yes | 26 (26.0) | 45 (29.0) | 54 (18.0) | 36 (25.0) | |||
| No | 74 (74.0) | 110 (71.0) | 233 (77.7) | 108 (75.0) | |||
| Unknown | 0 (0.0) | 0 (0.0) | 13 (4.3) | 0 (0.0) | |||
| Yes | 12 (12.0) | 28 (18.1) | 22 (7.3) | 18 (12.5) | |||
| No | 88 (88.0) | 127 (81.9) | 263 (87.7) | 126 (87.5) | |||
| Unknown | 0 (0.0) | 0 (0.0) | 15 (5.0) | 0 (0.0) | |||
| Stage I | 18 (18.0) | 11 (7.1) | 51 (17.0) | ||||
| Stage II | 12 (12.0) | 11 (7.1) | 12 (4.0) | ||||
| Stage III | 33 (33.0) | 58 (37.4) | 44 (14.7) | ||||
| Stage IV | 37 (37.0) | 60 (38.7) | 81 (27.0) | ||||
| Unknown | 0 (0.0) | 15 (9.7) | 112 (37.3) | ||||
| SCC | 31 (31.0) | 42 (27.1) | 64 (21.3) | ||||
| AD | 68 (68.0) | 58 (37.4) | 177 (59.0) | ||||
| SCLC | 0 (0.0) | 43 (27.7) | 32 (10.7) | ||||
| Others | 1 (1.0) | 12 (7.8) | 15 (5.0) | ||||
| Unknown | 0 (0.0) | 0 (0.0) | 12 (4.0) | ||||
| ≤ 5 cm | 60 (60.0) | 59 (38.1) | 126 (42.0) | ||||
| >5 cm | 40 (40.0) | 80 (51.6) | 79 (26.3) | ||||
| Unknown | 0 (0.0) | 16 (10.3) | 95 (31.7) | ||||
| Yes | 69 (69.0) | 99 (63.9) | 124 (41.3) | ||||
| No | 31 (31.0) | 41 (26.4) | 72 (24.0) | ||||
| Unknown | 0 (0.0) | 15 (9.7) | 104 (34.7) | ||||
| Yes | 38 (38.0) | 61 (39.4) | 109 (36.3) | ||||
| No | 62 (62.0) | 79 (50.9) | 112 (37.4) | ||||
| Unknown | 0 (0.0) | 15 (9.7) | 79 (26.3) | ||||
| COPD | 72 (50.0) | ||||||
| Chronic bronchitis | 72 (50.0) | ||||||
AD, adenocarcinoma; BLD, benign lung disease; COPD, chronic obstructive pulmonary disease; LC, lung cancer; NC, normal control; SCC, squamous cell carcinoma; SCLC, small cell lung cancer; SD, standard deviation.
Figure 1Overall study design.
Figure 2Protein array customization and preliminary results. (A) Protein array layout. (B) The operation process and principle of the protein array. (C,D) Protein fluorescence quantification results of LC and NC, respectively. The red and blue frames highlight the positive control (anti-human IgG) and negative control (buffer). (E) The result of repeated experiments by the same serum sample. The lower left showed the distribution of the results after linear fitting, and the upper right showed correlation results between samples after linear fitting (***P < 0.001). The middle graph was the cumulative density distribution of a single sample.
Figure 3(A) SNR of autoantibodies against 12 TAAs in discovery cohort with 100 LCs and 50 NCs. (B) ROC analysis of autoantibodies against 12 TAAs for LC detection in discovery cohort. C, cancer; N, normal; ***P < 0.001; **P < 0.01; *P < 0.05.
Figure 4(A) The expression of autoantibodies against eight TAAs in validation cohort with 300 LCs, 144 BLDs, and 300 NCs. (B) ROC analysis of autoantibodies against eight TAAs for LC and NC groups in validation cohort. C, cancer; B, benign; N, normal; ***P < 0.001; **P < 0.01; *P < 0.05.
Figure 5ROC analysis of multiple models for the differential diagnosis of LCs and NCs in training set and test set of validation cohort.
The performance of multiple models in training set and test set for lung cancer detection.
| Fisher | 5 | <0.0001 | 0.753 | 64.2 | 86.4 | 74.9 | <0.0001 | 0.732 | 55.3 | 91.1 | 74.7 |
| Logistic | 6 | <0.0001 | 0.794 | 74.4 | 84.4 | 79.2 | <0.0001 | 0.776 | 67.1 | 88.1 | 78.5 |
| DT C5.0 | 7 | <0.0001 | 0.897 | 94.4 | 84.9 | 89.9 | <0.0001 | 0.838 | 89.4 | 78.2 | 83.3 |
| MLP | 8 | <0.0001 | 0.824 | 75.4 | 89.5 | 82.1 | <0.0001 | 0.738 | 63.5 | 84.2 | 74.7 |
| RBF | 8 | <0.0001 | 0.761 | 81.9 | 70.4 | 76.3 | <0.0001 | 0.716 | 72.9 | 70.3 | 71.5 |
| SVM | 8 | <0.0001 | 0.792 | 73.5 | 84.9 | 79.0 | <0.0001 | 0.742 | 61.2 | 87.1 | 75.3 |
AUC, area under the receiver operating characteristic curve; CI, confidence interval; DT C5.0, Decision Tree C5.0; Fisher, Fisher discriminant analysis; Logistic, Logistic regression analysis; MLP, multilayer perception; RBF, radial basis function; SVM, support vector machines; TAAbs, autoantibodies to tumor-associated antigens; 5 TAAbs, TP53, NPM1, FGFR2, GNA11, and HIST1H3B; 6 TAAbs, TP53, NPM1, FGFR2, PIK3CA, GNA11, and HIST1H3B; 7 TAAbs, TP53, NPM1, FGFR2, PIK3CA, GNA11, HIST1H3B, and TSC1; 8 TAAbs, TP53, NPM1, SRSF2, FGFR2, PIK3CA, GNA11, HIST1H3B, and TSC1.
The diagnostic performance of DT 5.0 model and the seven TAAbs in early and late stage LC.
| TP53 | 0.840 (0.782–0.898) | 0.000 | 48.6 | 92.7 | 0.413 | 86.89 | 64.33 | 70.64 |
| NPM1 | 0.837 (0.778–0.897) | 0.000 | 48.6 | 94.0 | 0.426 | 89.01 | 64.65 | 71.31 |
| GNA11 | 0.733 (0.672–0.793) | 0.000 | 26.4 | 95.3 | 0.217 | 84.97 | 56.43 | 60.86 |
| HIST1H3B | 0.567 (0.484–0.650) | 0.078 | 13.9 | 95.3 | 0.092 | 74.85 | 52.54 | 54.61 |
| FGFR2 | 0.639 (0.558–0.719) | 0.000 | 15.3 | 94.0 | 0.093 | 71.80 | 52.60 | 54.64 |
| TSC1 | 0.749 (0.683–0.816) | 0.000 | 18.1 | 92.0 | 0.101 | 69.30 | 52.89 | 55.03 |
| PIK3CA | 0.668 (0.592–0.744) | 0.000 | 15.3 | 93.3 | 0.086 | 69.62 | 52.42 | 54.31 |
| DT C5.0 | 0.886 (0.845–0.926) | 0.000 | 94.4 | 82.7 | 0.771 | 84.49 | 93.70 | 88.56 |
| TP53 | 0.710 (0.651–0.769) | 0.000 | 35.5 | 92.7 | 0.281 | 82.86 | 58.95 | 64.06 |
| NPM1 | 0.707 (0.650–0.764) | 0.000 | 27.0 | 94.0 | 0.210 | 81.79 | 56.27 | 60.48 |
| GNA11 | 0.727 (0.679–0.774) | 0.000 | 19.1 | 95.3 | 0.145 | 80.41 | 54.11 | 57.24 |
| HIST1H3B | 0.565 (0.506–0.624) | 0.027 | 9.2 | 95.3 | 0.046 | 66.39 | 51.22 | 52.28 |
| FGFR2 | 0.509 (0.448–0.571) | 0.750 | 7.8 | 91.0 | −0.012 | 46.43 | 49.67 | 49.40 |
| TSC1 | 0.641 (0.582–0.701) | 0.000 | 9.2 | 92.0 | 0.012 | 53.54 | 50.33 | 50.61 |
| PIK3CA | 0.576 (0.516–0.636) | 0.010 | 14.9 | 90.0 | 0.049 | 59.82 | 51.40 | 52.45 |
| DT C5.0 | 0.864 (0.826–0.902) | 0.000 | 90.1 | 82.7 | 0.727 | 83.86 | 89.28 | 86.37 |
AUC, area under the receiver operating characteristic curve; CI, confidence interval; DT C5.0, Decision Tree C5.0; LC, lung cancer; NPV, negative predictive value; PPV, positive predictive value; TAAbs, autoantibodies to tumor-associated antigens; YI, Youden's Index.