| Literature DB >> 29021294 |
Jianbo Pan1, Guang Song2, Dunyan Chen3,4, Yadong Li3,4, Shuang Liu2, Shaohui Hu5, Christian Rosa5, Daniel Eichinger5, Ignacio Pino5, Heng Zhu6,7, Jiang Qian8,7, Yi Huang9,4.
Abstract
Lung cancer (LC) remains the leading cause of mortality from malignant tumors worldwide. Currently, a lack of serological biomarkers for early LC diagnosis is a major roadblock for early intervention and prevention of LC. To undertake this challenge, we employed a two-phase strategy to discover and validate a biomarker panel using a protein array-based approach. In Phase I, we obtained serological autoimmune profiles of 80 LC patients and 20 healthy subjects on HuProt arrays, and identified 170 candidate proteins significantly associated with LC. In Phase II, we constructed a LC focused array with the 170 proteins, and profiled a large cohort, comprised of 352 LC patients, 93 healthy individuals, and 101 patients with lung benign lesions (LBL). The comparison of autoimmune profiles between the early stage LC and the combined group of healthy and LBL allowed us to identify and validate a biomarker panel of p53, HRas, and ETHE1 for diagnosis of early stage LC with 50% sensitivity at >90% specificity. Finally, the performance of this biomarker panel was confirmed in ELISA tests. In summary, this study represents one of the most comprehensive proteome-wide surveys with one of the largest (i.e. 1,101 unique samples) and most diverse (i.e. nine disease groups) cohorts, resulting in a biomarker panel with good performance.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29021294 PMCID: PMC5724172 DOI: 10.1074/mcp.RA117.000212
Source DB: PubMed Journal: Mol Cell Proteomics ISSN: 1535-9476 Impact factor: 5.911
Fig. 1.Overall study design.
Characteristics of the samples in Phase I
| Variable | LC ( | Healthy ( | |||||
|---|---|---|---|---|---|---|---|
| No. | Mean | % | No. | Mean | % | ||
| Age (years) | 0.086 | ||||||
| Mean | 60.4 | 56.4 | |||||
| Standard deviation | 8.5 | 11.1 | |||||
| Sex | 0.223 | ||||||
| Male | 66 | 82.5 | 14 | 70.0 | |||
| Female | 14 | 17.5 | 6 | 30.0 | |||
| Smoking history (pack-years) | 1 | ||||||
| 0 | 22 | 27.5 | 5 | 25.0 | |||
| <20 | 16 | 20.0 | 6 | 30.0 | |||
| ≥20 | 42 | 52.5 | 9 | 45.0 | |||
| Type | |||||||
| Small Cell Lung Cancer | 20 | 25.0 | |||||
| Large Cell Lung Cancer | 13 | 16.3 | |||||
| Adenocarcinoma | 24 | 30.0 | |||||
| Squamous Cell Carcinoma | 23 | 28.8 | |||||
Fig. 2.Serum profiling assays on HuProt arrays in Phase I. A. The left image represents serum profiling results from a portion of a HuProt array probed with Cy5- and Cy3-labeled anti-human IgG and -IgM antibodies, respectively. The right panels show two examples of positive proteins, in which p53 was only recognized by IgG antibodies of a LC patient and YARS only recognized by IgM antibodies of a LC patient. B. Scatter plot analysis of sensitivity and specificity for all the proteins on the HuProt arrays in the anti-IgG (left) and anti-IgM (right) channels. Each dot represents a protein. Red dots represent those that were selected for Phase II studies.
Characteristics of the samples in Phase II
| Variable | Early LC ( | Healthy ( | LBL ( | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| No. | Mean | % | No. | Mean | % | No. | Mean | % | ||
| Age (years) | 0.165 | |||||||||
| Mean | 61.2 | 58.3 | 61.1 | |||||||
| Standard deviation | 10.0 | 8.4 | 8.8 | |||||||
| Sex | 0.102 | |||||||||
| Male | 101 | 77.1 | 64 | 68.8 | 69 | 68.3 | ||||
| Female | 30 | 22.9 | 29 | 31.2 | 32 | 31.7 | ||||
| Smoking history (pack-years) | 0.108 | |||||||||
| 0 | 24 | 18.3 | 23 | 24.7 | 28 | 27.7 | ||||
| <20 | 29 | 22.1 | 24 | 25.8 | 21 | 20.8 | ||||
| ≥20 | 78 | 59.6 | 46 | 49.5 | 52 | 51.5 | ||||
| Type | ||||||||||
| Small Cell Lung Cancer | 30 | 22.9 | ||||||||
| Adenocarcinoma | 55 | 42.0 | ||||||||
| Squamous Cell Carcinoma | 46 | 35.1 | ||||||||
Performance of eight biomarkers in discovery and validation stages of Phase II
| Protein | Discovery | Validation | ||||
|---|---|---|---|---|---|---|
| AUC | Cut off | Sensitivity | Specificity | Sensitivity | Specificity | |
| p53 | 0.809 | 1.209 | 24.1% | 93.8% | 22.7% | 96.9% |
| ETHE1 | 0.785 | 1.861 | 32.2% | 91.5% | 29.5% | 93.8% |
| CTAG1A | 0.784 | 1.200 | 17.2% | 96.1% | 18.2% | 93.8% |
| C1QTNF1 | 0.763 | 1.577 | 26.4% | 90.7% | 22.7% | 93.8% |
| TEX264 | 0.759 | 2.088 | 23.0% | 92.2% | 20.5% | 93.8% |
| CLDN2 | 0.744 | 1.820 | 26.4% | 90.7% | 22.7% | 95.4% |
| NSG1 | 0.740 | 1.735 | 27.6% | 91.5% | 29.5% | 92.3% |
| HRas | 0.692 | 1.980 | 13.8% | 96.9% | 18.2% | 93.8% |
Fig. 3.Biomarker discovery and validation in Phase II studies. A, Examples of three proteins discovered and validated in Phase II. Left panels (Discovery): Box plots show that the signal intensities of the three proteins are significantly higher in the early LC group than the control group. The corresponding ROC curves are shown to the right of the box plots. Values of sensitivity and specificity obtained at the optimal cut off value for each protein are also shown. Right panels (Validation): Box plot analysis confirmed that the signal intensities of all three proteins are significantly higher in the early LC group than the control group. B, Performance of the top biomarker panel, comprised of p53, ETHE1, and HRas, in the discovery and validation stages. Orange and light blue lines represent samples scored as positives and negatives, respectively. A sample was predicted as LC positive when any one of the three proteins in this panel scored positive. Values of sensitivity and specificity in both discovery and validation stages are listed below. C, Performance of this biomarker panel in late LC and other types of cancer. Positive rates, defined as percentage of samples scored as positives in each disease category, are shown at the bottom.
Fig. 4.ELISA validation. A, Box plot analysis of ELISA data obtained with samples used in Phase II study (upper panel) and newly recruited samples (lower panel). p values were calculated between the early LC and control groups (i.e. healthy and LBL). B, Performance of this biomarker panel in early LC, late LC, healthy, and LBL groups. Positive rates of each category are shown at the bottom.