| Literature DB >> 29371917 |
Allison S Cohen1, Farah K Khalil2, Eric A Welsh3, Matthew B Schabath4, Steven A Enkemann5, Andrea Davis1, Jun-Min Zhou6, David C Boulware6, Jongphil Kim7,8, Eric B Haura9, David L Morse1,8,10.
Abstract
Lung cancer is the leading cause of cancer deaths in the United States. Novel lung cancer targeted therapeutic and molecular imaging agents are needed to improve outcomes and enable personalized care. Since these agents typically cannot cross the plasma membrane while carrying cytotoxic payload or imaging contrast, discovery of cell-surface targets is a necessary initial step. Herein, we report the discovery and characterization of lung cancer cell-surface markers for use in development of targeted agents. To identify putative cell-surface markers, existing microarray gene expression data from patient specimens were analyzed to select markers with differential expression in lung cancer compared to normal lung. Greater than 200 putative cell-surface markers were identified as being overexpressed in lung cancers. Ten cell-surface markers (CA9, CA12, CXorf61, DSG3, FAT2, GPR87, KISS1R, LYPD3, SLC7A11 and TMPRSS4) were selected based on differential mRNA expression in lung tumors vs. non-neoplastic lung samples and other normal tissues, and other considerations involving known biology and targeting moieties. Protein expression was confirmed by immunohistochemistry (IHC) staining and scoring of patient tumor and normal tissue samples. As further validation, marker expression was determined in lung cancer cell lines using microarray data and Kaplan-Meier survival analyses were performed for each of the markers using patient clinical data. High expression for six of the markers (CA9, CA12, CXorf61, GPR87, LYPD3, and SLC7A11) was significantly associated with worse survival. These markers should be useful for the development of novel targeted imaging probes or therapeutics for use in personalized care of lung cancer patients.Entities:
Keywords: cell-surface; lung cancer; molecular imaging; target biomarker; targeted therapeutics
Year: 2017 PMID: 29371917 PMCID: PMC5768334 DOI: 10.18632/oncotarget.23009
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Representative microarray mRNA expression profiles for four of the selected lung cancer cell-surface markers in patient specimens of normal lung, lung tumors and other normal tissues: CA12 (A), GPR87 (B), LYPD3 (C), and SLC7A11 (D). Values are presented as whisker/box plots with whiskers representing the full range of values, the bottom and top of the boxes represent the 25th and 75th percentile, and middle lines represent the median.
Adjusted p values by Dunnett’s multiple comparisons for lung tumor (control) versus normal tissues.
| Tissue type | CA9 | CA12 | CXorf61 | DSG3 | FAT2 | GPR87 | KISS1R | LYPD3 | SLC7A11 | TMPRSS4 |
|---|---|---|---|---|---|---|---|---|---|---|
| Normal Lung | ||||||||||
| Adrenal Gland | 0.15 | |||||||||
| Heart | 0.88 | 0.92 | ||||||||
| Kidney | 0.53 | 0.13 | >0.99 | |||||||
| Liver | ||||||||||
| Lymph Node | 0.96 | |||||||||
| Small Intestines | 0.13 | 0.13 | 0.18 | 0.85 | 0.14 |
Figure 2Representative microarray mRNA expression profiles for four of the selected markers in patient specimens of normal lung and lung tumors of various lung cancer histologies: CA9 (A), CXorf61 (B), DSG3 (C), and KISS1R (D). Values are presented as whisker/box plots with whiskers representing the full range of values, the bottom and top of the boxes represent the 25th and 75th percentile, and the middle lines represent the median.
Adjusted p values by Dunnett’s multiple comparisons for normal lung (control) versus lung cancer histologies
| Cancer type | CA9 | CA12 | CXorf61 | DSG3 | FAT2 | GPR87 | KISS1R | LYPD3 | SLC7A11 | TMPRSS4 |
|---|---|---|---|---|---|---|---|---|---|---|
| Squamous Cell Carcinoma | 0.29 | |||||||||
| Adenocarcinoma | 0.08 | 0.48 | ||||||||
| Large Cell Carcinoma | 0.32 | 0.67 | 0.95 | >0.99 | 0.95 | 0.55 | 0.57 | |||
| Other |
Figure 3Representative images of IHC stained patient lung tumor and normal lung tissue specimens from the tissue microarray (TMA) for half of the selected markers
A representative normal lung sample and representative lung tumor samples with scores of 0, 1+, 2+, and 3+ are shown for each marker. The images are taken at 10x magnification. *Protein expression is stained but gene names are used to conserve space.
Figure 4Representative images of IHC stained patient lung tumor and normal lung tissue specimens from the tissue microarray (TMA) for the remaining selected markers
A representative normal lung sample and representative lung tumor samples with scores of 0, 1+, 2+, and 3+ are shown for each marker. The images are taken at 10x magnification. *Protein expression is stained but gene names are used to conserve space.
IHC scoring of marker expression in normal lung and lung cancer patient tissue samples: markers with limited or no expression in normal lung
| Target* | Tissue type | Patient Tissue Samples ( | Pathology Score | Heterogeneity Score# (Average ± SD) | ||||
|---|---|---|---|---|---|---|---|---|
| 0 | 1+ | 2+ | 3+ | %≥1+ | ||||
| 8 | 0 | 0 | 0 | 0% | 100% ± 0% | |||
| 16 | 26 | 44 | 11 | 83% | 84% ± 26% | |||
| 8 | 0 | 0 | 0 | 0% | 100% ± 0% | |||
| 5 | 35 | 39 | 19 | 95% | 92% ± 17% | |||
| 8 | 0 | 0 | 0 | 0% | 100% ± 0% | |||
| 14 | 52 | 26 | 8 | 86% | 90% ± 22% | |||
| 8 | 0 | 0 | 0 | 0% | 100% ± 0% | |||
| 34 | 46 | 12 | 5 | 65% | 61% ± 36% | |||
| 8 | 0 | 0 | 0 | 0% | 100% ± 0% | |||
| 14 | 59 | 23 | 3 | 86% | 99% ± 5% | |||
| 7 | 0 | 1 | 0 | 12.5% | 5% | |||
| 12 | 39 | 37 | 12 | 88% | 83% ± 29% | |||
*Protein expression is scored but gene names are used to conserve space.
#Heterogeneity score indicates the average percentage of cell staining in samples that stained regardless of pathology score. For samples with pathology scores of 0 only, 100% heterogeneity score indicates uniformly unstained.
IHC scoring of marker expression in normal lung and lung cancer patient tissue samples: markers with some expression in normal lung
| Target* | Tissue type | Patient Tissue Samples ( | Pathology Score | Heterogeneity Score# (Average ± SD) | ||||
|---|---|---|---|---|---|---|---|---|
| 0 | 1+ | 2+ | 3+ | %≥2+ | ||||
| 4 | 2 | 0 | 2 | 25% | 88% ± 13% | |||
| 13 | 24 | 26 | 29 | 60% | 79% ± 32% | |||
| 4 | 4 | 0 | 0 | 0% | 100% ± 0% | |||
| 12 | 35 | 45 | 5 | 52% | 95% ± 17% | |||
| 1 | 7 | 0 | 0 | 0% | 100% ± 0% | |||
| 6 | 45 | 38 | 6 | 46% | 99% ± 5% | |||
| 2 | 6 | 0 | 0 | 0% | 100% ± 0% | |||
| 0 | 24 | 63 | 9 | 75% | 100% ± 0% | |||
*Protein expression is scored but gene names are used to conserve space.
#Heterogeneity score indicates the average percentage of cell staining in samples that stained regardless of pathology score. For samples with pathology scores of 0 only, 100% heterogeneity score indicates uniformly unstained.
IHC scoring of marker expression in lung cancer patient tissue samples analyzed by lung cancer histology sub-type: markers with limited or no expression in normal lung
| Target* | Tissue type | Patient Tissue Samples ( | Pathology Score | ||||
|---|---|---|---|---|---|---|---|
| 0 | 1+ | 2+ | 3+ | %≥1+ | |||
| 6 | 16 | 32 | 7 | 90% | |||
| 3 | 2 | 5 | 0 | 70% | |||
| 7 | 8 | 7 | 4 | 73% | |||
| 2 | 17 | 27 | 14 | 97% | |||
| 1 | 7 | 1 | 1 | 90% | |||
| 2 | 11 | 11 | 4 | 93% | |||
| 5 | 34 | 17 | 6 | 92% | |||
| 1 | 5 | 4 | 1 | 91% | |||
| 8 | 13 | 5 | 1 | 70% | |||
| 19 | 29 | 10 | 3 | 69% | |||
| 2 | 7 | 0 | 1 | 80% | |||
| 13 | 10 | 2 | 1 | 50% | |||
| 8 | 36 | 15 | 2 | 87% | |||
| 1 | 6 | 2 | 1 | 90% | |||
| 5 | 17 | 6 | 0 | 82% | |||
| 5 | 25 | 25 | 7 | 92% | |||
| 1 | 4 | 4 | 2 | 91% | |||
| 6 | 11 | 8 | 3 | 79% | |||
*Protein expression is scored but gene names are used to conserve space.
IHC scoring of marker expression in lung cancer patient tissue samples analyzed by lung cancer histology sub-type: markers with some expression in normal lung
| Target* | Tissue type | Patient tissue samples ( | Pathology score | ||||
|---|---|---|---|---|---|---|---|
| 0 | 1+ | 2+ | 3+ | %≥2+ | |||
| 8 | 17 | 18 | 13 | 55% | |||
| 1 | 3 | 2 | 4 | 60% | |||
| 4 | 4 | 6 | 12 | 69% | |||
| 6 | 22 | 28 | 4 | 53% | |||
| 2 | 5 | 3 | 0 | 30% | |||
| 4 | 8 | 14 | 1 | 56% | |||
| 4 | 25 | 27 | 4 | 52% | |||
| 1 | 5 | 3 | 1 | 40% | |||
| 1 | 15 | 8 | 1 | 36% | |||
| 0 | 14 | 40 | 6 | 77% | |||
| 0 | 5 | 4 | 1 | 50% | |||
| 0 | 5 | 19 | 2 | 81% | |||
*Protein expression is scored but gene names are used to conserve space.
Figure 5Marker expression in human lung cancer cell lines
mRNA microarray data was analyzed for non-small cell lung cancer cell lines with high and low/no endogenous expression of each of the markers. The graph shows three cell lines with high and three cell lines with low/no expression for each marker.
Significance of marker expression relative to survival by Affymetrix probe
| Probe Name | Gene | Log Rank | Log Rank |
|---|---|---|---|
| merck_NM_001216_at_CA9 | CA9 | ||
| merck2_DQ892208_at_CA9 | CA9 | ||
| merck_NM_001218_s_at_CA12 | CA12 | ||
| merck_AK096845_a_at_CA12 | CA12 | ||
| merck2_BC087838_at_CA12 | CA12 | ||
| merck_NM_001017978_s_at_CXorf61 | CXorf61 | 0.08 | |
| merck_BX538327_at_DSG3 | DSG3 | 0.53 | 0.10 |
| merck_NM_001944_a_at_DSG3 | DSG3 | 0.33 | 0.65 |
| merck2_M76482_at_DSG3 | DSG3 | 0.48 | 0.88 |
| merck_NM_001447_at_FAT2 | FAT2 | 0.78 | 0.49 |
| merck_NM_023915_s_at_GPR87 | GPR87 | 0.15 | |
| merck_NM_032551_s_at_KISS1R | KISS1R | 0.94 | 0.88 |
| merck_NM_014400_at_LYPD3 | LYPD3 | ||
| merck_NM_014331_at_SLC7A11 | SLC7A11 | ||
| merck_AI924527_a_at_TMPRSS4 | TMPRSS4 | 0.75 | 0.46 |
| merck2_NM_001083947_at_TMPRSS4 | TMPRSS4 | 0.41 | 0.74 |
| merck2_NM_183247_a_at_TMPRSS4 | TMPRSS4 | 0.93 | 0.57 |
*p-values < 0.05 are considered significant and are emboldened.
Figure 6Representative Kaplan–Meier survival curves for lung cancer markers using mRNA expression data dichotomized based on the median-cut point
The five-year survival for patients with high mRNA expression (dashed line) vs. low mRNA expression (solid line) was plotted for each of the markers. Shown are data for CA9 (A), CA12 (B), CXorf61 (C), LYPD3 (D), and SLC7A11 (E). Each of these markers shows a statistically significant difference in survival for patients with high vs. low mRNA expression.
Figure 7Representative Kaplan–Meier survival curves for lung cancer markers using mRNA expression data analyzed as tertiles
The five-year survival for the third of patients with the highest mRNA expression (short dashed line); the middle third (dashed line); and the third with lowest expression (solid line) was plotted for each of the markers. Shown are data for CA9 (A), CA12 (B), GPR87 (C), LYPD3 (D), and SLC7A11 (E). Each of these markers shows a statistically significant difference in survival for patients with high vs. low mRNA expression.
Figure 8Kaplan–Meier survival curves for (A) metagene and (B) both LYPD3 and CA12 groupings. (A) For the metagene analysis, data were dichotomized based on the median cut-point. The five-year survival for patients with high metagene expression (dashed line) vs. low metagene expression (solid line) was plotted. There is a statistically significant difference in survival (P < 0.01). (B) For the LYPD3 and CA12 combined analysis, mRNA expression data were divided into four subgroups. The five-year survival for patients with low LYPD3/low CA12 (solid line), low LYPD3/high CA12 (dashed line), high LYPD3/low CA12 (short dashed line) and high LYPD3/high CA12 (long dashed line) was plotted. There is a statistically significant difference in survival (P < 0.0001).
Figure 9Kaplan–Meier survival curves for (A) LYPD3 and (B) CA-IX using IHC scoring data. The data were normalized by multiplying the staining intensity by the tumor cell staining percent. The five-year survival for patients with high protein expression (red line) vs. low protein expression (blue line) was plotted.
Literature review of the selected markers and comparison to results herein
| Gene name | Protein name | Normal expression & function | Cancer expression & function | Comparison to results | References |
|---|---|---|---|---|---|
| CA9 | Carbonic anhydrase IX (CAIX) | GI tract. Catalyzes the reversible hydration of CO2 to H2CO3. | ↑ breast, lung, renal-cell cancers. ↑ SCC relative to other NSCLC. Correlates with poor survival, with a few exceptions. | General agreement. | [ |
| CA12 | Carbonic anhydrase XII (CAXII) | Brain, colon, rectum, esophagus, kidney, ovary, pancreas, prostate, testis and uterus. Catalyzes the reversible hydration of CO2 to H2CO3. | ↑ breast, lung, renal-cell cancers. ↑ protein correlates with ↑ survival. | ↓ survival with ↑ mRNA. | [ |
| CXorf61 | Kita-kyushu lung cancer antigen 1 (KK-LC-1) | Testis. Cancer testis antigen family. | ↑ mRNA in NSCLC. No correlation with survival. | mRNA results were not significant for SCC. ↑ mRNA correlates with ↓ survival. First report of ↑ protein in lung adenocarcinoma. | [ |
| DSG3 | Desmoglein 3 | Normal stratified squamous epithelia, GI tract. Cadherin superfamily. | ↑ SCCs, ↓ adenocarcinoma. ↑ increased survival. ↓ higher tumor grade. | ↑ SCCs, ↑ adenocarcinoma. No correlation with survival. | [ |
| FAT2 | Protocadherin Fat 2 | Cerebellum, epidermis. Cadherin superfamily. | ↑ esophageal, gastric, head and neck, ovarian, pancreatic cancers, cutaneous SCC and NSCLC. ↑ poor NSCLC survival. | No correlation with survival. | [ |
| GPR87 | G-protein coupled receptor 87 | Prostate, placenta, head and neck. P2Y purin receptor family. | ↑ bladder cancer and SCC of lung, cervix, head and neck, skin. Not elevated in adenocarcinoma. ↑ mRNA ↓ survival. | General agreement except ↑ in both SCC and adenocarcinoma. | [ |
| KISS1R | Kiss-1R | Placenta, pancreas, pituitary gland, brain. Role in normal and pathologic physiology, reproduction and pubertal development, hypothalamic-pituitary-gonadal axis. | ↑ bladder, hepatocellular, ovarian, pancreatic, renal cell, thyroid cancers. ↓ endometrial, esophageal, NSCLC and prostate cancers. ↑ NSCLC survival. | ↑ NSCLC and no correlation with survival. | [ |
| LYPD3 | Ly6/PLAUR domain-containing protein 3 | Squamous epithelia, placenta and peripheral blood leukocytes. | ↑ breast, colorectal, gastric, lung, melanoma and urothelial cancers. Correlates with poor NSCLC survival. | General agreement. | [ |
| SLC7A11 | Cystine/glutamate transporter (xCT) | Role in regulation of oxidative stress and maintenance of the cysteine-cystine redox cycle. Brain, spinal cord and pancreas. | ↑ NSCLC and many cancer types. Role in drug resistance. | Confirmed previous results in NSCLC with a larger data set and determined association with ↓ survival. | [ |
| TMPRSS4 | Transmembrane protease serine 4 protein | GI tract, urogenital tract, eye and skin. | ↑ breast, cervical, colorectal, gallbladder, gastric, liver, lung, ovarian, pancreatic and thyroid cancers. Cell invasion, migration and adhesion. ↑ correlates with poor survival. | No correlation with survival. | [ |