| Literature DB >> 25325012 |
Konstantinos Kerkentzes1, Vincenzo Lagani2, Ioannis Tsamardinos1, Mogens Vyberg3, Oluf Dimitri Røe4.
Abstract
OBJECTIVE: Novel statistical methods and increasingly more accurate gene annotations can transform "old" biological data into a renewed source of knowledge with potential clinical relevance. Here, we provide an in silico proof-of-concept by extracting novel information from a high-quality mRNA expression dataset, originally published in 2001, using state-of-the-art bioinformatics approaches.Entities:
Keywords: bioinformatics; carcinoid; lung adenocarcinoma; mesothelioma; microarray; small cell; squamous
Year: 2014 PMID: 25325012 PMCID: PMC4178426 DOI: 10.3389/fonc.2014.00251
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Flow-chart of the performed analyses. This figure illustrates the analyses performed during this study. The raw CEL files were obtained from the site of the authors of the original publication and were preprocessed with the RMA algorithm. Three different branches of analyses were performed; first, PCA was used for investigating the relationship among different types of tumors and against normal tissues. Second, differentially expressed and diagnostic genes were identified in each histological type versus the rest. On the basis of these lists of differentially expressed genes, enriched GO biological processes, and KEGG pathways were identified with the hyper-geometric test. Finally, the association between clinicopathological and transcriptional information were investigated in the primary adenocarcinoma samples. We tried to identify genes correlated with the tumor size, differentially expressed and diagnostic genes of lymph node status and metastasis, and single gene or gene signatures predictive of survival. The sample size differs because complete clinicopathological information was not always provided and in the survival analysis only the primary adenocarcinomas were taken into account.
Differentially regulated genes between each diagnostic group versus the rest (.
| Genes | Normal | Lung AD | SQ | COID | SCLC |
|---|---|---|---|---|---|
| Up | 1338 | 1527 | 1611 | 3247 | 493 |
| Down | 1680 | 1975 | 2064 | 2130 | 349 |
Up, upregulated; Down, downregulated; AD, adenocarcinoma; SQ, squamous cell carcinoma; COID, carcinoids; SCLC, small-cell lung cancer.
Figure 2Distribution of normal tissues and cancer subtypes in the principal component space. This visualization shows that normal and carcinoids (COID) have distinct expression profiles, while adenocarcinomas (AD) overlap with both squamous (SQ) and small-cell carcinoma (SCLC).
Figure 3Distribution of normal tissues and cancer subtypes except adenocarcinoma in the principal component space. Small-cell, squamous, and carcinoid subgroups show distinctive expression profile when the adenocarcinomas are omitted, indicating that adenocarcinoma is the most heterogeneous group.
Figure 4Distribution of primary versus metastatic adenocarcinoma samples in the principal component space. PCA plot showing the expression of the adenocarcinoma metastases to the lung from other locations (light blue) in relation to primary lung cancers including adenocarcinoma (red). The general expression profile of the adenocarcinoma metastases versus primary adenocarcinomas did not vary significantly.
Genes significantly correlated with primary lung adenocarcinoma tumor size.
| Symbol | Correlation | Adjusted |
|---|---|---|
| 0.4527 | 0.00564 | |
| −0.4255 | 0.01395 | |
| −0.3952 | 0.0369 | |
| −0.3946 | 0.0369 |
Positive correlation implies higher expression in larger tumors.
Immunomarker occurrence expressed as percentage of cases in lung adenocarcinoma, squamous cell carcinoma, small-cell carcinoma, and carcinoid.
| Lung cancer | CGA (%) | CK5 (%) | p63/p40 (%) | SNP (%) | TTF1 (%) | NKX2–1/TTF1, AUC | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| AD | 2 | 0.59D | 5–30 | 0.71D | 5–30 | 0.72D | 10 | 0.56D | 60–90 | 0.76U |
| SQ | 0 | ND | 90–100 | 90–100 | 0 | 0.78D | 0–20 | |||
| SCLC | 70–90 | ND | 0–5 | ND | 0–5 | ND | 70–90 | ND | 70–100 | 0.85U ( |
| COID | 95 | 1,0U | 0 | ND | 0 | ND | 100 | 0.99U | 50–70 | 0.72D |
aPredominantly focal reactions
.
Values are based on recent large studies [Ref. (.
CGA, chromogranin A; CHGA, CGA gene; CK5, cytokeratin 5; KRT5, CK5 gene; p63/p40, tumor protein p63/isoform of p63; TP63, p63 gene; SNP, synaptophysin; TTF1, thyroidea transcription factor 1; NKX2-1, TTF1 encoding gene; NKX2-2, homolog of NKX2; D, downregulated; U, upregulated; ND, not differentially expressed; COID, carcinoids; AD, adenocarcinoma; SCLC, small-cell lung cancer; SQ, squamous cell lung cancer.
Figure 5Comparison of protein and gene-expression patterns in primary lung neoplasias. (A) hematoxylin and eosin (H&E), (B) thyroid transcription factor-1 (TTF1), (C) cytokeratin 5 (CK5), (D) p63, and (E) synaptophysin (SNP). All photos ×200. (1) Adenocarcinoma, (A) H&E showing low differentiation with solid growth, (B) TTF1 showing strong nuclear staining of all tumor cells, (C) CK5 showing no cytoplasmic staining of tumor cells, (D) p63 showing nuclear staining of a few scattered tumor cells, and (E) SNP showing cytoplasmic staining of a few scattered tumor cells. (2) Squamous cell carcinoma, (A) H&E showing solid growth with slight squamous maturation to the left, (B) TTF1 showing no nuclear staining of tumor cells, scattered entrapped alveolar cells with normal TTF1 expression is seen, (C) CK5 showing strong cytoplasmic staining of most tumor cells, (E) p63 showing widespread nuclear staining of tumor cells, in the upper right corner an entrapped bronchiolus with normal p63 expression of basal cells is seen, and (E) SNP showing no cytoplasmic staining of tumor cells. (3) Small-cell carcinoma, (A) H&E showing sheet-like growth, (B) TTF1 showing very strong nuclear staining of tumor cells, (C) CK5 showing no staining of tumor cells, (D) p63 showing widespread but weak nuclear staining of tumor cells, and (E) SNP showing strong cytoplasmic staining of tumor cells. (4) Carcinoid, (A) H&E showing trabecular growth, (B) TTF1 showing moderate nuclear staining of most tumor cells, (C) CK5 showing no cytoplasmic staining of tumor cells, (D) p63 showing no nuclear staining of tumor cells; a few entrapped alveolar cells are positive, and (E) SNP showing strong cytoplasmic staining of tumor cells.
Figure 6Cell cycle pathway. Tumor suppressor genes (CDKN1A/p21, CDKN1C/p57, RBL2/p130) and the cyclin D family were overexpressed in normal lung tissue while there was type-specific overexpression of oncogenes, cyclins, cyclin dependent kinases, and other cell cycle driving genes in tumors. Genes belonging to the mini-chromosome maintenance complex were overexpressed in squamous and small-cell lung cancer, as well as in mesothelioma. The same genes were not differentially expressed in AD and downregulated in COID. Fifty-six out of 63 genes in the Cell Cycle pathway were overexpressed in SQ, including CCNE/CDK2, CCNA/CDK2, CCNA/CDK1, CCNB/CDK1 complexes, BUB1, BUB1B, BUB3, and MYC oncogenes, damage response genes (ATR, CHEK1/2) and five genes of the 14-3-3 family. Red is overexpressed, dark green downregulated. SQ, squamous type; AD, adenocarcinoma; SCLC, small-cell lung cancer; COID, carcinoid; MESO, mesothelioma.
Figure 7Nucleotide excision repair (NER) pathway. Genes belonging to all the principal DNA repair systems were predominantly overexpressed in tumors. The NER is the most analyzed pathway related to cisplatin resistance. In normal lung tissue (Normal), only one DNA repair gene was overexpressed, the XPC. In the COID, SCLC, SQ, and MESO tumors, an array of genes was overexpressed. Key genes of the whole repair process were overexpressed. In the SQ, 25 NER genes were differentially expressed, and only four were downregulated, the CUL4, XPD, ERCC2, and ERCC8. The most heterogeneous tumor group, AD, had the least NER genes overexpressed. Red indicates significantly overexpression, dark green significant downregulation, and light green no differential expression. COID, carcinoids; AD, adenocarcinoma; SCLC, small-cell lung cancer; SQ, Squamous cell lung cancer; MESO, mesothelioma.
Differential expression for each tumor type versus rest in KEGG pathways related to DNA repair.
| Genes | NER | BER | NHEJ | HR | MMR |
|---|---|---|---|---|---|
For each pathway, only the overexpressed genes are shown. Genes that are common between several lung cancer subtypes are in bold. The mesothelioma results are based on mesothelioma versus parietal pleura (see .
BER, base excision repair; HR, homologous recombination; MMR, mismatch repair; NER, nucleotide excision repair; NHEJ, non-homologous end joining; NOR, normal lung tissue; COID, carcinoids; AD, adenocarcinoma; SCLC, small-cell lung cancer; SQ, squamous cell lung cancer; MESO, mesothelioma.
Figure 8Homologous recombination (HR) pathway. The HR is the principal, non-error-prone, repair mechanism for DNA double-strand breaks. SQ and SCLC had 10 of 11 overexpressed genes throughout the pathway, including the very important BRCA2, RAD51 paralogs, and BLM (Bloom syndrome mutated). Red indicates significantly overexpression, dark green significant downregulation, and light green no differential expression. COID, carcinoids; AD, adenocarcinoma; SCLC, small-cell lung cancer; SQ, Squamous cell lung cancer; MESO, mesothelioma.
Figure 9Vascular smooth muscle contraction pathway. Genes belonging to the vascular smooth muscle contraction pathway were significantly overrepresented and upregulated in normal tissue (15/36 genes, p < 0.0001). None of the genes of the smooth muscle cell membrane (according to KEGG pathways) were overexpressed in SQ, SCLC, and mesothelioma. Red indicates significantly overexpression, dark green significant downregulation, and light green no differential expression. COID, carcinoids; AD, adenocarcinoma; SCLC, small- cell lung cancer; SQ, squamous cell lung cancer; MESO, mesothelioma.