| Literature DB >> 24299561 |
Vladimir Lazar1, Chen Suo, Cedric Orear, Joost van den Oord, Zsofia Balogh, Justine Guegan, Bastien Job, Guillaume Meurice, Hugues Ripoche, Stefano Calza, Johanna Hasmats, Joakim Lundeberg, Ludovic Lacroix, Philippe Vielh, Fabienne Dufour, Janne Lehtiö, Rudolf Napieralski, Alexander Eggermont, Manfred Schmitt, Jacques Cadranel, Benjamin Besse, Philippe Girard, Fiona Blackhall, Pierre Validire, Jean-Charles Soria, Philippe Dessen, Johan Hansson, Yudi Pawitan.
Abstract
BACKGROUND: Non-small cell lung cancer (NSCLC), a leading cause of cancer deaths, represents a heterogeneous group of neoplasms, mostly comprising squamous cell carcinoma (SCC), adenocarcinoma (AC) and large-cell carcinoma (LCC). The objectives of this study were to utilize integrated genomic data including copy-number alteration, mRNA, microRNA expression and candidate-gene full sequencing data to characterize the molecular distinctions between AC and SCC.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24299561 PMCID: PMC4222074 DOI: 10.1186/1755-8794-6-53
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Characteristics of the patients
| Age median (range) | 63 (40.9-84.6) |
| Males n (%) | 89 (72%) |
| Smoking Current | 64 (52%) |
| Former | 51 (42%) |
| Never | 7 (6%) |
| Histology AC | 57 (46%) |
| SCC | 50 (41%) |
| LCC | 13 (11%) |
| Other | 3 (3%) |
| Stage 1 | 56 (50%) |
| 2 | 25 (22%) |
| 3 | 28 (25%) |
| 4 | 4 (4%) |
| Adjuvant chemo (%) | 61 (50%) |
Characteristics of the patients in the study population.
Figure 1Flowchart of DGS algorithm. Driver and targets are identified in a three step process, as shown. Candidate drivers are firstly selected from genes/miRNAs that reside in copy-number altered (CNA) regions and filtered by various procedures, for example based on fold-change and consistency between expression level and copy number status. The rests of genes/miRNAs are candidate targets, which are grouped based on correlation with the candidate drivers. Correlation between all drivers and targets in each module is highlighted using sparse canonical correlation analysis (SCCA).
Figure 2Differential genomic regions for AC vs LCC vs SCC populations aCGH profiles. The three upper panels display the average profiles of AC, LCC and SCC subpopulations as their respective frequencies of gains (green, from 0 to 100%) and losses (red, from 0 to −100%) along the human genome. Darker green bars correspond to the frequencies of amplifications, defined as regions with a log2 (ratio) above 1.0. The lowest panel shows the significance of the ANOVA tests displaying the minus log10-transformed raw (lighter blue) and BH-adjusted (darker blue) p-values. The horizontal red line corresponds to a BH-adjusted p-value < 1.0E-05. Arrows point to the two most significant differential regions: 3q26.2-3q29 and 22q12.1-22q13.1.
Figure 3Principal component plots of the mRNA data. The first and second principal component plot (left) and the first and third principal component plot (right) of the mRNA data revealed the separation of squamous-cell carcinoma (S) from the adeno-carcinoma (A) and large-cell carcinoma (L).
List of top 5 secreted and 5 non-secreted markers
| SPP1 | Secreted phosphoprotein 1 | 96 | 98 | 0.41 | 0.40 |
| (<10-10) | (<10-10) | (0.70) | (0.76) | ||
| CTHRC1 | Collagen triple helix containing 1 | 96 | 98 | 0.42 | 0.40 |
| (<10-10) | (<10-10) | (0.63) | (0.75) | ||
| GREM1 | Gremlin 1 | 88 | 98 | 0.40 | 0.40 |
| (<10-10) | (<10-10) | (0.77) | (0.78) | ||
| SPINK1 | Serine peptidase inhibitor Kazal type 1 | 93 | 80 | 0.80 | 0.81 |
| (<10-10) | (1.14 × 10-7) | (0.03) | (0.01) | ||
| BMP7 | Bone morphogenetic protein 7 | 34 | 90 | 0.86 | 0.82 |
| (0.01) | (<10-10) | (0.003) | (0.01) | ||
| KRT6A | Keratin 6A | 73 | 96 | 0.90 | 0.88 |
| (9.18 × 10-5) | (<10-10) | (0.002) | (0.002) | ||
| TP63 | Tumor protein p63 | 39 | 98 | 0.95 | 0.85 |
| (0.09) | (<10-10) | (<0.001) | (0.01) | ||
| LGALS7 | Lectin, galactoside-binding, soluble, 7 | 73 | 94 | 0.87 | -- |
| (9.18 × 10-5) | (<10-10) | (0.003) | -- | ||
| GCNT3 | Glucosaminyl (N-acetyl) transferase 3 | 88 | 80 | 0.72 | 0.68 |
| (<10-10) | (1.14 × 10-7) | (0.08) | (0.10) | ||
| SPRR2D | Small proline-rich protein 2D | 52 | 96 | 0.84 | -- |
| (0.76) | (<10-10) | (0.003) | -- |
Notes: T = tumour, N = Normal; Overexpression is defined as log(T/N) > 0. P-values in the brackets are from the test that whether the frequency differs from 50%. AUC values refer to prediction of AC vs SCC. Median values of individual AUCs of the ten biomarkers in classifying histology type are computed. Monte-Carlo p-values based on 1,000 runs are indicated in the bracket. LGALS7 and SPRR2D probes were not in Bild et al. data.
List of top 5 secreted (the first five entries) and 5 non-secreted markers (the second five entries) that are over-expressed in tumours compared to normal tissues.
AUCs of the candidate driver genes and miRNAs
| Genes | MRPS22 | RNF7 | NDRG1 | FAM49B |
| Chemores | 0.81 (0.02) | 0.75 (0.05) | 0.73 (0.07) | 0.40 (0.76) |
| Bild | 0.74 (0.04) | 0.81 (0.01) | 0.72 (0.05) | 0.40 (0.77) |
| miRNAs | hsa-miR-944 | hsa-miR-570 | hsa-miR-16-2* | hsa-miR-31* |
| Chemores | 0.88 (0.001) | 0.59 (0.10) | 0.56 (0.18) | 0.40 (0.61) |
| Ming You | 0.78 (<0.001) | 0.52 (0.44) | 0.44 (0.76) | 0.56 (0.31) |
Median values of individual AUCs of the four candidate driver genes and the four candidate driver miRNAs in classifying histology type. Monte-Carlo p-values based on 1,000 runs are indicated in the bracket.
Figure 4Network links between 23 genes in pathway of mismatch repair and three driver genes. Network links between 23 genes in pathway of mismatch repair and driver genes MRPS22, NDRG1, RNF7. Links shown include physical interactions, metabolic and signaling links from the functional coupling network (http://FunCoup.sbc.su.se).