Literature DB >> 28056099

Integrated Analysis of Genome-Wide Copy Number Alterations and Gene Expression Profiling of Lung Cancer in Xuanwei, China.

Yanliang Zhang1,2,3, Qiuyue Xue1,2,3, Guoqing Pan4, Qing H Meng5, Xiaoyu Tuo4, Xuemei Cai1,2,3, Zhenghui Chen1,2,3, Ya Li1,2,3, Tao Huang6, Xincen Duan7, Yong Duan1,2,3.   

Abstract

OBJECTIVES: Lung cancer in Xuanwei (LCXW), China, is known throughout the world for its distinctive characteristics, but little is known about its pathogenesis. The purpose of this study was to screen potential novel "driver genes" in LCXW.
METHODS: Genome-wide DNA copy number alterations (CNAs) were detected by array-based comparative genomic hybridization and differentially expressed genes (DEGs) by gene expression microarrays in 8 paired LCXW and non-cancerous lung tissues. Candidate driver genes were screened by integrated analysis of CNAs and DEGs. The candidate genes were further validated by real-time quantitative polymerase chain reaction.
RESULTS: Large numbers of CNAs and DEGs were detected, respectively. Some of the most frequently occurring CNAs included gains at 5p15.33-p15.32, 5p15.1-p14.3, and 5p14.3-p14.2 and losses at 11q24.3, 21q21.1, 21q22.12-q22.13, and 21q22.2. Integrated analysis of CNAs and DEGs identified 24 candidate genes with frequent copy number gains and concordant upregulation, which were considered potential oncogenes, including CREB3L4, TRIP13, and CCNE2. In addition, the analysis identified 19 candidate genes with a negative association between copy number change and expression change, considered potential tumor suppressor genes, including AHRR, NKD2, and KLF10. One of the most studied oncogenes, MYC, may not play a carcinogenic role in LCXW.
CONCLUSIONS: This integrated analysis of CNAs and DEGs identified several potential novel LCXW-related genes, laying an important foundation for further research on the pathogenesis of LCXW and identification of novel biomarkers or therapeutic targets.

Entities:  

Mesh:

Year:  2017        PMID: 28056099      PMCID: PMC5215791          DOI: 10.1371/journal.pone.0169098

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Lung cancer is the leading cause of cancer mortality worldwide. It is the fastest-increasing cancer in China and has been the leading cause of cancer death in China since 2004 [1]. The incidence of lung cancer is particularly high in some regions of the country, such as Xuanwei and Gejiu. Despite recent advances in surgical and chemo/radiation therapies, the prognosis of lung cancer is still very poor, with a 5-year overall survival rate of only ~15%. Thus, the need to combat lung cancer in China is unprecedented and still growing. Xuanwei City (formerly known as Xuanwei County) is located in the northeast of Yunnan Province, China. It is 102 km from east to west, and 91 km from north to south, with a total area of 6,257 km2. The morbidity and mortality rates of lung cancer in Xuanwei are the highest in China and have shown clear upward trends since the mid-1970s [2]. Coal is the major resource in the Xuanwei area. Xuanwei residents traditionally use one or more of three different types of fuel—smoky coal (bituminous coal), smokeless coal (anthracite coal), and wood—in unvented indoor firepits for cooking and heating [3]. When burning smoky coal, the indoor air concentrations of particulate matter and extractable organic matter may reach as high as 24.4 mg/m3 and 17.6 mg/m3, respectively, and the corresponding benzo[a]pyrene concentration, an indicator of carcinogenic polycyclic aromatic hydrocarbons (PAHs), can reach as high as 19.3 μg/m3 which is comparable to exposure levels experienced by coke oven workers [4]. Epidemiological studies have suggested that the high incidence of LCXW is due mainly to the burning of smoky coal indoors without adequate ventilation [3,5-7]. The lung cancers that develop in Xuanwei show distinct characteristics [1] and are referred to as LCXW. In some villages, the mortality rate of female patients is as high as 400 per 100,000. In fact, women in Xuanwei, who are mostly nonsmokers (smoking rate < 1%), have the highest lung cancer rate in China. The sex ratio of lung cancer mortality rates between males and females in Xuanwei is 1.09, which is significantly lower than the national average of 2.09. LCXW incidence peaks at a younger age (41–50 years), more than 10 years younger than the peak incidence of lung cancer in other areas of China. Finally, LCXW mortality is strongly correlated with domestic use of smoky coal. Most cancers are characterized by differentially expressed genes (DEGs), genes whose expression is significantly different in cancerous cells than in their nearby normal cells. These genes are assumed to play important roles in the occurrence and development of cancers. Gene expression profiling by microarray analysis has been shown to be a powerful tool for identification of cancer-related genes. This analysis, however, usually detects a large number of DEGs, and therefore the key challenge in expression profiling analysis is how to pinpoint which DEGs are critical to cancer formation (“driver genes”) and which are not (“passenger genes”). Cancer is a genetic disease of altered somatic cells arising from accumulation of genetic changes. DNA copy number alteration (CNA), an important type of genetic alterations in various cancers, can contribute to the development and progression of cancer by altering the expression of genes within the regions of copy number changes [8]. Recent studies have indicated that integrated analysis of DNA CNAs and corresponding DEGs is an effective approach to identify the driver genes in multiple cancer types [9,10]. Previous studies of LCXW are focused mainly on its epidemiology, and little is known about its pathogenesis. Because of its distinctive etiology and characteristics, the pathogenesis of LCXW may be different from that of lung cancers occurring in other geographic areas. LCXW provides us with a unique opportunity to research the pathogenesis of non-tobacco-related lung cancer. Our purpose here was to screen for potential novel driver genes in LCXW through integrated analysis of genome-wide DNA CNAs and DEGs from paired LCXW and non-cancerous lung (NCL) tissues.

Methods

Sample Collection

Primary lung adenocarcinoma and paired NCL tissues (> 5 cm from carcinoma tissues) were collected from 84 patients from Xuanwei at the First Affiliated Hospital of Kunming Medical University, Kunming, China. The samples were fresh frozen and stored. The 8 paired samples collected at first were tested by microarrays and the rest samples were used for validation analysis. Written informed consent was obtained from all patients. The study was approved by the Institutional Review Board for the Use of Human Subjects at Kunming Medical University. All samples were assessed by an experienced pathologist to confirm the presence (> 80%) or absence of cancer cells. Clinicopathological characteristics of all patients were collected (Table 1). None of the patients received chemotherapy or radiotherapy treatment prior to surgery.
Table 1

Baseline clinicopathologic features of a cohort of lung cancer patients in Xuanwei, China.

CharacteristicNo. of patients (%) N = 84
SexMale50 (59%)
Female34 (41%)
Age, years< 5560 (71%)
≥ 5524 (29%)
Smoking, everYes34 (41%)
No50 (59%)
FIGO stagingI + II64 (76%)
III20 (24%)
Lymphatic metastasisYes36 (43%)
No48 (57%)

Isolation of Nucleic Acids

Genomic DNA was extracted by using the DNeasy Blood & Tissue Kit (Qiagen, Hilden, Germany), and RNA was isolated by using the PureLink® RNA Mini Kit (Thermo Fisher Scientific, Waltham, MA, USA), both according to the manufacturers’ protocols.

Array-Based Comparative Genomic Hybridization Analysis

Oligonucleotide array-based comparative genomic hybridization (array-CGH) analysis was carried out on the 8 paired samples using Roche NimbleGen Human CGH 3×720K WG-T v3.0 Array (NimbleGen, Madison, WI, USA) according to the manufacturer’s protocol. All array-CGH coordinates in this study were mapped against the human genome as defined by the UCSC build hg18. The log2 copy-number ratio calculation and CNA calls were determined by using the segMNT algorithm in NimbleScan. Log2 ratio test/control thresholds of 0.25 and –0.25 were defined as copy number gains and losses, respectively. Deviant signal intensity ratios involving 5 or more neighboring probes were considered genomic aberrations.

Gene Expression Microarray

Gene expression profiling analysis was performed on the same 8 paired samples using the Agilent Oligo Microarray Kit 8×60K according to the Agilent One-Color Microarray-based Gene Expression Analysis Protocol (Agilent Technologies, Santa Clara, CA, USA). The data were analyzed by GeneSpring software GX 12.6 (Agilent Technologies). Significantly DEGs were identified by using the mixed model analysis of variance [11] with a false discovery rate (Benjamini–Hochberg test) adjusted p value of ≤ 0.05 and absolute fold-change values ≥ 2 or ≤ 0.5. Hierarchical clustering was generated to visualize patterns of expression using cluster 3.0. Gene ontology (GO) analysis and Pathway analysis were performed using MAS 3.0. Pathway enrichment analysis was performed by using the latest KEGG database (http://www.kegg.jp/).

Integrated Analysis

Integrated analysis for array-CGH data and gene expression data consisted of 4 steps as follows. In step 1, recurrent CNAs across samples were identified. Recurrent CNAs were defined as genomic segments that were altered in at least 3 samples. In step 2, concordant recurrent CNAs were identified. Three kinds of recurrent CNAs from step 1 were filtered out: the CNAs whose changes were inconsistent among samples, the CNAs that did not include any gene, and the copy number gains that include only partial segments of a gene. In step3, DEGs in CNAs were identified. The DEGs presented in the concordant recurrent CNA regions from step 2 were selected, while unchanged genes were filtered out. In step 4, candidate driver genes were pinpointed by searching the PubMed database (http://www.ncbi.nlm.nih.gov/pubmed) to retrieve current knowledge about DEGs identified from step 3, their function and role in cancer; genes that had a potential role in tumorigenesis and had not previously been reported in lung cancer were screened out for further study.

Real-Time Quantitative Polymerase Chain Reaction (RT-qPCR) Analysis

Firstly, the candidate genes selected by the integrated analysis were validated in 8 paired samples by real-time quantitative polymerase chain reaction (RT-qPCR). Then, RT-qPCR also was used to determine copy number changes in these genes in the other 76 paired samples and gene expression changes in 50 of the paired samples. Gene expression analysis was not possible for 26 of the paired samples because of sample degradation. GAPDH was selected as an internal control. The primer sets were designed using the Primer Premier 5.0 (Primer, Canada) (Table 2). RT-qPCR was performed using SYBR®Premix Ex TaqTM SYBR Green I (TaKaRa, Dalian, China) on the ABI 7300 Sequence Detection System (Applied Biosystems, Foster City, CA, USA) and replicated three times. Cycling conditions were 95°C for 15 s followed by 40 cycles of 95°C (5 s), 60°C (15 s) and one cycle of 95°C (15 s), 60°C (60 s), 95°C (15 s). The data were analyzed by the 2-ΔΔCt method. 2-ΔΔCt ≥ 1.5 or ≤ 0.5 was defined as copy number gain or loss, respectively, and 2-ΔΔCt ≥ 2 or ≤ 0.5 was defined as upregulation or downregulation, respectively.
Table 2

Primers used for detecting both copy number changes and expression changes in 7 candidate genes.

AnalysisGenePrimersLength of products (bp)
Copy number detectionCREB3L4F: 5′-TTCCGTTTGTGGACCCTCAG-3′296
R: 5′-CCTCACCTGTCCCCTCGATA-3′
TRIP13F: 5′-CCCCAGCACTTCGGTTCA-3′116
R: 5′-GCCCTTTCTCCCGCCTTT-3′
CCNE2F: 5′-CATGGTCGGATTAACTCACACG-3′336
R: 5′-CCTGCATTCTGTCCCACCTTA-3′
AHRRF: 5′-CAGACAGGCAGGAATGAACA-3′116
R: 5′-TAGGAAGGAAGGGAAGG-3′
KLF10F: 5′-TTGTCATCCAAATGACACACAGA-3′256
R: 5′-GTGCCTCTCTCCCATGAACG-3′
MYCF: 5′-AGAGTTTCATCTGCGACCCG-3′259
R: 5′-AGAGGGTAGGGGAAGACCAC-3′
NKD2F: 5′-CCTAAACTGGGCATCTGTGG-3′119
R: 5′-CTCTCTGGCTCCTGCTGACT-3′
GAPDHF: 5′-CCACCACACTGAATCTCCCC-3′262
R: 5′-CGAAGCAAGCAAGGCTGTTT-3′
Gene expression detectionCREB3L4F: 5′-CTGCCCTGTCAAACCCTGTT-3′142
R: 5′-GCTTGTTACGGATTTTCCTCCT-3′
TRIP13F: 5′-CTGGAGGAAGAGACAGAAAACATAA-3′134
R: 5′-GTTGTCATCACATAATCGAGGAGAT-3′
CCNE2F: 5′-GGAACCACAGATGAGGTCCAT-3′237
R: 5′-CCATCAGTGACGTAAGCAAACT-3′
AHRRF: 5′-GCGGACGGTTCTTCCTAATC-3′116
R: 5′-GCAGTTTCCTGTGTCTTCTC-3′
KLF10F: 5′-CTTCCGGGAACACCTGATTTT-3′161
R: 5′-GCAATGTGAGGTTTGGCAGTATC-3′
MYCF: 5′-GGCTCCTGGCAAAAGGTCA-3′119
R: 5′-CTGCGTAGTTGTGCTGATGT-3′
NKD2F: 5′-CCGACAGCAAACAGCAACT-3′156
R: 5′-AGCCTTAGAGCCAGGAAACA-3′
GAPDHF: 5′-TGTTGCCATCAATGACCCCTT-3′202
R: 5′-CTCCACGACGTACTCAGCG-3′

Results

Copy Number Alterations

Array-CGH detected 592 CNAs in the 8 paired LCXW samples (S1 Table). Copy number profiles were very heterogeneous: some cases showed multiple distinctive chromosomal aberrations, whereas others showed few chromosomal aberrations (Fig 1, S1–S8 Files).
Fig 1

Array-CGH rainbows showed significant copy number heterogeneity across 8 paired LCXW samples.

Gene Expression Profiling

A total of 5,129 genes were identified as DEGs. Of these DEGs, 3,248 were upregulated while the other 1,881 genes were downregulated (S2 Table). Cluster analysis of these DEGs showed a distinct separation between the LCXW and NCL tissues (Fig 2). GO analysis indicated that these DEGs were involved in a wide range of cancer-related processes, including cell division, cell adhesion, cell proliferation and DNA replication. Pathway analysis showed these DEGs were involved in many pathways, such as those regulating p53 signaling, MAPK, Jak-STAT signaling, hedgehog signaling, and non-small cell lung cancer.
Fig 2

Hierarchical clustering of gene expression data showed a clear separation between the LCXW (A) and NCL tissues (P).

Integrated Analysis of Copy Number Alterations and Gene Expression Profiling

To identify candidate CNAs from the 592 CNAs, sporadic CNAs among samples were removed from the dataset, leaving 95 recurrent CNAs detected in at least 3 samples (S3 Table). Among these 95 recurrent CNAs, 32 CNAs were inconsistent among samples and therefore were removed; thus, 63 concordant recurrent CNAs, comprising 56 gains and 7 losses, were identified (S4 Table). Among these 63 concordant recurrent CNAs, 14 contained no gene and 11 gains contained only partial gene segments; these CNAs were removed, revealing 38 candidate CNAs, including 34 gains and 4 losses (S5 Table). Of these candidate CNAs, the most frequent gains were 5p15.33-p15.32, 5p15.1-p14.3, and 5p14.3-p14.2, and the most frequent losses were 11q24.3, 21q21.1, 21q22.12-q22.13, and 21q22.2. These 38 candidate CNAs affected 246 genes, including protein-coding genes and hypothetical genes (S5 Table). The integrated analysis of CNAs and gene expression results identified 24 genes (9.6%) that exhibited frequent copy number gains and concordant upregulation in the tumors (Table 3). A negative association between copy number and gene expression level was observed in 19 genes (7.7%) (Table 4), 15 genes (6.1%) that exhibited frequent copy number gains but were downregulated and 4 genes (1.6%) that were frequently deleted but upregulated. Review of the literature on these genes identified 3 genes in the positively correlated set (CREB3L4, TRIP13, and CCNE2) as potential oncogenes and 4 genes in the negatively correlated set (AHRR, NKD2, MYC, and KLF10) as potential tumor suppressor genes. KEGG pathway enrichment analysis revealed that the significantly enriched pathways were PI3K-Akt signaling (S1 Fig), prostate cancer (S2 Fig), cell cycle (S3 Fig), Wnt signaling (S4 Fig), and pathways in cancer (S5 Fig); this analysis also showed that CCNE2, MYC, and CREB3L4 were the key involved genes. No pathway involving TRIP13, AHRR, or KLF10 was found. The remaining 203 genes (82.5%) exhibited copy number changes, but no changes in transcript levels were observed in 49 of genes (19.9%) or detected in the other 154 genes (62.6%).
Table 3

Concordantly changed genes located in candidate copy number alterations.

RegionCytobandSize (bp)Gain/LossGeneGene expression
chr1:152175304,1522181601q21.342,857gainCREB3L4up
chr5:98922,12021325p15.331,103,211gainCEP72up
TRIP13up
chr5:1202132,45342125p15.33-p15.323,332,081gainSLC6A3up
chr5:4534212,107603685p15.32-p15.26,226,157gainADAMTS16up
SRD5A1up
LOC442132up
DNAH5up
chr5:15167194,194555975p15.1-p14.34,288,404gainBASP1up
chr5:21612069,244426055p14.3-p14.22,830,537gainPMCHL1up
chr5:24442605,275197795p14.2-p14.13,077,175gainLOC643401up
chr8:80437231,807532858q21.13316,055gainSTMN2up
chr8:95372666,963485268q22.1975,861gainRAD54Bup
INTS8up
CCNE2up
chr8:102462829,1034742568q22.31,011,428gainGRHL2up
chr8:104149770,1047655608q22.3615,791gainCTHRC1up
RIMS2up
chr8:121617619,1223309898q24.12713,371gainSNTB1up
chr8:124301834,1244884368q24.13186,603gainATAD2up
chr8:124529533,1249851468q24.13455,614gainFBXO32up
ANXA13up
FAM91A1up
chr8:128501258,1296664388q24.211,165,181gainPVT1up
Table 4

Inconsistently changed genes located in candidate copy number alterations.

RegionCytobandSize (bp)Gain/LossGeneGene expression
chr5:98922,12021325p15.331,103,211gainAHRRdown
TPPPdown
NKD2down
chr5:4534212,107603685p15.32-p15.26,226,157gainSEMA5Adown
FAM105Adown
chr5:15167194,194555975p15.1-p14.34,288,404gainFBXL7down
chr8:79587922,800037508q21.12415,829gainPKIAdown
chr8:82435662,827178338q21.13282,172gainFABP4down
chr8:102462829,1034742568q22.31,011,428gainNCALDdown
chr8:103630996,1038492288q22.3218,233gainKLF10down
chr8:107627585,1085038808q23.1876,296gainABRAdown
ANGPT1down
chr8:108962975,1095340118q23.1571,037gainRSPO2down
chr8:110496232,1106896738q23.1-q23.2193,442gainPKHD1L1down
chr8:128501258,1296664388q24.211,165,181gainMYCdown
chr21:36356593,3682153621q22.12-q22.13464,944lossCBR1up
CBR3up
CHAF1Bup
CLDN14up

Validation of the Candidate Genes by RT-qPCR

The results of the RT-qPCR analysis of the 7 candidate genes in the 8 paired samples were consistent with the microarray results (data not shown), indicating that the microarrays were accurate. Further RT-qPCR analysis of copy number changes in the 7 candidate genes in the total paired patient samples showed that each gene had copy number gains in at least 40% (34–58) of the 84 (8+76) LCXW samples. Analysis of gene expression changes in the total patient samples showed that CREB3L4, TRIP13, and CCNE2 were upregulated in at least 55% (32–40) of the 58 (8+50) LCXW samples, while AHRR, NKD2, MYC, and KLF10 were downregulated in at least 48% (28–34) of the 58 LCXW samples (Table 5).
Table 5

Validation of copy number changes and expression of 7 candidate genes.

GeneCopy number validation (N = 84)Gene expression validation (N = 58)
Gain / Lossn (%)Upregulation / Downregulationn (%)
CREB3L4Gain40 (48%)Upregulation32 (55%)
TRIP13Gain58 (69%)Upregulation40 (69%)
CCNE2Gain46 (55%)Upregulation40 (69%)
AHRRGain34 (40%)Downregulation32 (55%)
NKD2Gain52 (62%)Downregulation28 (48%)
MYCGain40 (48%)Downregulation30 (52%)
KLF10Gain42 (50%)Downregulation34 (59%)

Discussion

This integrated analysis of genomic DNA CNAs and gene expression profiling in LCXW and paired normal tissue was designed to screen potential novel “driver genes” in LCXW. Overall, the 38 candidate CNAs featured more gains than losses (34 vs. 4). The recurrent gains were located mainly on chromosomes 5p, 8q, 7p, and 1q, and losses were located on 21q and 11q. Comparison of these results with reported data for lung adenocarcinoma [12-14] identified Amp_5p15.33, Amp_7p11.2, and Amp_8q24.21 as common recurrent CNAs in all the studies (Table 6), suggesting that these regions may be variant hotspots in lung adenocarcinoma. CNAs, such as Del_9p21.3 and Amp_14q13.3 that have been reported to have the highest mutation frequencies in lung adenocarcinoma [12-14] were not identified in our LCXW samples and, similarly, many concordant recurrent CNAs detected in our study did not overlap with the reported data [12-14] (Table 6), suggesting that genomic copy number changes in LCXW may differ from those of other lung cancers. Because of the small sample size in our study, however, further studies are needed to determine the characteristic CNAs in LCXW.
Table 6

Comparison of concordant recurrent CNAs with literature.

This studyStaaf et al [12]The Cancer Genome Atlas Research Network [13]Barbara et al [14]Top candidate gene
CNARegion (bp)CNARegion (bp)CNARegion (bp)CNARegion (Mb)
Amp_1q21.3chr1:152175304–152218160Amp_1q21.3chr1:120523956–152743148CREB3L4
Amp_5p15.33chr5:98922–1202132Amp_5p15.33chr5:120000–1686000AHRR, TRIP13, NKD2
Amp_5p15.33-p15.32chr5:1202132–4534212Amp_5p15.33chr5:120000–1686000Amp_5p15.33chr5:1288616–1300024Amp_5p15.33chr5:0.75–1.62TERT, SLC6A3
Amp_5p15.32-p15.2chr5:4534212–10760368Amp_5p15.31chr5:8.88–10.51
Amp_5p15.1-p14.3chr5:15167194–19455597
Amp_5p14.3chr5:19455597–20320969Amp_5p14.3chr5:19.72–23.09
Amp_5p14.3chr5:21488497–21612069Amp_5p14.3chr5:19.72–23.09
Amp_5p14.3-p14.2chr5:21612069–24442605Amp_5p14.3chr5:19.72–23.09
Amp_5p14.2-p14.1chr5:24442605–27519779
Amp_7p22.3chr7:136363–478785
Amp_7p11.2chr7:54989787–55769659Amp_7p11.2chr7:54795000–55455000Amp_7p11.2chr7:54535672–55737616Amp_7p11.2chr7:54.65–55.52EGFR
Amp_8q21.12chr8:79282515–79584188
Amp_8q21.12chr8:79587922–80003750
Amp_8q21.12-q21.13chr8:80175394–80433401
Amp_8q21.13chr8:80437231–80753285Amp_8q21.13chr8:80.66–82.55
Amp_8q21.13chr8:80756352–81238430Amp_8q21.13chr8:80.66–82.55
Amp_8q21.13chr8:81592461–81654094Amp_8q21.13chr8:80.66–82.55
Amp_8q21.13chr8:81718686–82433172Amp_8q21.13chr8:80.66–82.55
Amp_8q21.13chr8:82435662–82717833Amp_8q21.13chr8:80.66–82.55
Amp_8q22.1chr8:95372666–96348526CCNE2, RAD54B
Amp_8q22.3chr8:102462829–103474256Amp_8q22.3chr8:102908001–103565001GRHL2, NCALD
Amp_8q22.3chr8:103630996–103849228KLF10
Amp_8q22.3chr8:103856352–104144193
Amp_8q22.3chr8:104149770–104765560CTHRC1
Amp_8q23.1chr8:107627585–108503880
Amp_8q23.1chr8:108962975–109534011
Amp_8q23.1chr8:110143020–110414628
Amp_8q23.1-q23.2chr8:110496232–110689673
Amp_8q23.2chr8:110697169–110915327
Amp_8q24.11chr8:118584139–118626715
Amp_8q24.12chr8:121617619–122330989
Amp_8q24.13chr8:124301834–124488436
Amp_8q24.13chr8:124529533–124985146ANXA13
Amp_8q24.21chr8:128501258–129666438Amp_8q24.21chr8:128729001–128873001Amp_8q24.21chr8:129157821–129195260Amp_8q24.21chr8:129.18–129.34MYC, PVT1
Del_11q24.3chr11:129705556–129762130Del_11q24.3-q25chr11:127528001–131659001
Del_21q21.1chr21:17559651–17823071Del_21q21.1chr21:1–32497730
Del_21q22.12-q22.13chr21:36356593–36821536CHAF1B, CBR1
Del_21q22.2chr21:39568333–39679987
In carrying out this analysis, the initial array-CGH detected a large number of CNAs. To screen out the best candidate CNAs, we filtered out the CNAs that were inconsistent between samples, the CNAs without a gene, and the gains containing only a partial gene segment. Through this process of elimination, 38 concordant recurrent CNAs were selected. This approach, on the one hand, may be an effective screening method for vital CNAs; on the other hand, focusing only on concordant recurrent CNAs may exclude important sporadic CNAs that may have a role in the cancerous phenotype of interest. A total of 246 genes were located in the 38 candidate CNAs. Of these, only 24 genes were upregulated and concordantly increased in copy number, and none was downregulated with loss in copy number. In fact, the change in expression in many genes was inconsistent with the copy number change, in some cases even showing negative correlation. Of the 19 negatively correlated genes, 15 genes located in copy number gains were significantly downregulated, and the other 4 genes located in copy number losses were significantly upregulated. This paradoxical negative relationship between copy number status and gene expression has also been observed in other cancers [10]. It might be attributable to the multiple mechanisms that are responsible for normal and abnormal control of gene expression, including those related to gene mutation, promoter methylation, and non-coding RNA regulation. Overall, the upregulated genes represent potential candidate oncogenes, while the downregulated genes represent potential candidate tumor suppressor genes in LCXW. There were 49 genes with copy number changes, including some known cancer-related genes, such as TERT and EGFR, that did not show expression changes, suggesting that their expression may be not gene-dose dependent and that they are likely to be passenger genes or play a role in LCXW carcinogenesis in other ways. The remaining 154 genes, including a large number of hypothetical genes, were undetected by microarrays, and thus, they were removed from the analysis. Integrated analysis of CNAs and corresponding DEGs has been shown to be an effective approach to identify genes with altered copy numbers directly impacting on the expression levels [9,10], however, not all of these genes are cancer-related. In order to further narrowing the scope of candidate genes, the literature on the 43 positively or negatively correlated genes were reviewed, then we selected 7 genes, including 3 positively correlated genes (CREB3L4, TRIP13, and CCNE2) and 4 negatively correlated genes (AHRR, NKD2, MYC, and KLF10) as candidate driver genes, which were further validated by RT-qPCR. KEGG pathway enrichment analysis showed that CCNE2, MYC, CREB3L4, and NKD2 are involved in many tumor-related pathways, suggesting that these genes may play an essential role in cancer development. CREB3L4 (cAMP responsive element binding protein 3-like 4) is located on chromosome 1q21.3 and encodes a cAMP responsive element binding protein which functions in a number of processing pathways, such as transcriptional regulation, signal transduction, and cell homeostasis. CREB3L4 has been shown to be associated with the development of cancers [15, 16]. CREB3L4 is upregulated in both a prostate cancer cell line (LNCaP) and in primary prostate cancer cells. In addition, the 1q21 amplicon containing CREB3L4 is frequently detected in hepatocellular carcinoma, and CREB3L4 is significantly overexpressed in tumor tissues compared with nontumorous tissue counterparts [16]. TRIP13 (thyroid hormone receptor interactor 13) encodes a protein that is a novel mitotic checkpoint-silencing protein and plays centrally important roles in mitotic checkpoint complex (MCC) disassembly and checkpoint inactivation. TRIP13 knockdown can delay metaphase-to-anaphase transition, while TRIP13 overexpression can trigger premature mitotic checkpoint silencing and thereby promote cancer development [17]. Overexpression of TRIP13 has been shown to result in malignant transformation of non-malignant cells and high expression of TRIP13 in squamous cell carcinoma of the head and neck can lead to aggressive, treatment-resistant tumors and enhanced repair of DNA damage [18]. CCNE2 (cyclin E2) specifically interacts with the CIP/KIP family of CDK inhibitors and plays a role in cell cycle G1/S transition. Elevated CCNE2 level can lead to genomic instability such as increased proportion of abnormal mitoses, micronuclei, and chromosomal aberrations [19]. Significantly increased expression levels of CCNE2 have been observed in various tumors such as those of the lung, breast, pancreas, and nasopharyngx, and have been shown to play important roles in the proliferation, invasion, metastasis, and poor prognosis of these cancers [20, 21]. The copy number gains and upregulation of expression of CREB3L4, TRIP13 and CCNE2 in more than 55% of our LCXW samples suggest that their expression might be gene-dose sensitive and that they are potential oncogenes in LCXW. To the best of our knowledge, expression changes in neither CREB3L4 nor TRIP13 have been reported in lung cancer, suggesting previously unknown associations with LCXW and lung cancer in general. AHRR (aryl-hydrocarbon receptor repressor) encodes a protein participating in the aryl hydrocarbon receptor (AhR) signaling cascade, which mediates dioxin toxicity and is involved in regulation of cell growth and differentiation. AHRR functions as a feedback modulator by repressing AhR-dependent gene expression. The genetic polymorphisms in AHRR have been shown to be risk factors for cancer via ameliorating this AhR repressor activity [22, 23], and DNA methylation change in AHRR has been linked to smoking exposure and lung cancer [24]. Thus, AHRR has been proposed to function as a putative new tumor suppressor gene in multiple types of human cancers [25]. NKD2 (naked cuticle homolog 2) encodes a protein that participates in the delivery of transforming growth factor alpha (TGFα)-containing vesicles and functions as a negative regulator of Wnt receptor signaling through interaction with members of the Dishevelled family. Downregulation of NKD2 is frequently regulated by hypermethylation of the promoter region and can cause Wnt activation and TGFα misdelivery, which often leads to tumorigenesis [26-28]. KLF10 (Kruppel-like factor 10) encodes a transcriptional repressor that acts as an effector of TGF-β signaling. KLF10 functions as a toggle by differential coupling of Sin3-histone deacetylase and P300/PCB-associated factor to integrate antagonistic signals regulating FOXP3, resulting in immune activation, and it also can directly bind to the TGF-β RII promoter in CD8(+)T cells, leading to enhanced gene expression and tumor immune response. KLF10 can inhibit breast cancer invasion and metastasis by inhibiting epidermal growth factor receptor (EGFR) transcription and the EGFR signaling pathway [29]. The expression of KLF10 is inversely correlated with pancreatic cancer stage, prognosis and overall survival [30]. In our study, the copy number of AHRR, NKD2, and KLF10 increased in at least 40% of the LCXW samples (34/84), whereas their expression was downregulated in at least 48% (28/58), suggesting that their expression is not gene-dose dependent, and if decreased, might promote the development of LCXW. To the best of our knowledge, expression changes in neither NKD2 nor KLF10 have been reported in lung cancer, suggesting that these genes are previously unknown tumor suppressor genes in LCXW and in lung cancer in general. MYC (v-myc avian myelocytomatosis viral oncogene homolog) encodes a multifunctional nuclear phosphoprotein that plays a role in cell cycle progression, apoptosis, and cellular transformation and regulates transcription of specific target genes. MYC, one of the most studied oncogenes [31], is typically overexpressed in variety of malignant tumors such as lung cancer, lymphomas, breast cancer, gastric cancer, and colon cancer and is involved in cell proliferation, differentiation, apoptosis and cell cycle [32-37]. Unexpectedly, the expression of MYC was significantly decreased in 52% (30/58) of the LCXW samples tested, although its copy number increased in 48% (40/84) of the LCXW samples tested, which indicates that MYC may not play a carcinogenic role in LCXW. This might reflect one aspect of the different pathogenesis of LCXW and lung cancers in other geographic areas. In conclusion, this study provided an integrative analysis of genome-wide DNA CNAs and gene expression to identify candidate driver genes in LCXW. Our findings suggest that CREB3L4, TRIP13, and CCNE2 are potential oncogenes, AHRR, NKD2, and KLF10 are potential tumor suppressor genes in LCXW, while MYC, one of the most studied oncogenes, might not play a carcinogenic role in LCXW. These discoveries will help us understand the pathogenesis and provide novel potential therapeutic targets for LCXW.

Multi_panel array-CGH result of sample 529550A01.

(PDF) Click here for additional data file.

Multi_panel array-CGH result of sample 529550A02.

(PDF) Click here for additional data file.

Multi_panel array-CGH result of sample 529550A03.

(PDF) Click here for additional data file.

Multi_panel array-CGH result of sample 529552A02.

(PDF) Click here for additional data file.

Multi_panel array-CGH result of sample 529564A02.

(PDF) Click here for additional data file.

Multi_panel array-CGH result of sample 529564A03.

(PDF) Click here for additional data file.

Multi_panel array-CGH result of sample 529609A02.

(PDF) Click here for additional data file.

Multi_panel array-CGH result of sample 529609A03.

(PDF) Click here for additional data file.

CCNE2, MYC, and CREB3L4 were involved in the PI3K-Akt signaling pathway.

(TIF) Click here for additional data file.

CCNE2 and CREB3L4 were involved in the prostate cancer signaling pathway.

(TIF) Click here for additional data file.

CCNE2 and MYC were involved in the cell cycle signaling pathway.

(TIF) Click here for additional data file.

MYC and NKD2 were involved in the Wnt signaling pathway.

(TIF) Click here for additional data file.

CCNE2 and MYC were involved in the pathways in cancer.

(TIF) Click here for additional data file.

Copy number alterations in lung cancer in Xuanwei identified by array-CGH analysis.

(XLSX) Click here for additional data file.

Differentially expressed genes in lung cancer in Xuanwei identified by microarray analysis.

(XLSX) Click here for additional data file.

The recurrent CNAs presented in at least 3 samples.

(XLSX) Click here for additional data file.

The concordant recurrent CNAs presented in at least 3 samples.

(XLSX) Click here for additional data file.

38 candidate CNAs and the expression changes of genes located in these CNAs.

(XLSX) Click here for additional data file.
  36 in total

1.  Variance component estimation for mixed model analysis of cDNA microarray data.

Authors:  Barbara Sarholz; Hans-Peter Piepho
Journal:  Biom J       Date:  2008-12       Impact factor: 2.207

2.  c-Myc enhances colon cancer cell-mediated angiogenesis through the regulation of HIF-1α.

Authors:  Cheng Chen; Shaoxin Cai; Guihua Wang; Xiaonian Cao; Xi Yang; Xuelai Luo; Yongdong Feng; Junbo Hu
Journal:  Biochem Biophys Res Commun       Date:  2012-12-10       Impact factor: 3.575

3.  Genetic susceptibility to dioxin-like chemicals' induction of cytochrome P4501A2 in the human adult linked to specific AhRR polymorphism.

Authors:  Wan-Ting Hung; George H Lambert; Ping-Wei Huang; Donald G Patterson; Yue Leon Guo
Journal:  Chemosphere       Date:  2012-11-17       Impact factor: 7.086

Review 4.  MYC oncogenes and human neoplastic disease.

Authors:  C E Nesbit; J M Tersak; E V Prochownik
Journal:  Oncogene       Date:  1999-05-13       Impact factor: 9.867

5.  [Expressions of APC and c-Myc and its implication on non-small cell lung cancer].

Authors:  Shi-feng Wang; Qian Liu; Shang-fu Zhang; Dian-ying Liao; Huan Xu; Wen-yan Zhang; Wei-min Li; Lun-xu Liu
Journal:  Sichuan Da Xue Xue Bao Yi Xue Ban       Date:  2010-09

6.  Cyclin E2 induces genomic instability by mechanisms distinct from cyclin E1.

Authors:  C Elizabeth Caldon; C Marcelo Sergio; Andrew Burgess; Andrew J Deans; Robert L Sutherland; Elizabeth A Musgrove
Journal:  Cell Cycle       Date:  2013-01-16       Impact factor: 4.534

Review 7.  The epidemic status and risk factors of lung cancer in Xuanwei City, Yunnan Province, China.

Authors:  Yize Xiao; Ying Shao; Xianjun Yu; Guangbiao Zhou
Journal:  Front Med       Date:  2012-12-07       Impact factor: 4.592

8.  Silica-volatile interaction and the geological cause of the Xuan Wei lung cancer epidemic.

Authors:  David J Large; Shona Kelly; Baruch Spiro; Linwei Tian; Longyi Shao; Robert Finkelman; Mingquan Zhang; Chris Somerfield; Steve Plint; Yasmin Ali; Yiping Zhou
Journal:  Environ Sci Technol       Date:  2009-12-01       Impact factor: 9.028

9.  CREB3L4, INTS3, and SNAPAP are targets for the 1q21 amplicon frequently detected in hepatocellular carcinoma.

Authors:  Yoshikazu Inagaki; Kohichiroh Yasui; Mio Endo; Tomoaki Nakajima; Keika Zen; Kazuhiro Tsuji; Masahito Minami; Shinji Tanaka; Masafumi Taniwaki; Yoshito Itoh; Shigeki Arii; Takeshi Okanoue
Journal:  Cancer Genet Cytogenet       Date:  2008-01-01

10.  Mutation spectra of smoky coal combustion emissions in Salmonella reflect the TP53 and KRAS mutations in lung tumors from smoky coal-exposed individuals.

Authors:  Courtney A Granville; Nancy M Hanley; Judy L Mumford; David M DeMarini
Journal:  Mutat Res       Date:  2003-04-09       Impact factor: 2.433

View more
  8 in total

1.  The aryl hydrocarbon receptor repressor - More than a simple feedback inhibitor of AhR signaling: Clues for its role in inflammation and cancer.

Authors:  Christoph F A Vogel; Thomas Haarmann-Stemmann
Journal:  Curr Opin Toxicol       Date:  2017-03-01

2.  Integrated Analysis of Copy Number Variations and Gene Expression Profiling in Hepatocellular carcinoma.

Authors:  Chenhao Zhou; Wentao Zhang; Wanyong Chen; Yirui Yin; Manar Atyah; Shuang Liu; Lei Guo; Yi Shi; Qinghai Ye; Qiongzhu Dong; Ning Ren
Journal:  Sci Rep       Date:  2017-09-05       Impact factor: 4.379

3.  Long non-coding RNA 1308 promotes cell invasion by regulating the miR-124/ADAM 15 axis in non-small-cell lung cancer cells.

Authors:  Hongliang Li; Xiaopeng Guo; Qiutian Li; Pengzhan Ran; Xudong Xiang; Yuncang Yuan; Tianqi Dong; Bei Zhu; Lei Wang; Fangfang Li; Chunyan Yang; Dengcai Mu; Dan Wang; Chunjie Xiao; Shangyong Zheng
Journal:  Cancer Manag Res       Date:  2018-12-03       Impact factor: 3.989

4.  Increased expression of TRIP13 drives the tumorigenesis of bladder cancer in association with the EGFR signaling pathway.

Authors:  Yanjun Gao; Shanhui Liu; Qi Guo; Su Zhang; Youli Zhao; Hanzhang Wang; Tianbao Li; Yuwen Gong; Yuhan Wang; Tao Zhang; Zhilong Dong; Dean Bacich; Wasim H Chowdhury; Ronald Rodriguez; Zhiping Wang
Journal:  Int J Biol Sci       Date:  2019-06-02       Impact factor: 6.580

5.  MicroRNA expression profiling of lung adenocarcinoma in Xuanwei, China: A preliminary study.

Authors:  Zaoxiu Hu; Xiaoxiong Wang; Yanlong Yang; Yonghe Zhao; Zhenghai Shen; Yunchao Huang
Journal:  Medicine (Baltimore)       Date:  2019-05       Impact factor: 1.817

6.  MIR99AHG is a noncoding tumor suppressor gene in lung adenocarcinoma.

Authors:  Chencheng Han; Hong Li; Zhifei Ma; Guozhang Dong; Qianyun Wang; Siwei Wang; Panqi Fang; Xiang Li; Hao Chen; Tongyan Liu; Lin Xu; Jie Wang; Jun Wang; Rong Yin
Journal:  Cell Death Dis       Date:  2021-04-30       Impact factor: 8.469

7.  Krüppel-Like Factor 10 participates in cervical cancer immunoediting through transcriptional regulation of Pregnancy-Specific Beta-1 Glycoproteins.

Authors:  Daniel Marrero-Rodríguez; Keiko Taniguchi-Ponciano; Malayannan Subramaniam; John R Hawse; Kevin S Pitel; Hugo Arreola-De la Cruz; Victor Huerta-Padilla; Gustavo Ponce-Navarrete; Ma Del Pilar Figueroa-Corona; Laura Gomez-Virgilio; Teresa I Martinez-Cuevas; Monica Mendoza-Rodriguez; Miriam Rodriguez-Esquivel; Pablo Romero-Morelos; Jorge Ramirez-Salcedo; Michael Baudis; Marco Meraz-Rios; Florinda Jimenez-Vega; Mauricio Salcedo
Journal:  Sci Rep       Date:  2018-06-21       Impact factor: 4.379

8.  A risk score system based on DNA methylation levels and a nomogram survival model for lung squamous cell carcinoma.

Authors:  Ming Zhang; Libing Sun; Yi Ru; Shasha Zhang; Junjun Miao; Pengda Guo; Jinghuan Lv; Feng Guo; Biao Liu
Journal:  Int J Mol Med       Date:  2020-04-27       Impact factor: 4.101

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.