Literature DB >> 20955586

Genome wide analysis of DNA copy number neutral loss of heterozygosity (CNNLOH) and its relation to gene expression in esophageal squamous cell carcinoma.

Nan Hu1, Robert J Clifford, Howard H Yang, Chaoyu Wang, Alisa M Goldstein, Ti Ding, Philip R Taylor, Maxwell P Lee.   

Abstract

BACKGROUND: Genomic instability plays an important role in human cancers. We previously characterized genomic instability in esophageal squamous cell carcinomas (ESCC) in terms of loss of heterozygosity (LOH) and copy number (CN) changes in tumors using the Affymetrix GeneChip Human Mapping 500K array in 30 cases from a high-risk region of China. In the current study we focused on copy number neutral (CN = 2) LOH (CNNLOH) and its relation to gene expression in ESCC.
RESULTS: Overall we found that 70% of all LOH observed was CNNLOH. Ninety percent of ESCCs showed CNNLOH (median frequency in cases = 60%) and this was the most common type of LOH in two-thirds of cases. CNNLOH occurred on all 39 autosomal chromosome arms, with highest frequencies on 19p (100%), 5p (96%), 2p (95%), and 20q (95%). In contrast, LOH with CN loss represented 19% of all LOH, occurred in just half of ESCCs (median frequency in cases = 0%), and was most frequent on 3p (56%), 5q (47%), and 21q (41%). LOH with CN gain was 11% of all LOH, occurred in 93% of ESCCs (median frequency in cases = 13%), and was most common on 20p (82%), 8q (74%), and 3q (42%). To examine the effect of genomic instability on gene expression, we evaluated RNA profiles from 17 pairs of matched normal and tumor samples (a subset of the 30 ESCCs) using Affymetrix U133A 2.0 arrays. In CN neutral regions, expression of 168 genes (containing 1976 SNPs) differed significantly in tumors with LOH versus tumors without LOH, including 101 genes that were up-regulated and 67 that were down-regulated.
CONCLUSION: Our results indicate that CNNLOH has a profound impact on gene expression in ESCC, which in turn may affect tumor development.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20955586      PMCID: PMC3091724          DOI: 10.1186/1471-2164-11-576

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Genomic instability is important for cancer development and can manifest as copy number (CN) gain or loss as well as loss of heterozygosity (LOH). Copy number neutral LOH (CNNLOH) has been observed in tumors following the widespread application of SNP array technology [1,2]. CNNLOH is common in many tumor types, including basal cell carcinoma [3], acute myeloid leukemia [4,5], medulloblastoma [6], melanoma [7], follicular lymphoma [8], colorectal cancers [9-11], glioblastoma [12,13], cutaneous squamous cell carcinomas [14], acute promyelocytic leukemia [15], acute lymphoblastic leukemia [16], ovarian tumor [17], and esophageal adenocarcinoma [18], and has recently been reviewed for myeloid malignancies [19]. CNNLOH is thought to result from mitotic recombination or nondisjunction in somatic tumor cells [3]. However, the distribution of complex DNA alterations and its relation to gene expression in tumors have not been characterized in ESCC. ESCC is a common malignancy worldwide and one of the most common cancers in the Chinese population; Shanxi Province in north central China has some of the highest esophageal cancer rates in the world [20,21]. Previously, we identified several regions of LOH and CN alteration in ESCC using microsatellite markers and low- and high-density SNP arrays [22-27], where the majority of ESCC patients from this high-risk population were found to have high genomic instability and high frequency of LOH on several chromosome arms. However, we have not found causal mutations in candidate genes within the LOH regions identified. For example, 82% of 56 ESCCs showed LOH when tested with four microsatellite markers flanking ANXA1 (9q11-q21), but no somatic mutations were detected in these patients [28]. Another example is BRCA2, which also showed frequent LOH in ESCC (57% for D13S260, 83% for D13S767), but only infrequent somatic mutations in these cancer patients (2/56, 3.5%) [29,30]. Contrary to expectation, expression of BRCA2 was often increased (unpublished data). In the present study, we analyzed DNA from 30 micro-dissected ESCC tumors, adjacent normal tissue, and blood DNA from the same patient using the Affymetrix 500K SNP array to identify the distribution of complex DNA alterations, including CNNLOH, and we related CNNLOH to expression of the genes affected as assessed with the Affymetrix U133A 2.0 array in these patients.

Methods

Case selection

This study was approved by the Institutional Review Boards of the Shanxi Cancer Hospital and the US National Cancer Institute (NCI). Cases diagnosed with ESCC between 1998 and 2001 in the Shanxi Cancer Hospital in Taiyuan, Shanxi Province, PR China, and considered candidates for curative surgical resection were identified and recruited to participate in this study. None of the cases had prior therapy and Shanxi was the ancestral home for all. After obtaining informed consent, cases were interviewed to obtain information on demographics, cancer risk factors (eg, smoking, alcohol drinking, and detailed family history of cancer), and clinical information. The cases evaluated here were part of a larger case-control study of upper gastrointestinal cancers conducted in Shanxi Province [31-33].

Biological specimen collection and processing

Venous blood (10 ml) was taken from each case prior to surgery and germ-line DNA from whole blood was extracted and purified using the standard phenol/chloroform method. Tumor and adjacent normal tissues were dissected at the time of surgery and stored in liquid nitrogen until used. One 5-micron section was H&E stained and reviewed by a pathologist from the NCI to guide the micro-dissection. Five to ten consecutive 8-micron sections were cut from fresh frozen tumor and adjacent normal tissues. Tumor and normal cells were manually micro-dissected under light microscopy. DNA was extracted from micro-dissected tumor as previously described [34] using the protocol from the Puregene DNA Purification Tissue Kit (Gentra Systems, Inc., Minneapolis, MN). RNA was extracted from 17 of these micro-dissected tumor and matched normal tissue pairs using the protocol from the PureLink Micro-to-Midi Total RNA Purification System (Catalog number 12183-018, Invitrogen, Carlsbad, CA). RNA quality and quantity were determined using the RNA 6000 Labchip/Agilent 2100 Bioanalyzer (Agilent Technologies, Germantown, MD). The same tissue blocks were used for extraction of both DNA and RNA for each case studied.

Target preparation for GeneChip Human Mapping 500 K array set

The Affymetrix GeneChip Human Mapping 500 K array set contains ~262,000 (Nsp I array) and ~238,000 (Sty I array) SNPs (mean probe spacing = 5.8 Kb, mean heterozygosity = 27%). A detailed gene chip protocol can be found at http://www.affymetrix.com/support/downloads/manuals/500k_assay_manual.pdf. Experiments were conducted according to the protocol (GeneChip Mapping Assay manual) supplied by Affymetrix, Inc. (Santa Clara, CA). Genotype calls were generated by GTYPE v 4.0 software (Affymetrix). Germ-line, tumor and adjacent normal DNA from each case were run together in parallel in the same experiment (ie, same batch, same day). The GEO accession numbers for these array data are GSE15526 and GSE20347.

Probe preparation and hybridization for Human Genome U133A 2.0 array

The Affymetrix Human Genome U133A 2.0 array is a single array used to interrogate expression of 14,500 well-characterized human genes. Array experiments were performed using 1-5 μg total RNA each. We followed the protocol provided by the manufacturer to carry out reverse transcription, labeling, and hybridization.

GeneChip 500 K array data analysis

Probe intensity data from Affymetrix 500 K SNP arrays were used to identify DNA alterations in the present study. To avoid gender-related issues, SNPs mapped to either the X or Y chromosome were excluded. Copy number (CN) loss or gain was based on comparisons of either adjacent normal to germ-line DNA or tumor to germ-line DNA. Microarray data were first normalized using the gtype-probe set-genotype package included in Affymetrix Power Tools version 1.85. Each tumor sample was individually normalized via the BRLMM algorithm along with 99 blood samples. These blood samples were obtained from the 30 ESCC cases evaluated in the present study plus 69 healthy controls (age-, sex-, and region-matched to cases) who were all part of a larger case-control study of upper gastrointestinal cancers conducted in Shanxi Province (as noted above). Paired CN analysis was then performed on each sample using the Affymetrix Power Tools paired-copy-number workflow, which implements the Affymetrix Copy Number Analysis Tool (CNAT) algorithm. DNA obtained from the blood of each case served as the normal control; a sliding window of 100 kb was chosen to optimize the identification of extended regions of CN alteration (see http://www.affymetrix.com/support/technical/whitepapers/cnat_4_algorithm_whitepaper.pdf). The output of the CNAT program is CN state rather than an absolute CN prediction: normal CN corresponds to a state of 2; zero and 1 correspond to CN loss; and states 3 and 4 correspond to CN gain. In the present study, we modified the method for identifying LOH used in our previous studies [26,27]. Here, LOH was determined using the Affymetrix Power Tools copynumber-pipeline program paired-LOH workflow. Input was *.CHP files generated with the gtype-probeset-genotype package as described above. Matched blood DNA served as the reference for LOH analysis for each tumor and normal adjacent sample.

Combination of LOH and CN alterations

We defined six combinations of copy number state and LOH status. LOH positive loci may have CN loss (CN ≤ 1), be CN neutral (CNNLOH, CN = 2) or show CN gain (CN ≥ 3); Likewise, LOH negative loci may show CN loss, gain, or neutrality. LOH and CN segments for each tumor were defined independently for each sample as contiguous blocks of informative SNPs that possessed the same LOH and CN state. Endpoints of LOH/CN segments were defined by informative SNPs. Some uninformative SNPs were located between these LOH/CN segments; we considered these SNPs to have an undefined LOH/CN state (see Additional file 1/Figure S1). Segment sizes were empirically observed from the data.

Comparison of CN status in DNA from blood versus micro-dissected adjacent normal tissue

DNA isolated from normal adjacent tissue is frequently used as a control in microarray experiments. In the present study we used DNA isolated from peripheral blood. We expected peripheral blood DNA to be a superior control for two reasons: first, unlike adjacent normal tissue, it is does not run the risk of being contaminated with tumor cells; second, adjacent normal tissue may actually be precancerous and contain genetic lesions. To examine whether blood DNA and adjacent normal esophageal DNA were equivalent controls, we compared copy number state calls for blood and normal adjacent from each of the 30 ESCC patients. We found that the two controls were equivalent: 99.29% to 99.99% of all copy number calls were identical. Overall, 99.96% of SNPs in blood and 99.93% in normal adjacent tissue were CN = 2 state.

Human Genome U133A 2.0 array data analysis and relation between CNNLOH and mRNA expression

The Robust Multiarray Average (RMA) algorithm [35,36] implemented in Bioconductor in R http://www.bioconductor.org was used for background correction and normalization across all samples. For each sample log2 fold changes in gene expression were calculated by subtracting the adjacent normal RMA value from the corresponding tumor RMA value. To determine whether any gene showed a difference in the tumor versus normal gene expression fold change that was dependent on LOH state, we performed the following steps: (i) First, genes assayed by the U133A microarray were mapped onto each LOHCN segment of each sample. Map locations of genes were taken from the Affymetrix version na29 microarray annotation file. Note that probe sets from the same gene may have different reference sequences which differ in their chromosomal locations. Also, not every gene will map to every sample - in a particular sample, a gene may map to a gap between LOHCN regions. (ii) Next, we identified genes for which at least two of the 17 ESCC samples with expression data were LOH negative and at least two samples were LOH positive. (iii) We then performed two-sided unpaired t-tests comparing the log2 fold changes for a probe set in LOH positive and LOH negative samples. A P-value < 0.01 was considered significant. (iv) Finally, SNPs on the 500 K microarray were mapped to the reference sequence for each expression probe set. Since probe sets from the same gene may have different reference sequences, they may differ in the number of SNPs assigned to them (Additional file 2/Figure S2).

Results

In the present study we determined copy number and loss of heterozygosity (LOH) status in DNA isolated from germ-line and micro-dissected tumor and matched adjacent normal samples from 30 ESCC patients using the Affymetrix 500 K SNP array. The average genotype call rate was 96% (89-99%): the 250 K Nsp I array was 96% (90-98%) and 250 K Sty I array was 95% (89-99%). Genotype call rates were similar for all three tissue types examined. We first analyzed whether copy numbers were similar between DNAs from the two normal tissues: germ-line (blood) and micro-dissected adjacent normal samples. Our analysis indicated that DNA CN values were similar between the two normal tissues (Additional file 3 - Table S1), as expected. Our results indicate that germ-line DNA can be used as a normal control in studies of CN alteration; it is more readily available than matched adjacent normal tissue.

Complex DNA alterations in ESCC

The distribution of DNA alterations in each of the 30 ESCC cases is summarized in Table 1 (with LOH) and in Additional file 4/Table S2 (without LOH). We divided genomic regions into three groups based on CN states: CN loss, neutral, and gain. We found that 50%, 90%, and 93% of cases showed LOH in the CN loss, neutral, and gain groups, respectively (Table 1). For each chromosome, we also calculated the percentage of SNPs involved in LOH for each group. They ranged between 20-57%, 7-100%, and 2-100% for the CN loss, neutral, and gain groups, respectively (Table 1). Our results suggest that LOH with CN neutral or gain are common phenomena in ESCC. For SNPs without LOH, we also calculated the percent of SNPs in each CN state; averages were 5%, 84%, and 11% for CN loss, neutral, and gain, respectively.
Table 1

LOH by copy number in ESCC cases by individual case (N = 30)

Case IDTotal no. informative SNPs with LOHNo. informative SNPs with LOH and CN = 1 (fraction)No. informative SNPs with LOH and CN = 2 (fraction)No. informative SNPs with LOH and CN = 3 or 4 (fraction)
*131,8082,260 (0.07)26,801 (0.84)2,747 (0.09)
23680 (0)205 (0.56)163 (0.44)
33770 (0)139 (0.37)238 (0.63)
*436,1752,093 (0.06)27,655 (0.76)6,427 (0.18)
*5240 (0)0 (0)24 (1.00)
*614,751210 (0.01)12,661 (0.86)1,880 (0.13)
*73,5590 (0)2,905 (0.82)654 (0.18)
*82,4084 (0)2,217 (0.92)187 (0.08)
*95931 (0)266 (0.45)326 (0.55)
1017,5461,075 (0.06)12,087 (0.69)4,384 (0.25)
1178,1593,678 (0.05)67,726 (0.87)6,755 (0.09)
12410 (0)0 (0)41 (1.00)
*136,113209 (0.03)5,154 (0.84)750 (0.12)
1413,4984,084 (0.30)9,190 (0.68)224 (0.02)
1530 (0)3 (1.00)0 (0)
1614310 (0)54 (0.04)1,377 (0.96)
1716,9345,732 (0.34)10,842 (0.64)360 (0.02)
182,1070 (0)1,553 (0.74)554 (0.26)
*1900 (0)0 (0)0 (0)
*205270 (0)137 (0.26)390 (0.74)
*2119,95462 (0)16,332 (0.82)3,560 (0.18)
*2211,1804,410 (0.39)6,357 (0.57)413 (0.04)
*2314,5235,939 (0.41)5,953 (0.41)2,631 (0.18)
2415,7735,672 (0.36)9,354 (0.59)747 (0.05)
*2523,00520 (0)17,114 (0.74)5,871 (0.26)
*2637,69121,495 (0.57)15,229 (0.40)967 (0.03)
2712,1204,500 (0.37)6,299 (0.52)1,321 (0.11)
284940 (0)37 (0.07)457 (0.93)
*2918,222444 (0.02)15,400 (0.85)2,378 (0.13)
*3042,49916,432 (0.39)25,394 (0.60)673 (0.02)

No. cases with SNP LOH fraction = 01532
Range (fraction SNPs with LOH)0-0.570-1.000-1.00
Median (fraction SNPs with LOH)0.000.600.13
Global average (fraction SNPs with LOH)0.190.700.11

*Case also examined with Affymetrix U133A 2.0 chip

LOH by copy number in ESCC cases by individual case (N = 30) *Case also examined with Affymetrix U133A 2.0 chip The distribution of the six types of DNA alterations for all 30 cases by chromosome arm is shown in Table 2 (with LOH) and Additional file 5/Table S3 (without LOH). CNNLOH was observed on all chromosome arms, but most frequently on 19p (100%), 5p (96%), 2p (95%), and 20q (95%). The highest frequencies of LOH with CN loss (CN = 1) were found on 3p (56%), 5q (47%), and 21q (41%); relatively high frequencies were also seen on 18q (31%), 11q (29%), 1p (28%), 19q (27%), and 11p (25%). LOH with CN gain was most common on 20p (82%), 8q (74%) and 3q (42%) (Table 2 and Figure 1). Taken together, our results show that LOH with CNN or CN gain were much more frequent than LOH with CN loss on every chromosome arm but one (ie, 3p).
Table 2

LOH by copy number in ESCC cases by chromosomal arm (N = 30 cases)

Chromosomal armTotal no. informative SNPs with LOHNo. informative SNPs with LOH and CN = 1 (fraction)No. informative SNPs with LOH and CN = 2 (fraction)No. informative SNPs with LOH and CN = 3 or 4 (fraction)
1p10,5762,930 (0.28)6,522 (0.62)1,124 (0.11)
1q8,366459 (0.05)7,674 (0.92)233 (0.03)
2p11,321371 (0.03)10,770 (0.95)180 (0.02)
2q23,1153,015 (0.13)18,939 (0.82)1,161 (0.05)
3p27,59315,335 (0.56)12,064 (0.44)194 (0.01)
3q7,25641 (0.01)4,168 (0.57)3,047 (0.42)
4p12,2234,771 (0.39)7,452 (0.61)0 (0)
4q28,1059,142 (0.33)17,905 (0.64)1,958 (0.04)
5p1,3150 (0)1,259 (0.96)56 (0.04)
5q16,5377,744 (0.47)8,777 (0.53)16 (0)
6p5,622342 (0.06)4,870 (0.87)410 (0.07)
6q3,365165 (0.05)2,773 (0.82)427 (0.13)
7p7,3899 (0)5,491 (0.74)1,889 (0.26)
7q8,001151 (0.02)6,452 (0.74)1,398 (0.17)
8p8,8031,580 (0.18)6,234 (0.71)989 (0.11)
8q17,32147 (0)4,633 (0.27)12,641 (0.74)
9p18,2922,772 (0.15)13,333 (0.73)2,187 (0.12)
9q31,4001,965 (0.06)27,212 (0.87)2,223 (0.07)
10p2,739472 (0.17)2,063 (0.75)204 (0.07)
10q14,6511,728 (0.12)12,075 (0.82)848 (0.06)
11p9,3912,367 (0.25)6,902 (0.73)122 (0.01)
11q17,3775,116 (0.29)10,422 (0.60)1,839 (0.11)
12p5,229156 (0.03)3,251 (0.62)1,822 (0.34)
12q6,79467 (0.01)5,980 (0.88)747 (0.11)
13q35,6485,897 (0.17)25,964 (0.73)3,787 (0.11)
14q10,9311,484 (0.14)7,446 (0.68)2,001 (0.18)
15q10,194685 (0.07)8,313 (0.82)1,196 (0.12)
16p1,395111 (0.08)1,284 (0.92)0 (0)
16q3,1271 (0)2,832 (0.91)294 (0.09)
17p7,719324 (0.04)6,939 (0.90)456 (0.06)
17q16,32838 (0)14,754 (0.90)1,536 (0.09)
18p2,596488 (0.19)1,147 (0.44)961 (0.37)
18q9,9783,108 (0.31)6,838 (0.69)32 (0)
19p1,0691 (0)1,065 (1.00)3 (0)
19q3,505932 (0.27)2,482 (0.71)91 (0.03)
20p1,2780 (0)229 (0.18)1,049 (0.82)
20q1,77360 (0.03)1,677 (0.95)36 (0.02)
21q9,4443,825 (0.41)5,599 (0.59)20 (0)
22q4,177621 (0.15)3,274 (0.80)222 (0.05)

Range (fraction SNPs with LOH)0-0.560.18-1.000-0.82
Median (fraction SNPs with LOH)0.050.710.05
Global average (fraction SNPs with LOH)0.190.700.11
Figure 1

Patterns of loss of heterozygosity and copy number variation in 30 ESCC samples for chromosome 3. Each row (numbered 1 - 30) represents an individual ESCC sample. Circles indicate the positions of SNPs showing LOH. SNP positions are color coded as follows: black indicates copy number neutral LOH; blue indicates LOH accompanied by copy number reduction; red indicates LOH with copy number gain. An ideogram of the chromosome is at the bottom of the figure.

LOH by copy number in ESCC cases by chromosomal arm (N = 30 cases) Patterns of loss of heterozygosity and copy number variation in 30 ESCC samples for chromosome 3. Each row (numbered 1 - 30) represents an individual ESCC sample. Circles indicate the positions of SNPs showing LOH. SNP positions are color coded as follows: black indicates copy number neutral LOH; blue indicates LOH accompanied by copy number reduction; red indicates LOH with copy number gain. An ideogram of the chromosome is at the bottom of the figure. Results of CN alterations in non-LOH group by chromosome arms are summarized in Additional file 5/Table S3. Briefly, a frequency of CN loss ≥ 10% was observed on eight chromosome arms (3p, 4p, 4q, 5q, 8p, 9p, 11q, and 13q). A frequency of CN gain ≥ 10% was observed on 13 chromosome arms (1q, 2p, 2q 3q, 5p, 7p, 7q, 8q, 12p, 14q, 18p, 20p, and 20q).

Relation between genomic alterations and gene expression

The average present call rate on the Human Genome U133A array was 53% (range 51- 61%) for the 34 chips from the 17 sample pairs with sufficient tissue for RNA isolation and testing. To investigate the relation between LOH/CNV and gene expression levels, we intersected genes on the Affymetrix U133A chip with SNPs on the 500 K SNP array. SNPs that mapped within genes are summarized in Additional file 6/Table S4 and include 169,687 SNPs within 12,225 genes. We were interested in identifying differentially-expressed genes between LOH and non-LOH groups in genes that were CN neutral. A total of 4,572 genes qualified for this analysis (see Methods). Among these genes, 168 genes showed significant differences in expression between tumors with and without LOH (P < 0.01) (Additional file 7/Table S5). Based on chance alone (at the P < 0.01 level), differences in only 45 genes would be expected, therefore, expression differences were observed in over three times as many genes as expected. One hundred and one (60%) of the 168 genes showed lower expression levels in CNNLOH than in the normal group (ie, CNN, no LOH), whereas 67 genes (40%) showed higher expression levels in CNNLOH (Additional file 7/Table S5). Twenty-eight of the 101 down-regulated genes (32 probes) and 18 of the 67 up-regulated genes (19 probes) showed expression differences ≥ 2-fold (Table 3). These findings suggest that in the CN neutral state, LOH can affect gene expression.
Table 3

Comparison of gene expression in copy number neutral (CNN) genes with LOH and without LOH (normal) (N = 46 genes significantly differentially-expressed 2-fold or greater)*

Gene nameCytobandProbeset on U133A arrayNo. cases with LOHNo. cases without LOHNo. SNPsFold change CNN with LOHFold change CNN without LOHT-testP-value
Genes where expression level of CNN with LOH is less than CNN without LOH (N = 28 genes)
ADKchr10q22|10q11-q24204120_s_at213460.4040.6063.1388.559E-03
ADKchr10q22|10q11-q24204119_s_at213460.4110.6193.1957.278E-03
AIM1Lchr1p36.11220289_s_at21560.0930.2013.5484.812E-03
APOL6chr22q12.3219716_at2540.4670.8944.6599.543E-03
ATP6V0E1chr5q35.2201171_at21250.3750.6693.2349.843E-03
ATP6V0E1chr5q35.2200096_s_at21250.4800.8724.4455.699E-03
BACH1chr21q22.11204194_at21290.4980.7383.9702.098E-03
BTG2chr1q32201236_s_at21410.4600.9864.0951.183E-03
CD2chr1p13.1205831_at21410.3240.6283.4629.221E-03
CEACAM6chr19q13.2211657_at21330.0070.1004.7797.023E-03
CRATchr9q34.1209522_s_at7810.2930.5673.1697.495E-03
CSDE1chr1p22202646_s_at21430.3430.8957.8357.590E-05
CYP4F3chr19p13.2206515_at2131360.0650.3073.3295.631E-03
DIO2chr14q24.2-q24.3203699_s_at21140.0720.1273.5656.175E-03
DIO2chr14q24.2-q24.3203700_s_at21140.1680.2803.6973.752E-03
EIF4EBP2chr10q21-q22208770_s_at21410.4520.6576.3002.692E-05
FASchr10q24.1216252_x_at21380.4120.6483.0589.883E-03
HERC6chr4q22.1219352_at3910.1050.5385.7692.901E-04
ICAM3chr19p13.3-p13.2204949_at21320.3560.6023.4784.342E-03
IL1RNchr2q14.2212659_s_at212130.0730.1423.2567.793E-03
NADSYN1chr11q13.4218840_s_at28100.4480.6493.3999.384E-03
NHP2chr5q35.3216583_x_at21010.4970.7744.0912.584E-03
PDCD4chr10q24202730_s_at21240.2120.3634.4839.256E-04
PPP1R13Lchr19q13.32218849_s_at31310.1900.4875.0165.018E-04
RIPK4chr21q22.3221215_s_at51070.2170.4543.6143.658E-03
SKAP2chr7p21-p15216899_s_at210120.2930.6154.1651.973E-03
SKAP2chr7p21-p15204361_s_at210430.3590.6654.3841.591E-03
SLC6A1chr3p25-p24205152_at2860.1580.4385.0442.481E-03
STK39chr2q24.3202786_at212860.1510.2903.3636.327E-03
SYNPO2Lchr10q22.2219804_at31310.0630.3456.9258.703E-06
TGM5chr15q15.2207911_s_at21380.3820.5173.4964.389E-03
ZNF91chr19p13.1-p12206059_at21050.0260.5519.0546.430E-04

Genes where expression level of CNN with LOH is greater than CNN without LOH (N = 18 genes)
ACVR1chr2q23-q24203935_at212133.4801.751-4.5337.869E-04
ASPNchr9q22219087_at510210.5942.679-3.7922.667E-03
BUB1chr2q14209642_at21228.5714.010-3.9222.170E-03
CALUchr7q32.1200757_s_at21273.4671.830-4.6127.505E-04
CALUchr7q32.1200755_s_at21273.8991.981-3.9032.269E-03
CAMSAP1L1chr1q32.1212765_at21333.0001.357-6.2881.629E-03
CENPFchr1q32-q41209172_s_at214105.4812.643-5.9093.926E-05
CSNK1Echr22q13.1202332_at2342.9331.157-7.4349.278E-03
FAM13Achr4q22.1202973_x_at310142.7561.149-3.5086.115E-03
FHL2chr2q12-q14202949_s_at214153.1591.244-4.6974.097E-04
GSRchr8p21.1205770_at31082.5051.222-3.7014.824E-03
ITGA6chr2q31.1201656_at211214.0382.400-3.5194.820E-03
KIF14chr1q32.1206364_at213611.0273.578-8.2572.071E-06
NCAPHchr2q11.2212949_at21123.6431.943-5.0874.450E-04
NRP2chr2q33.3214632_at311292.1611.257-3.5009.276E-03
RBM28chr7q32.1218593_at21192.2101.555-3.9762.370E-03
SMAD5chr5q31205187_at21232.2080.829-10.6923.171E-07
STEAP3chr2q14.2218424_s_at214122.7071.207-6.9229.985E-06
TGS1chr8q11219231_at2812.5331.656-3.4958.601E-03

*Sorted alphabetically by gene name

Comparison of gene expression in copy number neutral (CNN) genes with LOH and without LOH (normal) (N = 46 genes significantly differentially-expressed 2-fold or greater)* *Sorted alphabetically by gene name We also compared expression of genes with LOH versus no LOH in CN loss genes. We identified six of 600 genes which showed significantly different expression between the LOH groups. All six genes showed increased expression in tumors with LOH (Table 4a).
Table 4

Comparison of gene expression in copy number loss/gain genes with LOH and without LOH*

Gene nameCytobandProbeset on U133A arrayNo. cases with LOHNo. cases without LOHNo. SNPsFold change with LOHFold change without LOHT-testP-value
Table 4.a.: Genes/probes with CN loss (CN = 1)
ATXN7chr3p21.1-p12204516_at42100.5740.258-8.2331.333E-03
CTDSPLchr3p21.3213597_s_at33130.9970.890-5.4148.188E-03
F2Rchr5q13203989_x_at3246.0831.217-6.7717.275E-03
RAB5Achr3p24-p22209089_at5260.4190.240-9.0655.698E-04
RAF1chr3p25201244_s_at3290.5550.389-24.0085.074E-04
SCN10Achr3p22-p21208578_at34231.8060.884-4.2538.107E-03

Table 4.b.: Genes/probes with CN gain (CN = 3 or 4)
ARHGAP5chr14q12217936_at2511.7310.769-4.8098.159E-03
CYP11B1chr8q21214610_at2630.9151.1144.7283.469E-03
NFATC4chr14q11.2205897_at2311.3630.903-11.6922.823E-03
PAX9chr14q12-q13207059_at2510.8980.319-5.3955.706E-03
PPFIA1chr11q13.3202066_at72127.5242.448-3.9186.425E-03
TP63chr3q28211834_s_at213170.9011.1973.4389.809E-03
TP63chr3q28211194_s_at213180.9872.6494.0083.640E-03

*Sorted alphabetically by gene name

Comparison of gene expression in copy number loss/gain genes with LOH and without LOH* *Sorted alphabetically by gene name Finally, we compared gene expression in the CN gain state between tumors with and without LOH. We found that six of 354 genes showed significant differences in expression between the two groups, including two down-regulated and four up-regulated genes (Table 4b).

Discussion

We characterized ESCC tumors for complex DNA alterations - LOH and CNV - and related these genomic alterations to gene expression. To our knowledge, this is the first report to comprehensively address the distribution of complex DNA alterations in ESCC and its relation to gene expression on a genome-wide scale. Ninety percent of cases showed CNNLOH in their tumors and, over all cases, CNNLOH was found on every chromosome arm, indicating that it is a common phenomenon. The frequency of CNNLOH observed here in ESCC was much less than has been reported in other cancers [3-19]. For example, in colon cancer and basal cell carcinoma nearly all LOH was associated with copy number neutral regions [3,10]. In general, CNNLOH occurs with variable frequency in different genomic regions in tumors of different origin. There are several differences between the study reported here and previous studies which likely influenced the results. First, DNA from micro-dissected tumor and adjacent normal was used in the present study, while either cancer DNA without matched controls or cancer cell lines were used in most other reported studies. Second, we examined LOH and CN alterations using the same SNP array platform, while other studies used SNPs for LOH and CGH arrays for CN analyses. Third, the criteria for identifying LOH differed among the studies reported. Finally, the types of cancers studied previously differ from the present study which is the first report of CNNLOH in ESCC. In previous LOH studies, we reported high-frequency LOH on several chromosome arms, including 3p, 4p, 4q, 9p, 9q, 13q, 17p, and 17q [23,26,27]. By integrating LOH and CN alteration data in the present study, we can now say that the LOH on 3p is primarily due to CN loss LOH, while the LOH on the other seven chromosome arms is predominantly due to CNNLOH. Our results showed that CNNLOH can change expression levels of genes in ESCC, either increasing or decreasing them. We do not know why CNNLOH changes gene expression, but one possibility is that the two alleles may have different gene expression levels. For example, if allele A expression is greater than allele B, the expression level for the 3 genotypes would be ordered as AA > AB > BB. CNNLOH with retention of two B alleles (genotype BB) would then show lower expression than genotype AB. Conversely, CNNLOH with loss of the allele B would result in two copies of allele A and a higher level of expression than that of AB cells. Another possibility is that the two alleles have different expression due to different epigenetic states, with LOH resulting in copies with two extreme epigenetic states. A third possibility is that one allele harbors a mutation and subsequent LOH leads to a homozygous mutant. Several studies have shown that CNNLOH regions can harbor mutated genes. For example, JAK2 V617F, FLT3-ITD, AML1/RUNX1, WT1, and NPM1 mutations were all found in CNNLOH regions in AML [15]. These various hypotheses merit testing in the future. The study design in the present study has several important features: (i) we compared CN status between DNA from germ-line and micro-dissected adjacent normal tissue; (ii) we used micro-dissected DNA from tumor tissue; (iii) we assessed both LOH and CN alterations simultaneously using the same array platform; and (iv) we integrated complex DNA alterations and gene expression data on a genome-wide level using both high density SNP and expression arrays in the same cases. A noteworthy weakness of our study is the relatively small number of cases evaluated (including a particularly small number of cases with both LOH and RNA expression data to evaluate, due in part to the 500K chip mean heterozygosity of 27%), which limited our power to detect significant differences in loci between LOH and non-LOH groups. In addition, findings for ESCC from this high-risk region may not be generalizable to populations elsewhere in the world. In summary, we investigated the distribution of complex DNA alterations in ESCCs at the genome-wide level and determined that CN neutral is the most common CN state in LOH, and that CNNLOH is a very common phenomenon overall. Importantly, we also showed that CNNLOH could alter the expression level of genes affected in ESCC.

Conclusion

CNNLOH is a common phenomenon in many cancers, including ESCC, and non-disjunction and/or somatic recombination are the most likely mechanisms for its occurrence. CNNLOH can result in changes in gene expression which are functionally significant. Expression differences in CNNLOH suggest that alleles are different in terms of their gene expression potential, and that these differences may result from differences in genotype and/or epigenetics.

Abbreviations

SNP: single nucleotide polymorphism; ESCC: esophageal squamous cell carcinoma; (CNNLOH): copy number neutral loss of heterozygosity; CN: copy number.

Authors' contributions

NH, AMG, TD, and PRT designed, conducted, and supervised the field and clinical studies; PRT obtained funding for the project; NH and CW designed and performed the laboratory analyses; NH, RJC, HHY, and MPL conducted the statistical analyses; NH and ML drafted the manuscript; NH, RJC, AMG, PRT, and MPL conceptualized the data analyses and revised and edited the manuscript. All authors read and approved the final manuscript.

Additional file 1

Figure S1. Definition of LOH/CN blocks within a single sample. Click here for file

Additional file 2

Figure S2. The relationship between genes, Affymetrix expression probesets, and SNPs. Click here for file

Additional file 3

Table S1. Comparison of copy number alterations between DNA from blood and microdissected normal tissue. Click here for file

Additional file 4

Table S2. Copy number status of SNPs without LOH by case (N = 30). Click here for file

Additional file 5

Table S3. CN status of SNPs without LOH by chromosome arm (N = 30 cases). Click here for file

Additional file 6

Table S4. No. of genes matched on Affymetrix 500 K SNP and U133A expression arrays. Click here for file

Additional file 7

Table S5. Comparison of gene expression in copy number neutral (CNN) genes with LOH and without LOH (normal) (N = 168 genes significantly differentially-expressed). Click here for file
  36 in total

1.  Allelic loss in esophageal squamous cell carcinoma patients with and without family history of upper gastrointestinal tract cancer.

Authors:  N Hu; M J Roth; M R Emmert-Buck; Z Z Tang; M Polymeropolous; Q H Wang; A M Goldstein; X Y Han; S M Dawsey; T Ding; C Giffen; P R Taylor
Journal:  Clin Cancer Res       Date:  1999-11       Impact factor: 12.531

2.  Identification of novel regions of allelic loss from a genomewide scan of esophageal squamous-cell carcinoma in a high-risk Chinese population.

Authors:  N Hu; M J Roth; M Polymeropolous; Z Z Tang; M R Emmert-Buck; Q H Wang; A M Goldstein; S S Feng; S M Dawsey; T Ding; Z P Zhuang; X Y Han; T Ried; C Giffen; P R Taylor
Journal:  Genes Chromosomes Cancer       Date:  2000-03       Impact factor: 5.006

3.  Jasmine tea consumption and upper gastrointestinal cancer in China.

Authors:  Ying Gao; Nan Hu; XiaoYou Han; Carol Giffen; Ti Ding; Alisa M Goldstein; Philip R Taylor
Journal:  Cancer Causes Control       Date:  2009-12       Impact factor: 2.506

4.  Are there any more ovarian tumor suppressor genes? A new perspective using ultra high-resolution copy number and loss of heterozygosity analysis.

Authors:  Kylie L Gorringe; Manasa Ramakrishna; Louise H Williams; Anita Sridhar; Samantha E Boyle; Jennifer L Bearfoot; Jason Li; Michael S Anglesio; Ian G Campbell
Journal:  Genes Chromosomes Cancer       Date:  2009-10       Impact factor: 5.006

5.  Epidemiology of esophageal cancer in China.

Authors:  J Y Li
Journal:  Natl Cancer Inst Monogr       Date:  1982

6.  Prevalence of copy-number neutral LOH in glioblastomas revealed by genomewide analysis of laser-microdissected tissues.

Authors:  Daisuke Kuga; Masahiro Mizoguchi; Yanlei Guan; Nobuhiro Hata; Koji Yoshimoto; Tadahisa Shono; Satoshi O Suzuki; Yoji Kukita; Tomoko Tahira; Shinji Nagata; Tomio Sasaki; Kenshi Hayashi
Journal:  Neuro Oncol       Date:  2008-08-12       Impact factor: 12.300

7.  Hidden abnormalities and novel classification of t(15;17) acute promyelocytic leukemia (APL) based on genomic alterations.

Authors:  Tadayuki Akagi; Lee-Yung Shih; Motohiro Kato; Norihiko Kawamata; Go Yamamoto; Masashi Sanada; Ryoko Okamoto; Carl W Miller; Der-Cherng Liang; Seishi Ogawa; H Phillip Koeffler
Journal:  Blood       Date:  2008-12-23       Impact factor: 22.113

8.  Genomic characterization of esophageal squamous cell carcinoma from a high-risk population in China.

Authors:  Nan Hu; Chaoyu Wang; David Ng; Robert Clifford; Howard H Yang; Ze-Zhong Tang; Quan-Hong Wang; Xiao-You Han; Carol Giffen; Alisa M Goldstein; Philip R Taylor; Maxwell P Lee
Journal:  Cancer Res       Date:  2009-07-07       Impact factor: 12.701

Review 9.  Copy neutral loss of heterozygosity: a novel chromosomal lesion in myeloid malignancies.

Authors:  Christine O'Keefe; Michael A McDevitt; Jaroslaw P Maciejewski
Journal:  Blood       Date:  2010-01-27       Impact factor: 22.113

10.  Family history of cancer and risk for esophageal and gastric cancer in Shanxi, China.

Authors:  Ying Gao; Nan Hu; XiaoYou Han; Carol Giffen; Ti Ding; Alisa Goldstein; Philip Taylor
Journal:  BMC Cancer       Date:  2009-08-05       Impact factor: 4.430

View more
  82 in total

1.  Association between circulating levels of sex steroid hormones and Barrett's esophagus in men: a case-control analysis.

Authors:  Michael B Cook; Shannon N Wood; Brooks D Cash; Patrick Young; Ruben D Acosta; Roni T Falk; Ruth M Pfeiffer; Nan Hu; Hua Su; Lemin Wang; Chaoyu Wang; Barbara Gherman; Carol Giffen; Cathy Dykes; Veronique Turcotte; Patrick Caron; Chantal Guillemette; Sanford M Dawsey; Christian C Abnet; Paula L Hyland; Philip R Taylor
Journal:  Clin Gastroenterol Hepatol       Date:  2014-08-24       Impact factor: 11.382

2.  Predicting the molecular role of MEIS1 in esophageal squamous cell carcinoma.

Authors:  Abolfazl Rad; Moein Farshchian; Mohammad Mahdi Forghanifard; Maryam M Matin; Ahmad Reza Bahrami; Dirk Geerts; Azadeh A'rabi; Bahram Memar; Mohammad Reza Abbaszadegan
Journal:  Tumour Biol       Date:  2015-08-28

3.  Genomic Characterization of Esophageal Squamous Cell Carcinoma Reveals Critical Genes Underlying Tumorigenesis and Poor Prognosis.

Authors:  Hai-De Qin; Xiao-Yu Liao; Yuan-Bin Chen; Shao-Yi Huang; Wen-Qiong Xue; Fang-Fang Li; Xiao-Song Ge; De-Qing Liu; Qiuyin Cai; Jirong Long; Xi-Zhao Li; Ye-Zhu Hu; Shao-Dan Zhang; Lan-Jun Zhang; Benjamin Lehrman; Alan F Scott; Dongxin Lin; Yi-Xin Zeng; Yin Yao Shugart; Wei-Hua Jia
Journal:  Am J Hum Genet       Date:  2016-04-07       Impact factor: 11.025

4.  Computational analysis reveals microRNA-mRNA regulatory network in esophageal squamous cell carcinoma.

Authors:  Jie Zhao; Bi-Cheng Zhang; Li-Fang Yu; Wei-Xing Wang; Yong Zhao; Zhi-Guo Rao
Journal:  J Huazhong Univ Sci Technolog Med Sci       Date:  2016-12-07

5.  Genetic variants in miR-196a2 and miR-499 are associated with susceptibility to esophageal squamous cell carcinoma in Chinese Han population.

Authors:  Fangyuan Shen; Jiejun Chen; Shicheng Guo; Yinghui Zhou; Yabiao Zheng; Yajun Yang; Junjie Zhang; Xiaofeng Wang; Chenji Wang; Dunmei Zhao; Mengyun Wang; Meiling Zhu; Lixia Fan; Jiaqing Xiang; Yong Xia; Qingyi Wei; Li Jin; Jiucun Wang; Minghua Wang
Journal:  Tumour Biol       Date:  2015-10-30

6.  Kinase-Dependent and -Independent Roles for PTK6 in Colon Cancer.

Authors:  Priya S Mathur; Jessica J Gierut; Grace Guzman; Hui Xie; Rosa M Xicola; Xavier Llor; Michael I Chastkofsky; Ansu O Perekatt; Angela L Tyner
Journal:  Mol Cancer Res       Date:  2016-03-16       Impact factor: 5.852

7.  DNA methylation and nucleosome occupancy regulate the cancer germline antigen gene MAGEA11.

Authors:  Smitha R James; Carlos D Cedeno; Ashok Sharma; Wa Zhang; James L Mohler; Kunle Odunsi; Elizabeth M Wilson; Adam R Karpf
Journal:  Epigenetics       Date:  2013-07-09       Impact factor: 4.528

8.  Chromosomal aneuploidies and combinational fluorescence in situ hybridization probe panels are useful for predicting prognosis for esophageal squamous cell carcinoma.

Authors:  Jia-Jie Hao; Han-Qing Yao; Guang-Yun Dai; Wei Kang; Xue-Mei Jia; Xin Xu; Yan Cai; Qi-Min Zhan; Gui-Qi Wang; Ming-Rong Wang
Journal:  J Gastroenterol       Date:  2014-05-11       Impact factor: 7.527

9.  Three-dimensional mRNA measurements reveal minimal regional heterogeneity in esophageal squamous cell carcinoma.

Authors:  Wusheng Yan; Joanna Shih; Jaime Rodriguez-Canales; Michael A Tangrea; Audrey Player; Lixia Diao; Nan Hu; Alisa M Goldstein; Jing Wang; Philip R Taylor; Scott M Lippman; Ignacio I Wistuba; Michael R Emmert-Buck; Heidi S Erickson
Journal:  Am J Pathol       Date:  2012-12-03       Impact factor: 4.307

10.  Low expression of microRNA-202 is associated with the metastasis of esophageal squamous cell carcinoma.

Authors:  Guoliang Ma; Fengmei Zhang; Xueguang Dong; Xiaoli Wang; Yuguo Ren
Journal:  Exp Ther Med       Date:  2016-01-21       Impact factor: 2.447

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.