| Literature DB >> 20799942 |
Lesleyann Hawthorn1, Jesse Luce, Leighton Stein, Jenniffer Rothschild.
Abstract
BACKGROUND: A major challenge in the interpretation of genomic profiling data generated from breast cancer samples is the identification of driver genes as distinct from bystander genes which do not impact tumorigenesis. One way to assess the relative importance of alterations in the transcriptome profile is to combine parallel analyses that assess changes in the copy number alterations (CNAs). This integrated analysis permits the identification of genes with altered expression that map within specific chromosomal regions which demonstrate copy number alterations, providing a mechanistic approach to identify the 'driver genes'.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20799942 PMCID: PMC2939551 DOI: 10.1186/1471-2407-10-460
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Clinical information and array platform used for each sample.
| Patient Number | Tissue | Diagnosis | Experiments/Arrays |
|---|---|---|---|
| 1 | breast tumor | poorly differentiated infiltrating ductal carcinoma | 250K, 133plus2.0 |
| 2 | breast tumor | infiltrating ductal carcinoma | 250K, 133plus2.0 |
| 3 | breast tumor | infiltrating carcinoma | 250K |
| 4 | breast tumor | infiltrating ductal carcinoma | 250K, 133plus2.0 |
| 4 | non-tumor breast | adequate control tissue, T-0400, NOS | 133Plus2.0 |
| 5 | breast tumor | infiltrating ductal carcinoma | 250K, 133plus2.0 |
| 6 | breast tumor | poorly differentiated infiltrating ductal carcinoma | 250K, 133plus2.0 |
| 7 | breast tumor | infiltrating ductal carcinoma | 250K, 133plus2.0 |
| 8 | breast tumor | infiltrating ductal carcinoma | 250K, 133plus2.0 |
| 9 | Lymph node | metastatic ductal carcinoma to lymph node | 250K |
| 9 | breast tumor | infiltrating ductal carcinoma | 250K |
| 11 | breast tumor | infiltrating ductal carcinoma and high grade DCIS | 250K |
| 12 | breast tumor | infiltrating ductal carcinoma | 250K, 133plus2.0 |
| 13 | breast tumor | infiltrating ductal carcinoma | 250K, 133plus2.0 |
| 13 | non-tumor breast | adequate control tissue, T-0400, NOS | 133Plus2.0 |
| 14 | breast tumor | infiltrating ductal carcinoma | 250K |
| 15 | breast tumor | infiltrating ductal carcinoma | 250K |
| 16 | Lymph node | metastatic ductal carcinoma to lymph node | 250K, 133plus2.0 |
| 17 | breast tumor | Invasive ductal carcinoma | 250K |
| 18 | breast tumor | Invasive ductal carcinoma | 250K, 133plus2.0 |
| 19 | breast tumor | Invasive ductal carcinoma | 250K |
| 20 | breast tumor | Invasive ductal carcinoma | 250K, 133plus2.0 |
| 21 | breast tumor | Invasive ductal carcinoma | 250K |
| 22 | breast tumor | Invasive ductal carcinoma | 250K, 133plus2.0 |
| 23 | breast tumor | Invasive ductal carcinoma | 133Plus2.0 |
| 24 | breast tumor | Invasive ductal carcinoma | 133Plus2.0 |
| 25 | breast tumor | Invasive ductal carcinoma | 133plus2.0 |
| 26 | non-tumor breast | adequate control tissue, T-0400, NOS | 133Plus2.0 |
| 27 | non-tumor breast | adequate control tissue, T-0400, NOS | 133Plus2.0 |
Figure 1Principle Components Analysis of Copy Number Data. Principle Components Analysis PCA is a method of dimensionality reduction to look for overall trends in the data. In figure 1a it can be seen that "scan date" has had a significant impact on the data implying that processing of the samples at different times has contributed significantly to the variance in the samples meaning that this significant contribution may obscure the main effects. Figure 2b shows the PCA plot following batch removal of the effect of Scan Date. A mixed model ANOVA was used to estimate the effect of scan date and removed from the data.
Figure 2PCA and ANOVA of Transcript Expression Data. Analysis of Variance (ANOVA) was performed on the entire data set and a gene list was generated Figure 2a shows the histogram of variance indicating that the major variance was due to the differences between tumor and normal samples and scan date variance is almost the same as that attributed to type 1 error. PCA was used reduce dimensionality and to examine whether clusters of tumors separated from normal samples or if other variables, such as scan date contributed to the variance. It can be seen that the tumors are spread throughout the three dimensional space of the plot while the normal samples form a tight cluster at the base of the the plot on the right. Concordance of the clustering structure was observed between the two dimensionality-reduction procedures (data not shown). Analysis of Variance (ANOVA) was then performed on the entire data set and a gene list was generated using an FDR of 0.05, 2-fold expression changes and a p-value for fold change of 0.05. Figure 2 shows the histogram of variance and the PCA plot of the effect of the scan date on the data.
Figure 3Copy Number Alterations in Infiltrating Ductal Carcinoma. Copy number data was generated using Genomic segmentation to obtain the different CN state partitions. The baseline was the HapMap 270 set of samples. The gains/losses are shown in red and blue respectively for each chromosome. Each copy number event was required occur in 5/22 samples. The cutoff values for copy numbers of 1.7 or less for deletions and 2.3 or greater for CN gains. The length of each bar represents the average copy number across all samples. The range is from -2 to 22.
Figure 4Minimal Regions of Overlap in Copy Number. A amplified region on chromosome 8q21.11 is shown in 4a. The minimal region of overlap has an average of 4.9 copies in 19 samples. This small region is 13 KB (8: 75237536-75250960) and is surrounded by an amplification or gain in 7-18 samples. This extended region is 90 KB and begins at 8p11 and extends to the telomere (8:41489634- 133808163). 4b shows an novel copy number region of gain at 5p15.33 where two minimal regions of overlap were detected. The first extends 101 KB (5:1836735-1938410) and the second is 19 KB (5:3180419-3199829) while the entire amplified region extends 5.6 MB from 5p15.33-p33.2 (5:165712-5779631). The bottom trace shows the SNP intensity plot for the region. The larger dots are smoothed data and the fine dots are unsmoothed. The first region has a mean of 3.5 copies and the second has 4.7.
Figure 5LOH regions. The plot shows LOH detected across the genome in 22 IDC samples. The genotype calls were generated using the BBRLMM algorithm. HMM was used to isolate those regions with a high probability to be loss events based on the genotype error and the expected heterozygous frequency at each SNP. The tumor numbers have the largest numbers closest to the chromosome. It can be seen that tumor 1, the furthest from each chromosome shows many regions of LOH including whole chromosome allelic loss of 1,6 and 17.
Figure 6Allelic Ratio Plot of 12q24.3. The upper section of the figure plots the number of samples displaying LOH. The lower part of the diagram shows the allelic ratios plotted for each of the tumors defined by the colors shown above the plot. It is notable that the spots migrate towards 1 (AA calls) and -1 (BB calls) when the frequency of the LOH is increased, and as the frequency decreases, the more spots are plotted towards 0 (AB calls).
Figure 7Integration of Copy Number and LOH. The integration was performed using the data generated using the criteria outlined for LOH and CNA described in the Materials and methods. The length of the bars indicates the number of samples. The LOH events close to the centromeres of chromosomes 8, 16 and X are the most frequent LOH events but not certifiable due to few probes in those regions. Copy number loss with LOH is detected at 8p12, Xp24 and Xp25. The other detected regions are copy neutral events.
Integration of copy number with gene expression data in > samples
| Cyto-band | Gene Symbol | p-value | Fold-Change | length (bps) | Copy Number Location | Copy state | # | Mean copy # |
|---|---|---|---|---|---|---|---|---|
| 1q21.2 | SV2A | 0.000790217 | 2.35 | 1228718 | chr1.147361719.148590436 | gain | 6 | 2.93 |
| 1q21.2 | LOC730631 | 0.000454988 | 2.12 | 1228718 | chr1.147361719.148590436 | gain | 6 | 2.93 |
| 1q21.3 | S100A14 | 0.00103078 | 13.49 | 145569 | chr1.151766410.151911978 | gain | 6 | 3.27 |
| 1q22 | MUC1 | 3.86E-05 | 5.12 | 1636566 | chr1.152807193.154443758 | gain | 6 | 2.79 |
| 1q23.2 | IGSF9 | 1.70E-06 | 6.69 | 82022 | chr1.158101327.158183348 | gain | 5 | 2.75 |
| 1q25.2 | TDRD5 | 5.76E-08 | 2.49 | 76749 | chr1.177855168.177931916 | gain | 6 | 3.12 |
| 1q32.1 | PCTK3 | 0.000647034 | 2.12 | 57388 | chr1.203711819.203769206 | gain | 7 | 2.12 |
| 1q41 | CAPN8 | 5.28E-05 | 7.01 | 1214960 | chr1.221390196.222605155 | gain | 6 | 2.76 |
| 1p34.1 | ST3GAL3 | 0.00102362 | -3.50 | 52158 | chr1.44004795.44064981 | loss | 5 | 1.14 |
| 11p13 | PRRG4 | 1.96E-07 | 4.61 | 129389 | chr11.32799917.32929305 | gain | 7 | 3.42 |
| 11p11.2 | PACSIN3 | 0.000775686 | 2.11 | 271627 | chr11.47122445.47394071 | gain | 5 | 3.48 |
| 14q24.1 | WDR22 | 5.46E-09 | -2.06 | 2588491 | chr14.68430400.69090879 | loss | 5 | 1.49 |
| 15q23 | Hs.655686 | 5.25E-06 | 11.48 | 1692519 | chr15.68908327.69703941 | gain | 5 | 2.58 |
| 15q23 | Hs.655868 | 0.000235216 | 3.68 | 504419 | chr15.69797809.70302227 | gain | 6 | 2.61 |
| 15q24.1 | NEO1 | 0.00012075 | 2.11 | 843811 | chr15.70912294.71756104 | gain | 7 | 3.37 |
| 15q26.1 | ISG20 | 1.92E-06 | 3.16 | 249550 | chr15.86957456.87207005 | gain | 6 | 2.84 |
| 15q26.1 | FAM174B | 0.000142779 | 4.64 | 127979 | chr15.90953124.91081102 | gain | 5 | 3.50 |
| 15q26.1 | FAM174B | 4.31E-07 | 3.39 | 127979 | chr15.90953124.91081102 | gain | 5 | 3.50 |
| 5p15.33 | PLEKHG4B | 0.000905578 | 2.33 | 352294 | chr5.165712.518005 | gain | 5 | 3.26 |
| 6p24.3 | TFAP2A | 0.000412702 | 28.92 | 61721 | chr6.10487807.10549527 | gain | 7 | 4.42 |
| 8q23.3 | TRPS1 | 0.000200736 | 4.98 | 1434519 | chr8.116140217.117574735 | gain | 5 | 3.65 |
| 8p22 | DLC1 | 7.07E-05 | -6.72 | 6333283 | chr8.11614644.13945615 | loss | 6 | 1.49 |
| 8p21.3 | GFRA2 | 1.72E-10 | -2.18 | 1541653 | chr8.20297943.21825688 | loss | 6 | 1.38 |
| 8p21.2 | EBF2 | 5.96E-08 | -4.59 | 61487 | chr8.25917207.25978693 | loss | 5 | 1.46 |
| 8p21.2 | AK057935 | 0.000340165 | -8.34 | 798818 | chr8.26553995.26777510 | loss | 6 | 1.47 |
| 8p12 | Hs.654357 | 3.39E-06 | -3.64 | 1469584 | chr8.29124128.30492208 | loss | 5 | 1.48 |
| 8q12.1 | SDR16C5 | 0.00075179 | 7.97 | 689931 | chr8.56762001.57376988 | gain | 5 | 2.99 |
| 8p23.1 | ANGPT2 | 2.01E-06 | -6.11 | 168439 | chr8.6342789.6444367 | loss | 5 | 1.45 |
| 8p23.1 | DEFA1 /// DEFA3 /// LOC728358 | 6.87E-05 | -3.05 | 1406779 | chr8.6789275.6889920 | loss | 6 | 1.40 |
| 8q21.13 | CHMP4C | 0.000234239 | 16.46 | 5693518 | chr8.81713285.87406802 | gain | 5 | 3.22 |
| Xq22.3 | Hs.715776 | 2.99E-07 | -9.24 | 16623258 | chrX.102883803.119358070 | loss | 5 | 1.52 |
| Xq22.3 | LOC100130886 /// TMEM164 | 0.000421953 | -2.37 | 16623258 | chrX.102883803.119358070 | loss | 5 | 1.52 |
| Xq24 | GLUD2 | 6.93E-06 | -2.65 | 2753995 | chrX.119455338.122209332 | loss | 5 | 1.51 |
| Xq25 | XPNPEP2 | 6.87E-08 | -3.81 | 7869983 | chrX.128566292.129127918 | loss | 5 | 1.51 |
Integration of LOH and Gene Expression Data.
| Cyto band | Copy Number Location | Gene Symbol | p-value | Fold-Change | # Samples with Copy-Neutral LOH | # | Mean of Samples with loss and LOH | State |
|---|---|---|---|---|---|---|---|---|
| 1p13.2 | chr1.111970656.112340133 | C1orf183 | 8.2E-06 | -2.22 | 2 | 1 | 1.53 | CN-LOH |
| 1p13.2 | chr1.111970656.112340133 | KCND3 | 1.0E-03 | -3.75 | 2 | 1 | 1.53 | CN-LOH |
| 1p22.2 | chr1.89334399.89586461 | GBP4 | 5.1E-06 | -2.13 | 2 | 1 | 1.57 | CN-LOH |
| 6p21.32 | chr6.31698877.32259200 | HSPA1A /// HSPA1B | 1.4E-03 | 4.12 | 4 | CN-LOH | ||
| 12q24.12 | chr12.109960751.111501415 | ALDH2 | 3.8E-05 | -11.38 | 4 | CN-LOH | ||
| 16q12.1 | chr16.45091910.46733079 | PHKB | 5.0E-04 | -2.06 | 7 | CN-LOH | ||
| 16q22.1 | chr16.67206157.67268948 | CDH3 | 5.0E-04 | 6.72 | 2 | 1 | 1.44 | CN-LOH |
| 16q22.1 | chr16.67268948.67273787 | CDH3 | 5.0E-04 | 6.72 | 2 | 1 | 1.49 | CN-LOH |
| 16q22.1 | chr16.67273787.67334399 | CDH3 | 5.0E-04 | 6.72 | 2 | 1 | 1.49 | CN-LOH |
| 16q22.1 | chr16.67273787.67334399 | CDH1 | 4.1E-09 | 43.24 | 2 | 1 | 1.49 | CN-LOH |
| 16q22.1 | chr16.67334399.67396764 | CDH1 | 4.1E-09 | 43.24 | 2 | 1 | 1.49 | CN-LOH |
| 19q13.41 | chr19.57065800.60767150 | ZNF331 | 1.2E-04 | -4.34 | 3 | CN-LOH | ||
| 19q13.41 | chr19.57065800.60767150 | MYADM | 7.5E-07 | -3.21 | 3 | CN-LOH | ||
| 22q12.1 | chr22.26560066.27655923 | TTC28 | 9.8E-08 | -5.33 | 3 | CN-LOH | ||
| 22q12.1 | chr22.26560066.27655923 | XBP1 | 3.4E-04 | 4.50 | 3 | CN-LOH | ||
| Xq24 | chrX.118031377.119358070 | LONRF3 | 2.4E-05 | -6.96 | 0 | 3 | 1.48 | Loss with LOH |
| Xq25 | chrX.123198797.123430595 | ODZ1 | 6.2E-06 | -8.12 | 0 | 3 | 1.47 | Loss with |
| Xq25 | chrX.128574610.129127918 | XPNPEP2 | 4.6E-07 | -3.33 | 0 | 3 | 1.48 | Loss with LOH |