| Literature DB >> 20706610 |
Victor Abkevich1, Diana Iliev, Kirsten M Timms, Thanh Tran, Mark Skolnick, Jerry S Lanchbury, Alexander Gutin.
Abstract
Genomic copy number variations are a typical feature of cancer. These variations may influence cancer outcomes as well as effectiveness of treatment. There are many computational methods developed to detect regions with deletions and amplifications without estimating actual copy numbers (CN) in these regions. We have developed a computational method capable of detecting regions with deletions and amplifications as well as estimating actual copy numbers in these regions. The method is based on determining how signal intensity from different probes is related to CN, taking into account changes in the total genome size, and incorporating into analysis contamination of the solid tumors with benign tissue. Hidden Markov Model is used to obtain the most likely CN solution. The method has been implemented for Affymetrix 500K GeneChip arrays and Agilent 244K oligonucleotide arrays. The results of CN analysis for normal cell lines, cancer cell lines, and tumor samples are presented. The method is capable of detecting copy number alterations in tumor samples with up to 80% contamination with benign tissue. Analysis of 178 cancer cell lines reveals multiple regions of common homozygous deletions and strong amplifications encompassing known tumor suppressor genes and oncogenes as well as novel cancer related genes.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20706610 PMCID: PMC2914423 DOI: 10.1155/2010/386870
Source DB: PubMed Journal: J Biomed Biotechnol ISSN: 1110-7243
Median signal intensities calculated using parameters generated in this paper.
| Cell line | Median signal | Comment |
|---|---|---|
| PC3 | 0.36 | Homozygous deletion of PTEN |
| Data for 28 male cell lines provided by Affymetrix | 1.14 | Single copy of X chromosome outside of pseudoautosomal regions |
| NA04626 | 2.72 | 47,XXX |
| NA01416E | 3.41 | 48,XXXX |
| NA06061 | 4.1 | 49,XXXXX |
| HTB27 | 5.31 | CN estimated by qPCR for CCND1 is 6.5 ± 1.1 |
| CRL1620 | 5.79 | CN estimated by qPCR for EGFR is 8.6 ± 1.4 |
| CCL251 | 6.41 | CN estimated by qPCR for CCND1 is 10.6 ± 4.4 |
| HTB25 | 8.22 | CN estimated by qPCR for CCND1 is 13.8 ± 1.4 |
| CRL2338 | 8.82 | CN estimated by qPCR for CCND1 is 16.6 ± 0.7 |
| HTB27 | 9.42 | CN estimated by qPCR for ERBB2 is 18.6 ± 2.7 |
| CRL1978 | 7.95 | CN estimated by qPCR for CCND1 is 19.6 ± 6.5 |
| HTB23 | 11.26 | CN estimated by qPCR for CCND1 is 20.6 ± 2.6 |
| HTB19 | 10.13 | CN estimated by qPCR for EGFR is 20.8 ± 2.8 |
| HTB127 | 13.93 | CN estimated by qPCR for CCND1 is 26.1 ± 5.5 |
| CRL2321 | 14.13 | CN estimated by qPCR for CCND1 is 26.6 ± 5.7 |
| HTB41 | 13.25 | CN estimated by qPCR for CCND1 is 46.9 ± 7.1 |
| HTB128 | 17.92 | CN estimated by qPCR for CCND1 is 54.4 ± 14.7 |
| HTB127 | 21.34 | CN estimated by qPCR for ERBB2 is 56.5 ± 3.9 |
Figure 1Comparison between median signal intensities calculated using parameters generated in this paper (y-axis) and CN estimates obtained in a different way (x-axis). For details see Table 1. Blue line represents correlation between median signal intensities and CN values used in this study.
Figure 2Results of CN analysis for noncancerous cell lines from nuclear family: NA12056 from the father (a), NA12057 from the mother (b), and NA10851 from the son (c).
Figure 3(a) Signal intensity of SNPs (y axis) for ovarian cancer cell line OVCAR8 before adjustment on genome size. (b) Fraction of SNPs (x axis) with a certain signal intensity (y axis) for ovarian cancer cell line OVCAR8 (see (a)). (c) CN solution for ovarian cancer cell line OVCAR8 after adjustment on genome size (k = 1.167).
Heterozygosity and call rate of SNPs within seven maxima of signal intensity for ovarian cancer cell line OVCAR8 (see Figure 3(b)).
| Position of a maximum | SNP heterozygosity (%) | Call rate (%) | Height (%) |
|---|---|---|---|
| 0.3 | 9.1 | 36.7 | 0.2 |
| 1.0 | 1.2 | 90.2 | 5.3 |
| 1.7 | 8 | 88.1 | 25.8 |
| 2.4 | 14.2 | 82 | 11.2 |
| 3.0 | 19 | 77.6 | 2.8 |
| 3.6 | 1 | 98.4 | 0.5 |
| 4.3 | 28.9 | 69.6 | 0.03 |
Frequent homozygous deletions observed in 178 cancer cell lines. Chromosomal positions are based on March 2006 version of the UCSC Human Genome Browser.
| Region | Number of observation | Likely tumor suppressor gene |
|---|---|---|
| chr2:141238933-141834821 | 7 | LRP1B |
| chr3:59686057-61195849 | 29 | FHIT |
| chr4:91615328-92272605 | 6 | |
| chr6:162091010-162697695 | 5 | PARK2 |
| chr7:38238069-38371496 | 6 | |
| chr7:141707985-142210594 | 5 | |
| chr9:19866496-28550130 | 36 | CDKN2A (p16), CDKN2B |
| chr10:88921717-91164155 | 6 | PTEN |
| chr14:21159134-22067364 | 7 | |
| chr16:6161277-7101949 | 9 | |
| chr16:76814181-77773689 | 26 | WWOX |
| chr20:14275899-15352425 | 22 |
Frequent amplifications with CN >9 observed in 178 cancer cell lines. Chromosomal positions are based on March 2006 version of the UCSC Human Genome Browser.
| Region | Number of observations | Likely oncogene |
|---|---|---|
| chr8:81240524-81974370 | 5 | TPD52 |
| chr8:113091665-114742342 | 10* | |
| chr8:128756816-128849113 | 17 | MYC |
| chr8:142078709-142107781 | 7** | PTK2 |
| chr11:68619900-69985448 | 11 | CTTN, FGF4, FGF3, FGF19, ORAOV1, CCND1, MYEOV |
| chr17:34999505-35264341 | 14 | ERBB2 |
| chr17:43477124-44795465 | 10 | |
| chr17:55677511-61149311 | 6 | APPBP2, PPM1D, BCAS3 |
| chr20:45554593-46054338 | 5 | NCOA3 |
| chr20:55503756-56241375 | 5 | |
| chr20:57622421-58603945 | 6 | |
| chr22:19057363-20130955 | 5 | CRKL |
*One of these amplifications also involves MYC.
**Two of these amplifications also involve MYC.
Comparison of the degree of contamination of cancer cell lines with CEPH cell line NA12776 determined using the picogreen quantitation (averaged over three measurements) and the algorithm presented in this paper.
| Cancer cell line | Tissue type | Mix ratio from DNA quantitation | Mix ratio from CN analysis | Percent of SNPs with concordant CN values for contaminated versus pure cancer cell lines |
|---|---|---|---|---|
| HTB19 | Breast | 10 ± 1% | 9.6% | 97.1% |
| HTB76 | Ovary | 20 ± 4% | 18.2% | 98.6% |
| HTB127 | Breast | 30 ± 4% | 33.6% | 98.5% |
| CCL228* | Colon | 40 ± 3% | 43.4% | 95.9% |
| CCL253* | Colon | 50 ± 2% | 60.7% | 88.2% |
| HTB119 | Lung | 60 ± 2% | 54.1% | 99.8% |
| HTB9 | Urinary bladder | 70 ± 2% | 74.3% | 97.0% |
| HTB41 | Salivary gland | 80 ± 1% | 84.7% | 56.9% |
*For these two samples one of the picogreen measurements failed.
Figure 4Results of CN analysis of colon tumor sample (collected from male subject) significantly contaminated with benign tissue (~48%). (a) Signal intensity of SNPs (y axis) for this sample after adjustment on genome size but before adjustment on contamination with benign tissue. (b) Correct CN solution for this sample after adjustment on both genome size and contamination with benign tissue. (c) Incorrect CN solution for this sample after adjustment on genome size but without adjustment on contamination with benign tissue.