| Literature DB >> 19228381 |
Yuker Wang1, Victoria E H Carlton, George Karlin-Neumann, Ronald Sapolsky, Li Zhang, Martin Moorhead, Zhigang C Wang, Andrea L Richardson, Robert Warren, Axel Walther, Melissa Bondy, Aysegul Sahin, Ralf Krahe, Musaffe Tuna, Patricia A Thompson, Paul T Spellman, Joe W Gray, Gordon B Mills, Malek Faham.
Abstract
BACKGROUND: A major challenge facing DNA copy number (CN) studies of tumors is that most banked samples with extensive clinical follow-up information are Formalin-Fixed Paraffin Embedded (FFPE). DNA from FFPE samples generally underperforms or suffers high failure rates compared to fresh frozen samples because of DNA degradation and cross-linking during FFPE fixation and processing. As FFPE protocols may vary widely between labs and samples may be stored for decades at room temperature, an ideal FFPE CN technology should work on diverse sample sets. Molecular Inversion Probe (MIP) technology has been applied successfully to obtain high quality CN and genotype data from cell line and frozen tumor DNA. Since the MIP probes require only a small (approximately 40 bp) target binding site, we reasoned they may be well suited to assess degraded FFPE DNA. We assessed CN with a MIP panel of 50,000 markers in 93 FFPE tumor samples from 7 diverse collections. For 38 FFPE samples from three collections we were also able to asses CN in matched fresh frozen tumor tissue.Entities:
Year: 2009 PMID: 19228381 PMCID: PMC2649948 DOI: 10.1186/1755-8794-2-8
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
FFPE Samples used in the study
| MD Anderson set 1 | Breast cancer | < 3 | 8 | 9 | |
| MD Anderson set 2 | Breast cancer | 5 – 22 | 27 | 18 | |
| UCSF | Liver metastases from colorectal | 5 – 28 | 9 | 9 | Matching frozen tumors and normals available |
| Dana Farber | Invasive breast cancer | 5 – 6 | 6 | 13 | Microdissected |
| CRUK | Colorectal | 0.5 – 5 | 17 | 16 | Matching frozen tumors available; macrodissected |
| CHTN | Bladder, colorectal, kidney, liver | 1 – 3 | 13 | 15 | Matching frozen tumors and normals available |
| Leader | Kidney | 3 – 4 | 13 | 12 | |
Figure 1Relationship between the natural log of the false positive rate at 50% sensitivity (Ln_FPR50) and the natural log of the 2p-RSE (Ln_2p-RSE). Each point represents a single sample with copy number assessed using a particular reference set. Samples were assessed with multiple reference sets and hence may appear multiple times in the figure. In samples for which we can asses both the false positive rate at 50% sensitivity (FPR50) and 2p-RSE, the two metrics show a close relationship.
CN Performance of different FFPE and fresh frozen sets
| CHTN (FFPE) | 13 | 13 (100%) | 11 (85%) | 0.139 |
| CHTN (FF) | 12 | 12 (100%) | 12 (100%) | 0.102 |
| CRUK (FFPE) | 17 | 16 (94%) | 12 (75%) | 0.163 |
| CRUK (FF) | 17 | 17 (100%) | 17 (100%) | 0.142 |
| Dana Farber (FFPE) | 6 | 6 (100%) | 1 (17%) | 0.201 |
| Leader (FFPE) | 13 | 7 (54%) | 4 (57%) | 0.155 |
| MD Anderson set1 (FFPE) | 8 | 8 (100%) | 7 (88%) | 0.163 |
| MD Anderson set2 (FFPE) | 27 | 23 (85%) | 18 (78%) | 0.167 |
| UCSF (FFPE) | 9 | 9 (100%) | 9 (100%) | 0.115 |
| UCSF (FF) | 10 | 10 (100%) | 10 (100%) | 0.116 |
| 93 | 82 (88%) | 62 (76%) | 0.16 | |
| 39 | 39 (100%) | 39 (100%) | 0.129 | |
* Fresh frozen
Figure 2Copy number data for 4 samples of varying 2p-RSEs. In each panel, markers are arranged along the chromosomes and colored by chromosome. Chromosomes are typically labeled; some of the smaller chromosomes are unlabeled due to space constraints. The X axis represents chromosomes in genomic order of markers; the Y axis is the absolute copy number measurement. (A) shows one of the best samples with 2p-RSE = 0.109. (B) shows an average "high quality" sample with 2p-RSE = 0.147. (C) shows a sample that just fails to meet the high quality threshold with 2p-RSE = 0.184. (D) shows the worst passed sample with 2p-RSE = 0.247.
Genotype Performance of different passed FFPE and fresh frozen sample sets
| CHTN (FF) | 99.6 | 12 | 7.10E-05 | 2E-5 – 6E-4 |
| CHTN (FFPE) | 98.6 | 11 | 1.60E-04 | 2E-5 – 3E-3 |
| CRUK (FFPE) | 99.2 | 14 | 1.10E-04 | 2E-5 – 2E-4 |
| Dana Farber (FFPE) | 93.9 | 6 | 2.80E-03 | 3E-4 – 8E-3 |
| Leader (FFPE) | 98.5 | 9 | 2.40E-04 | 6E-5 – 1E-3 |
| MD Anderson set1 (FFPE) | 97.5 | 6 | 4.90E-04 | 6E-5 – 9E-4 |
| MD Anderson set2 (FFPE) | 97.8 | 13 | 2.70E-04 | 4E-5 – 2E-2 |
| UCSF (FF) | 98.9 | 10 | 2.00E-05 | 2E-5 – 2E-4 |
| UCSF (FFPE) | 99.2 | 7 | 7.10E-05 | 2E-5 – 1E-4 |
| 98.4 | 66 | 2.70E-04 | 2E-5 – 2E-2 | |
| 99.4 | 22 | 4.00E-05 | 2E-5 – 6E-4 | |
^ The number of pairs with available data allowing the Mendelian inconsistency calculation
* Fresh frozen
Figure 3Copy number (A) and allele ratio (B) data from the same tumor illustrating how the information can be used in concert to infer stromal contamination. In each panel, markers are arranged along the chromosomes and colored by chromosome. Chromosomes are typically labeled; some of the smaller chromosomes are unlabeled due to space constraints.
Figure 4CN in FFPE (A) and fresh frozen (B) samples from the same tumor. In each panel, markers are arranged along the chromosomes and colored by chromosome. Chromosomes are typically labeled; some of the smaller chromosomes are unlabeled due to space constraints.
Figure 5Comparison of MIP and BAC CN results from the same tumor as shown in figure 4. Markers are arranged along the chromosomes and colored by chromosome. Chromosomes are typically labeled; some of the smaller chromosomes are unlabeled due to space constraints. (A) shows the MIP data for the FFPE sample in figure 4B after smoothing (simple 3 marker median) and (B) shows the BAC data. (The two panels are on different X-axis scales because of differences in marker densities. However the panels use the same chromosome color coding scheme to facilitate comparisons. Differences in chromosome 6 are like those seen in figure 4 and, as we discuss in the text, likely due to tumor heterogeneity.) The next two panels show chromosome 17 MIP (C) and BAC (D) data. (These two panels share the same scale on the x-axis.) Some of the fine structure seen in the MIP data is missing in the BAC data, potentially due to resolution differences or genetic heterogeneity: an example is circled.