| Literature DB >> 26222067 |
Danielle Mercatante Carrick1, Michele G Mehaffey2, Michael C Sachs3, Sean Altekruse1, Corinne Camalier2, Rodrigo Chuaqui3, Wendy Cozen4, Biswajit Das2, Brenda Y Hernandez5, Chih-Jian Lih2, Charles F Lynch6, Hala Makhlouf3, Paul McGregor2, Lisa M McShane3, JoyAnn Phillips Rohan2, William D Walsh2, Paul M Williams2, Elizabeth M Gillanders1, Leah E Mechanic1, Sheri D Schully1.
Abstract
Next Generation Sequencing (NGS) technologies are used to detect somatic mutations in tumors and study germ line variation. Most NGS studies use DNA isolated from whole blood or fresh frozen tissue. However, formalin-fixed paraffin-embedded (FFPE) tissues are one of the most widely available clinical specimens. Their potential utility as a source of DNA for NGS would greatly enhance population-based cancer studies. While preliminary studies suggest FFPE tissue may be used for NGS, the feasibility of using archived FFPE specimens in population based studies and the effect of storage time on these specimens needs to be determined. We conducted a study to determine whether DNA in archived FFPE high-grade ovarian serous adenocarcinomas from Surveillance, Epidemiology and End Results (SEER) registries Residual Tissue Repositories (RTR) was present in sufficient quantity and quality for NGS assays. Fifty-nine FFPE tissues, stored from 3 to 32 years, were obtained from three SEER RTR sites. DNA was extracted, quantified, quality assessed, and subjected to whole exome sequencing (WES). Following DNA extraction, 58 of 59 specimens (98%) yielded DNA and moved on to the library generation step followed by WES. Specimens stored for longer periods of time had significantly lower coverage of the target region (6% lower per 10 years, 95% CI: 3-10%) and lower average read depth (40x lower per 10 years, 95% CI: 18-60), although sufficient quality and quantity of WES data was obtained for data mining. Overall, 90% (53/59) of specimens provided usable NGS data regardless of storage time. This feasibility study demonstrates FFPE specimens acquired from SEER registries after varying lengths of storage time and under varying storage conditions are a promising source of DNA for NGS.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26222067 PMCID: PMC4519244 DOI: 10.1371/journal.pone.0127353
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Workflow for the FFPE tissues.
Three SEER RTRs selected cases that would likely meet the study criteria from their database. Cases were screened for histology/morphology, tumor nuclei content, and necrotic cell content to select cases meet the study criteria. The RTRs screened 30 or 31 cases; one RTR did not keep track of the number of cases they screened. Each RTR selected and sent 20 cases that met study criteria to the lab. One case was excluded from analyses due to incomplete information regarding storage time. 59 cases were considered for subsequent analyses. DNA was successfully extracted from 58 cases. WES was successful on 53 of the cases.
FFPE specimens selected for analysis.
| Specimen Storage Time | |||||
|---|---|---|---|---|---|
| 3–12 years | 13–22 years | 23–32 years | Replicates | Total | |
|
| 8 | 9 | 3 | 4 | 24 |
|
| 4 | 11 | 5 | 3 | 23 |
|
| 1 | 11 | 7 | 0 | 19 |
|
| 13 | 31 | 15 | 7 | 66 |
*For seven cases two separate specimens were prepared and sent to the laboratory.
Pathology review results of FFPE tissues for serous ovarian adenocarcinoma received at the NCI lab.
| # of Tissues | |||
|---|---|---|---|
| Yes | No | % Discordance with how the registry designated the tissue | |
|
| 50 | 9 | 15% |
|
| 55 | 4 | 7% |
|
| 56 | 3 | 5% |
*Two pathologists at NCI reviewed these 3 cases and agreed the tissue sections provided did not show sufficient evidence of being high-grade.
DNA quantity and quality metrics by storage time of specimen.
| Mean (Standard Deviation) by Storage Time | Difference (95% CI) | P-value, Trend | ||||
|---|---|---|---|---|---|---|
| Overall(N = 59) | 3-12Years(N = 13) | 13-22Years(N = 31) | 23–32 Years(N = 15) | |||
|
| 3.7 (3.2) | 3.8 (3) | 4.6 (3.4) | 1.7 (2.1) | -2.1 (-3.5 to -0.7) | P = 0.003 |
|
| 2.1 (1.2) | 2.0 (0.3) | 2.0 (0.5) | 2.6 (2.3) | 0.3 (-0.3 to 0.9) | P = 0.288 |
|
| 0.21(0.13) | 0.22 (0.09) | 0.24 (0.14) | 0.13 (0.1) | -0.1 (-0.15 to -0.05) | P < 0.001 |
DNA yield (μg) (measured by Qubit): Extracted DNA concentration was measured using Qubit fluorometer using dsDNA BR Assay Kits from Life Technologies Inc. and total DNA yield for the sample were calculated.
A260/280: Absorbance at 260 nm and 280 nm were measured using Nanodrop for extracted DNA from each sample and expressed as a ratio.
KapaQC Q129/41 ratio: Kapa Human Genomic DNA Quantification and QC Kit [23] was used to measure the quality of the genomic DNA isolated from these FFPE samples. This ratio is a measure of fragmentation of the genomic DNA extracted from the sample.
*CI = confidence Interval. Overall means and standard deviations for quality measures and by storage time, differences per 10 years of storage and 95% confidence intervals estimated by linear regression on continuous time. P-values are for Wald tests of time coefficients in regression models. A slope of zero indicates no association with specimen storage time. P-values are not adjusted for multiple comparisons.
$ 1 sample, age group 13–22 years, is missing DNA yield because no DNA remained after the QC step.
Fig 2Association between specimen storage time and the Q129/41 ratio.
The solid line indicates the estimated linear relationship between age and Q129/41 ratio. The shaded area denotes pointwise 95% confidence intervals of the conditional mean. Cases successful through the entire WES workflow (DNA extraction through WES sequencing) are denoted as circles (N = 53); unsuccessful cases are denoted as X’s (N = 6).
Fig 3Association between specimen storage time and final library size in base pairs.
Specimens that failed sequencing do not have library size values and are indicated by x. The solid line indicates the estimated linear relationship between storage time and library size (N = 59). The shaded area denotes pointwise 95% confidence intervals for the conditional means. Cases successful through the entire WES workflow (DNA extraction through WES sequencing) are denoted as circles (N = 53); unsuccessful cases are denoted as X’s (N = 6). Failed assays were not used to estimate the linear trend.
Whole Exome Sequencing QC metrics by duration of specimen storage.
| Mean (Standard Deviation) by Storage Time | Difference (95% CI), 10 Years of Storage | P-value, Trend | ||||
|---|---|---|---|---|---|---|
| Overall (N = 53) | 3–12 Years (N = 12) | 13–22 Years (N = 29) | 23–32 Years (N = 12) | |||
|
| 275.3 (11.3) | 277.2 (6.7) | 277.6 (12.1) | 267.9 (10.3) | -8.5 (-13.4 to -3.5) | P = 0.001 |
|
| 86.2 (7.9) | 89.7 (2.8) | 86.3 (8.3) | 82.2 (9) | -6.1 (-9.5 to -2.6) | P < 0.001 |
|
| 112.1 (48.4) | 128.2 (35.5) | 116 (51) | 86.8 (46.8) | -39.1 (-60 to -18.2) | P < 0.001 |
|
| 33.6 (20.9) | 26.6 (13.9) | 34.1 (21.5) | 39.5 (24.5) | 9.0 (-0.9 to 18.8) | P = 0.074 |
|
| 2.48 (0.15) | 2.43 (.07) | 2.45 (.08) | 2.61 (0.24) | 0.12 (.06 to 0.18) | P < 0.001 |
Final Library Size (bp): This QC metric represents the peak size from BioAnalyzer electropherogram traces of the final exome library from each sample using the Agilent BioAnalyzer High Sensitivity DNA kit.
% Target Covered at 20x: The percentage of all target bases achieving 20X or greater read depth. This metric measures the efficiency of the exome capture. DNA samples with poor quality samples tend to have lower % target covered.
Average Read Depth: Average read depth in the target region.
Percent Duplication: Percent of reads originating from same fragment of the library. These duplicated reads may indicate bias originating from sample quality, library amplification etc. DNA samples with poor quality samples tend to have higher percent duplication.
Ti/Tv Ratio: This is ratio of transitions (single nucleotide substitutions with the same type of nucleotide, e.g., pyrimidine to pyrimidine (C<>T) or purine to purine (A<>G)) to transversions (single nucleotide substitutions with the different type of nucleotide, i.e., pyrimidine to purine or vice versa (A<>T etc.). FFPE DNA samples tend to have higher Ti/Tv ratio due to chemical crosslink and modification.
CI = confidence Interval. Overall means and standard deviations for quality measures and by storage time, differences per 10 years of storage and 95% confidence intervals estimated by linear regression on continuous time. P-values are for Wald tests of time coefficients in regression models. A slope of zero indicates no association with specimen storage time. P-values are not adjusted for multiple comparisons.
Fig 4Association between specimen storage time and average read depth.
The solid line indicates the estimated linear relationship between age and average read depth. The shaded area denoted pointwise 95% confidence intervals of the conditional mean. Cases successful through the entire WES workflow (DNA extraction through WES sequencing) are denoted as circles (N = 53); unsuccessful cases are denoted as X’s (N = 6). Failed assays were not used to estimate the linear trend.
Success of whole exome assay by formalin-fixed paraffin-embedded specimen storage time.
| Age Range in Years | Assay Failed | Assay Successful | Total | Proportion successful | 95% Confidence Interval |
|---|---|---|---|---|---|
|
| 1 | 12 | 13 | 0.92 | 0.64 to 0.99 |
|
| 2 | 29 | 31 | 0.94 | 0.79 to 0.99 |
|
| 3 | 12 | 15 | 0.80 | 0.52 to 0.99 |
* Sequencing assay success defined as having a non-failed final library size and percent target coverage of 50% with a minimum of 20x coverage.
The p-value for a linear test for trend of success probability by specimen storage time equals 0.366.
Success of whole exome assays by SEER Registry sites.
| Registry | Assay Failed | Assay Successful | Total | Percent Success | 95% Confidence Interval |
|---|---|---|---|---|---|
|
| 1 | 19 | 20 | 0.95 | 0.75 to 0.99 |
|
| 2 | 17 | 19 | 0.89 | 0.67 to 0.99 |
|
| 3 | 17 | 20 | 0.85 | 0.62 to 0.97 |
* Sequencing assay success defined as having a non-failed final library size and percent target coverage of 50% with a minimum of 20x coverage.
The p-value for a linear test for trend of success probability by site equals 0.693.