| Literature DB >> 29621323 |
Eric Bonnet1, Marie-Laure Moutet1, Céline Baulard1, Delphine Bacq-Daian1, Florian Sandron1, Lilia Mesrob1, Bertrand Fin1, Marc Delépine1, Marie-Ange Palomares1, Claire Jubin1, Hélène Blanché2, Vincent Meyer1, Anne Boland1, Robert Olaso1, Jean-François Deleuze1,3,2,4.
Abstract
Next-generation sequencing (NGS) studies are becoming routinely used for the detection of novel and clinically actionable DNA variants at a pangenomic scale. Such analyses are now used in the clinical practice to enable precision medicine. Formalin-fixed paraffin-embedded (FFPE) tissues are still one of the most abundant source of cancer clinical specimen, unfortunately this method of preparation is known to degrade DNA and therefore compromise subsequent analysis. Some studies have reported that variant detection can be performed on FFPE samples sequenced with NGS techniques, but few or none have done an in-depth coverage analysis and compared the influence of different state-of-the-art FFPE DNA extraction kits on the quality of the variant calling. Here, we generated 42 human whole-exome sequencing data sets from fresh-frozen (FF) and FFPE samples. These samples include normal and tumor tissues from two different organs (liver and colon), that we extracted with three different FFPE extraction kits (QIAamp DNA FFPE Tissue kit and GeneRead DNA FFPE kit from Qiagen, Maxwell™ RSC DNA FFPE Kit from Promega). We determined the rate of concordance of called variants between matched FF and FFPE samples on all common variants (representing at least 86% of the total number of variants for SNVs). The concordance rate is very high between all matched FF / FFPE pairs, with equivalent values for the three kits we analyzed. On the other hand, when looking at the difference between the total number of variants in FF and FFPE, we find a significant variation for the three different FFPE DNA extraction kits. Coverage analysis shows that FFPE samples have less good indicators than FF samples, yet the coverage quality remains above accepted thresholds. We detect limited but statistically significant variations in coverage indicator values between the three FFPE extraction kits. Globally, the GeneRead and QIAamp kits have better variant calling and coverage indicators than the Maxwell kit on the samples used in this study, although this kit performs better on some indicators and has advantages in terms of practical usage. Taken together, our results confirm the potential of FFPE samples analysis for clinical genomic studies, but also indicate that the choice of a FFPE DNA extraction kit should be done with careful testing and analysis beforehand in order to maximize the accuracy of the results.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29621323 PMCID: PMC5886566 DOI: 10.1371/journal.pone.0195471
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1DNA Integrity Number (DIN) values for FF and FFPE samples.
A: FF and FFPE samples. B: FFPE samples grouped by extraction method.
Fig 2DNA fragment length values for FF and FFPE samples.
A: FF and FFPE samples. B: FFPE samples grouped by extraction method.
Fig 3Percentage of positions having a coverage greater then or equal to 30X.
A: FF and FFPE samples. B: FFPE samples grouped by extraction method.
Fig 4Median coverage values for FF and FFPE samples.
A: FF and FFPE samples. B: FFPE samples grouped by extraction method.
Fig 5Percentage of duplicated reads for FF and FFPE samples.
A: FF and FFPE samples. B: FFPE samples grouped by extraction method.
Fig 6Percentage of reads mapping outside target regions.
A: FF and FFPE samples. B: FFPE samples grouped by extraction method.
Single nucleotide variants (SNVs) analysis between FF and FFPE sample pairs.
NFF: Number of SNV in FF samples, NFFPE: number of SNV in FFPE samples, NPos: number of common positions between FF and FFPE samples, Nco: number of concordant positions, Ndi: number of discordant positions, P: concordance rate (Nco/NPos * 100).
| Pair | NFF | NFFPE | NPos | Nco | Ndi | P |
|---|---|---|---|---|---|---|
| 1 | 41008 | 40165 | 39778 | 39772 | 6 | 99.98 |
| 2 | 40902 | 40157 | 39691 | 39685 | 6 | 99.98 |
| 3 | 40950 | 40361 | 39913 | 39906 | 7 | 99.98 |
| 4 | 41078 | 40011 | 39643 | 39639 | 4 | 99.99 |
| 5 | 40705 | 40295 | 39355 | 39351 | 4 | 99.99 |
| 6 | 40999 | 40134 | 39570 | 39567 | 3 | 99.99 |
| 7 | 40571 | 40481 | 39634 | 39629 | 5 | 99.99 |
| 8 | 40568 | 40623 | 39568 | 39564 | 4 | 99.99 |
| 9 | 41008 | 40716 | 40075 | 40068 | 7 | 99.98 |
| 10 | 40902 | 40764 | 40048 | 40044 | 4 | 99.99 |
| 11 | 40902 | 40712 | 40004 | 40001 | 3 | 99.99 |
| 12 | 40950 | 40712 | 40094 | 40089 | 5 | 99.99 |
| 13 | 41078 | 40788 | 40102 | 40098 | 4 | 99.99 |
| 14 | 41078 | 40779 | 40116 | 40110 | 6 | 99.99 |
| 15 | 40705 | 40794 | 39631 | 39626 | 5 | 99.99 |
| 16 | 40999 | 40852 | 40046 | 40042 | 4 | 99.99 |
| 17 | 40571 | 40586 | 39473 | 39468 | 5 | 99.99 |
| 18 | 40568 | 41009 | 39848 | 39844 | 4 | 99.99 |
| 19 | 41239 | 37396 | 36851 | 36846 | 5 | 99.99 |
| 20 | 41201 | 37834 | 37239 | 37236 | 3 | 99.99 |
| 21 | 41115 | 37721 | 37185 | 37179 | 6 | 99.98 |
| 22 | 41189 | 36215 | 35751 | 35745 | 6 | 99.98 |
| 23 | 40963 | 38097 | 37588 | 37582 | 6 | 99.98 |
| 24 | 41074 | 38132 | 37577 | 37572 | 5 | 99.99 |
| 25 | 41133 | 38369 | 37860 | 37858 | 2 | 99.99 |
Insertion-deletion events (INDELs) analysis between FF and FFPE sample pairs.
NFF: Number of INDELs in FF samples, NFFPE: number of INDELs in FFPE samples, NPos: number of common positions between FF and FFPE samples, Nco: number of concordant positions, Ndi: number of discordant positions, P: percentage of concordance Nco/NPos.
| Pair | NFF | NFFPE | NPos | Nco | Ndi | P |
|---|---|---|---|---|---|---|
| 1 | 3609 | 3402 | 3176 | 3140 | 36 | 98.87 |
| 2 | 3612 | 3369 | 3169 | 3122 | 47 | 98.52 |
| 3 | 3610 | 3435 | 3193 | 3154 | 39 | 98.78 |
| 4 | 3624 | 3428 | 3168 | 3122 | 46 | 98.55 |
| 5 | 3762 | 3445 | 3178 | 3136 | 42 | 98.68 |
| 6 | 3770 | 3446 | 3212 | 3170 | 42 | 98.69 |
| 7 | 3749 | 3587 | 3315 | 3276 | 39 | 98.82 |
| 8 | 3712 | 3589 | 3239 | 3203 | 36 | 98.89 |
| 9 | 3609 | 3568 | 3258 | 3210 | 48 | 98.53 |
| 10 | 3612 | 3589 | 3267 | 3221 | 46 | 98.59 |
| 11 | 3612 | 3538 | 3240 | 3196 | 44 | 98.64 |
| 12 | 3610 | 3588 | 3267 | 3216 | 51 | 98.44 |
| 13 | 3624 | 3565 | 3251 | 3206 | 45 | 98.62 |
| 14 | 3624 | 3567 | 3271 | 3235 | 36 | 98.90 |
| 15 | 3762 | 3611 | 3293 | 3245 | 48 | 98.54 |
| 16 | 3770 | 3614 | 3336 | 3292 | 44 | 98.68 |
| 17 | 3749 | 3629 | 3311 | 3268 | 43 | 98.70 |
| 18 | 3712 | 3711 | 3346 | 3310 | 36 | 98.92 |
| 19 | 3723 | 2931 | 2740 | 2703 | 37 | 98.65 |
| 20 | 3770 | 2999 | 2826 | 2797 | 29 | 98.97 |
| 21 | 3763 | 2957 | 2791 | 2759 | 32 | 98.85 |
| 22 | 3732 | 2736 | 2593 | 2570 | 23 | 99.11 |
| 23 | 3600 | 2952 | 2779 | 2756 | 23 | 99.17 |
| 24 | 3635 | 2972 | 2788 | 2752 | 36 | 98.71 |
| 25 | 3651 | 3036 | 2839 | 2810 | 29 | 98.98 |
Fig 7Difference in number of variants between FF and FFPE samples for all matched FF/FFPE pairs.
A: SNVs. B: INDELs.
Fig 8Rate of SNVs C.G > T.A substitution in FF and FFPE samples.
A: FF and FFPE samples. B: FFPE samples grouped by extraction method.
Selection of annotated tumor-specific variants found in common between FF and FFPE colon samples.
Chr: chromosome number. Position: position of the variant on the chromosome. Ns: number of samples in which the variant was found. COSMIC ID: COSMIC database identification code. Gene Symbol: HGNC gene symbol. Pathways: selection of KEGG or REACTOME pathways in which the gene is involved. FFV: mean variant allelic frequency (%) for FF samples. FFD: mean read depth for the position for FF samples (coverage). FEV: mean variant allelic frequency (%) for FFPE samples. FED: mean read depth for the position for FFPE samples (coverage).
| Chr | Position | Ns | COSMIC ID | Gene Symbol | Pathways | FFV | FFD | FEV | FED |
|---|---|---|---|---|---|---|---|---|---|
| chr1 | 228109626 | 10 | COSM4389123 | WNT9A | WNT signaling pathway | 24 | 208 | 22 | 428 |
| Basal cell carcinoma | |||||||||
| Melanogenesis | |||||||||
| Pathways in cancer | |||||||||
| Hedgehog signaling pathway | |||||||||
| chr2 | 231867435 | 10 | COSM5499569 | SPATA3 | 21 | 116 | 26 | 145 | |
| chr2 | 238990381 | 9 | COSM5471498 | UBE2F-SCLY | Selenoaminoacid metabolism | 23 | 153 | 22 | 226 |
| COSM5471497 | SCLY | ||||||||
| chr3 | 98188932 | 10 | COSM262658 | OR5K1 | Olfactory transduction | 22 | 166 | 19 | 85 |
| chr5 | 112175594 | 9 | COSM19705 | APC | WNT signaling pathway | 21 | 142 | 22 | 73 |
| CTC-554D6.1 | Endometrial cancer | ||||||||
| Pathways in cancer | |||||||||
| Colorectal cancer | |||||||||
| Basal cell carcinoma | |||||||||
| chr5 | 140432413 | 7 | COSM125200 | PCDHB1 | Signaling by Rho GTPases | 24 | 79 | 25 | 54 |
| chr6 | 36982783 | 10 | COSM3076428 | FGD2 | 22 | 130 | 18 | 211 | |
| chr10 | 50315817 | 10 | COSM259693 | VSTM4 | 22 | 180 | 21 | 286 | |
| COSM259692 | |||||||||
| chr10 | 89692910 | 10 | COSM5032 | PTEN | Pathways in cancer | 31 | 153 | 31 | 81 |
| Tight junction | |||||||||
| Prostate cancer | |||||||||
| PI signaling system | |||||||||
| Melanoma, Glioma | |||||||||
| P53 signaling pathway | |||||||||
| chr11 | 64083221 | 3 | COSM300694 | TRMT112 | Peroxisome | 15 | 60 | 10 | 120 |
| ESRRA | Nuclear receptor transcription | ||||||||
| PRDX5 | Generic transcription | ||||||||
| chr11 | 76796018 | 3 | COSM4592217 | CAPN5 | 25 | 88 | 27 | 178 | |
| chr12 | 25398284 | 9 | COSM1140133 | KRAS | Pathways in cancer | 19 | 88 | 20 | 61 |
| COSM49168 | Prostate cancer | ||||||||
| COSM520 | Endometrial cancer | ||||||||
| Acute myeloid leukemia | |||||||||
| Non small cell lung cancer | |||||||||
| Glioma, Thyroid cancer | |||||||||
| Colorectal cancer | |||||||||
| ERBB signaling pathway | |||||||||
| VEGF signaling pathway | |||||||||
| chr13 | 111109670 | 3 | COL4A2-AS2 | ECM receptor interaction | 48 | 169 | 48 | 212 | |
| COL4A2 | Pathways in cancer | ||||||||
| Small cell lung cancer | |||||||||
| Focal adhesion | |||||||||
| chr20 | 62076690 | 9 | COSM2932051 | KCNQ2, | Developmental biology | 23 | 112 | 19 | 190 |
| COSM2932050 | RP11-358D14.2 | Potassium channels | |||||||
| COSM2932052 | RP11-358D14.2 | Neuronal system | |||||||
| COSM4100400 | |||||||||
| chrX | 37028002 | 10 | COSM1319445 | FAM47C | 46 | 74 | 45 | 95 | |
| COSM1319446 |
Coverage, variant calling and technical quality indicators for the three FFPE DNA extraction kits.
Values are given for quantitative indicators. The number of stars between brackets (one, two or three) indicate the relative ranking of a given kit for the indicator and for the samples we have analyzed in this study. Technical indicators (qualitative and quantitative) describe how easy or practical it is to use the kits, especially with the aim of analyzing large number of samples, based on our experience in this study.
| GeneRead | Maxwell | QIAamp | |
|---|---|---|---|
| DNA Integrity Number median value | 2.35 (**) | 1.75 (*) | 3.3 (***) |
| DNA fragment length in bp | 1424 (**) | 988 (*) | 1622 (***) |
| Median percentage of positions with coverage ≥ 30X | 97.6 (***) | 95.5 (*) | 97.7 (***) |
| Median coverage values | 76X (**) | 60X (*) | 87.5X (***) |
| Percentage of duplicated reads | 15.5 (*) | 7.4 (***) | 12.2 (*) |
| Percentage of reads mapping outside target regions | 19.6 (**) | 21.4 (*) | 19.1 (**) |
| C.T > G.A conversion rate in SNV calls | 0.372 | 0.374 | 0.372 |
| Median value for the variant difference FF/FFPE | 214 (***) | 3367 (*) | 667 (**) |
| Median value for the percentage of concordance FF/FFPE | 99.9 (***) | 99.9 (***) | 99.9 (***) |
| Purification technique easiness | (*) | (***) | (*) |
| Possibility of elution step optimization | (**) | (**) | |
| Output tube format and transfer | (***) | (*) | (***) |
| Max number of samples per run | 12 (**) | 16 (***) | 12 (**) |
| Input material quantity (nb of 10 | 1 (*) | 1 to 16 (***) | 1 to 8 (**) |
1 The values are calculated on bam files normalized to 80M reads.
2 The value corresponds to the absolute value of the difference between the number of variants in FF versus FFPE sample pairs.
3 The percentage of concordance is determined on the common variants positions between FF and FFPE sample pairs.
4 Purification for the Maxwell kit is based on magnetic beads and cassettes, GeneRead and QIAamp kits use columns which are more time consuming to manipulate.
5 The elution volume is fixed for the Maxwell kit, leaving no room for optimization, while with the QIAGEN kits, it is possible to optimize the final concentration by playing with the elution volume and/or to warm up the elution buffer to increase efficiency.
6 For the Maxwell kit, the format of the output tube (0.5 ml) is not practical, and there are magnetic beads residues in the tubes, necessitating a transfer in a more adapted tube for further processing. These problems are not present for the QIAGEN kits.
7 The higher the number of tissue slices per extraction, the more flexible it is to obtain enough material for sequencing.