| Literature DB >> 33863344 |
Binsheng Gong1, Dan Li1, Rebecca Kusko2, Natalia Novoradovskaya3, Yifan Zhang1,4, Shangzi Wang5, Carlos Pabón-Peña6, Zhihong Zhang7, Kevin Lai8, Wanshi Cai9, Jennifer S LoCoco10, Eric Lader11, Todd A Richmond12, Vinay K Mittal13, Liang-Chun Liu14, Donald J Johann15, James C Willey16, Pierre R Bushel17, Ying Yu5, Chang Xu11, Guangchun Chen18, Daniel Burgess19, Simon Cawley20, Kristina Giorda21, Nathan Haseley10, Fujun Qiu7, Katherine Wilkins6, Hanane Arib22, Claire Attwooll10, Kevin Babson23, Longlong Bao24,25,26, Wenjun Bao27, Anne Bergstrom Lucas6, Hunter Best28,29, Ambica Bhandari30, Halil Bisgin31, James Blackburn32,33, Thomas M Blomquist34,35, Lisa Boardman36, Blake Burgher37, Daniel J Butler38, Chia-Jung Chang39, Alka Chaubey23, Tao Chen40, Marco Chierici41, Christopher R Chin38, Devin Close29, Jeffrey Conroy37, Jessica Cooley Coleman23, Daniel J Craig42, Erin Crawford42, Angela Del Pozo43,44, Ira W Deveson45,46, Daniel Duncan47, Agda Karina Eterovic48, Xiaohui Fan49, Jonathan Foox38, Cesare Furlanello41,50, Abhisek Ghosal30, Sean Glenn37, Meijian Guan27, Christine Haag51, Xinyi Hang9, Scott Happe52, Brittany Hennigan23, Jennifer Hipp53, Huixiao Hong1, Kyle Horvath30, Jianhong Hu54, Li-Yuan Hung55, Mirna Jarosz56, Jennifer Kerkhof57, Benjamin Kipp58, David Philip Kreil59, Paweł Łabaj60,61, Pablo Lapunzina44,62,63, Peng Li55, Quan-Zhen Li18, Weihua Li64, Zhiguang Li65, Yu Liang66, Shaoqing Liu67, Zhichao Liu1, Charles Ma47, Narasimha Marella47, Rubén Martín-Arenas68, Dalila B Megherbi69, Qingchang Meng54, Piotr A Mieczkowski70, Tom Morrison71, Donna Muzny54, Baitang Ning1, Barbara L Parsons40, Cloud P Paweletz72, Mehdi Pirooznia73, Wubin Qu9, Amelia Raymond74, Paul Rindler29, Rebecca Ringler30, Bekim Sadikovic57,75, Andreas Scherer44,76, Egbert Schulze77, Robert Sebra22, Rita Shaknovich47, Qiang Shi78, Tieliu Shi79, Juan Carlos Silla-Castro80, Melissa Smith22, Mario Solís López43,44, Ping Song48, Daniel Stetson74, Maya Strahl22, Alan Stuart57, Julianna Supplee72, Philippe Szankasi29, Haowen Tan81, Lin-Ya Tang48, Yonghui Tao24,25,26, Shraddha Thakkar1, Danielle Thierry-Mieg82, Jean Thierry-Mieg82, Venkat J Thodima47, David Thomas33,45, Boris Tichý44,83, Nikola Tom44,83, Elena Vallespin Garcia43,44, Suman Verma30, Kimbley Walker54, Charles Wang84,85, Junwen Wang86,87,88, Yexun Wang11, Zhining Wen89, Valtteri Wirta90, Leihong Wu1, Chunlin Xiao91, Wenzhong Xiao39,55, Shibei Xu92, Mary Yang4, Jianming Ying64, Shun H Yip86,93, Guangliang Zhang94, Sa Zhang94, Meiru Zhao95, Yuanting Zheng5, Xiaoyan Zhou24,25,26, Christopher E Mason38, Timothy Mercer96,97, Weida Tong1, Leming Shi98,99,100, Wendell Jones101, Joshua Xu102.
Abstract
BACKGROUND: Targeted sequencing using oncopanels requires comprehensive assessments of accuracy and detection sensitivity to ensure analytical validity. By employing reference materials characterized by the U.S. Food and Drug Administration-led SEquence Quality Control project phase2 (SEQC2) effort, we perform a cross-platform multi-lab evaluation of eight Pan-Cancer panels to assess best practices for oncopanel sequencing.Entities:
Keywords: Analytical performance; Molecular diagnostics; Oncopanel sequencing; Precision medicine; Reproducibility; Target enrichment
Mesh:
Substances:
Year: 2021 PMID: 33863344 PMCID: PMC8051090 DOI: 10.1186/s13059-021-02315-0
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Summary of recommendations
| Issue | Recommendation |
|---|---|
| A reference material (or a set of reference materials) with a high density of known variants spanning a range of low allele frequencies (e.g., from above 1 to 20%) is needed to assess the analytical performance of oncopanels (Fig. | |
| Sensitivity was found to be high (> 96.5%) for variants previously verified to have variant allele frequency (VAF) greater than 5% (Fig. | |
| Utilizing a sample spiked-in at a specified amount (e.g., 5%) can provide additional variants at known allele frequencies for analytical validation of oncopanels. | |
| The sensitivity for detecting insertion and deletion variants (indels) is typically more variable and poorer than single nucleotide variants (SNVs), and this difference becomes more pronounced at low VAFs (Fig. | |
| Reference materials can be used to establish an optimal VAF threshold that reduces false positives (FPs) and retains sensitivity. Indels and SNVs may require different VAF thresholds to optimize performance (Fig. | |
| In applications where a minimal FP rate is required, raising the allele frequency threshold was effective at reducing FPs. The additional restriction of analysis to the consensus targeted regions (CTR) can further reduce FP rate. | |
| Genomic location can impact the rate of FPs detected, and we recommend that analytical validation of panels is independently performed inside and outside of the CTR (Fig. | |
| Measuring the analytical performance of a panel in multiple labs is critical to establish reproducibility (Fig. | |
| TMB estimation should be confined to the CTR of each panel. Applying a minimal VAF threshold was helpful to reduce FPs and improve TMB evaluation (Fig. |
Fig. 1Comprehensive study design for assessing analytical performance of multiple pan-cancer targeted sequencing technologies. a Four samples were tested on 8 pan-cancer panels with at least 3 different test laboratories for each panel. b Basic information of 8 pan-cancer panels is listed in the embedded table (see Additional file 1: Table S1 for detailed information). *All participating panels are for research use only. †QGN’s UMI-aware variant caller is able to call variants with VAF as low as 0.5%. c Each sample had 4 library replicates at each test laboratory. After sequencing, panel-specific variant calling was performed by each panel vendor. d Variant calling results were submitted for performance analysis including sensitivity, false positive call rate, and reproducibility
Fig. 2Reproducibility and sensitivity across VAF ranges for SNVs in the consensus targeted regions. a Table listed the number of known variants in each VAF range (left number), sensitivity (right number) for all 8 panels across all samples tested. For the panels with a built-in VAF threshold, “N/A” is listed if the VAF low bound is much lower than the panel provider’s chosen VAF threshold. The VAF threshold is 2.6% for ILM, 2.0% for IDT, 2.5% for ROC, and 2.5% for TFS, respectively. b Average false positive SNV calls per million across various VAF cutoffs. Jittering was applied to avoid overlapping. c Cross-lab and intra-lab reproducibility (in Phred scale) for variant calls with VAF between 2.5 and 20%
Fig. 3Impact of VAF cutoff, variant type, and genomic region on sensitivity (in Phred scale). a Violin distribution plots of estimated sensitivity for each panel in all sample A and C libraries for known SNVs (in blue on the left side) and other variants (small indels or MNVs, in green on the adjacent right side) with VAF between 2.5 and 20%. Total numbers of small indels and MNVs are listed under the corresponding violin plot. b Artificial VAF filters reduce sensitivity for known positives with VAF between 2.5 and 5% due to the variable VAF measurements. c High concordance of sensitivity in and outside of CTR (more specifically, in HC_CR beyond CTR) for known positives with VAF between 2.5 and 20%. Jittering was applied to one dot at the top right corner to avoid overlapping
Fig. 4Impact of CTR region on FP rate and reproducibility. a FP rate in and outside of the CTR using two different methods (B_low and C_only) at three different VAF cutoffs, 1%, 2.5%, and 5%. C_only was not applied to TFS as sample C was not tested on TFS. FP rates are plotted in squared root scale. b Estimated FP rate within the CTR averaged over three methods at different VAF cutoffs. c Cross-lab reproducibility (in Phred scale) in samples A and C within and outside of the CTR
Fig. 5Coefficient of variation (CV) of TMB. a Technical run-to-run variance of TMB with VAF cutoff above 2.5% was estimated for six panels at different TMB levels. A power-law curve (dashed line) is fitted for each panel. b Technical run-to-run variance of TMB with VAF cutoff above 5%. c The intrinsic CV is plotted with the equation (embedded, see “Methods” for detail) for each panel based on their panel size. The curve (dashed) for size of 1 Mb is also plotted as a reference. d The overall CV is plotted combining technical and intrinsic CV. The solid fitting power-law curve is for TMB (VAF > 2.5%), and the dashed curve is for TMB (VAF > 5%)