| Literature DB >> 30529127 |
Richard A Blidner1, Brian C Haynes1, Stephen Hyter2, Sarah Schmitt2, Ziyan Y Pessetto2, Andrew K Godwin3, Dan Su4, Patrick Hurban4, Léon C van Kempen5, Maria L Aguirre5, Shobha Gokul1, Robyn D Cardwell1, Gary J Latham6.
Abstract
Lung cancer accounts for approximately 14% of all newly diagnosed cancers and is the leading cause of cancer-related deaths. Chimeric RNA resulting from gene fusions (RNA fusions) and other RNA splicing errors are driver events and clinically addressable targets for non-small cell lung cancer (NSCLC). The reliable assessment of these RNA markers by next-generation sequencing requires integrated reagents, protocols, and interpretive software that can harmonize procedures and ensure consistent results across laboratories. We describe the development and verification of a system for targeted RNA sequencing for the analysis of challenging, low-input solid tumor biopsies that includes reagents for nucleic acid quantification and library preparation, run controls, and companion bioinformatics software. Assay development reconciled sequence discrepancies in public databases, created predictive formalin-fixed, paraffin-embedded RNA qualification metrics, and eliminated read misidentification attributable to index hopping events on the next-generation sequencing flow cell. The optimized and standardized system was analytically verified internally and in a multiphase study conducted at five independent laboratories. The results show accurate, reproducible, and sensitive detection of RNA fusions, alternative splicing events, and other expression markers of NSCLC. This comprehensive approach, combining sample quantification, quality control, library preparation, and interpretive bioinformatics software, may accelerate the routine implementation of targeted RNA sequencing of formalin-fixed, paraffin-embedded samples relevant to NSCLC.Entities:
Mesh:
Year: 2018 PMID: 30529127 PMCID: PMC7057224 DOI: 10.1016/j.jmoldx.2018.10.003
Source DB: PubMed Journal: J Mol Diagn ISSN: 1525-1578 Impact factor: 5.341
Targeted RNA Sequencing Panel Coverage Includes 107 Recurrent Gene Fusions, 3′/5′ Imbalance Targets, MET Exon 14 Skipping, and 23 mRNA Expression Targets
| Coverage type | Markers |
|---|---|
| Fusion transcript | CLTC-ALK ( |
| Imbalance 3′/5′ | ALK, ROS1, RET, NTRK1 |
| Exon skipping | MET exons 13–15 |
| Expression markers | ABCB1, BRCA1, CDKN2A, CTLA4, ERCC1, ESR1, FGFR1, FGFR2, IFNGR, ISG15, MET, MSLN, PTEN, RRM1, TDP1, TERT, TLE3, TOP1, TUBB3, TYMS, CD274 (PDL1), PDCD1LG2 (PDL2), PDCD1 (PD1) |
| Reference markers | TBP, RAB5C, GGNBP2 |
Figure 1The targeted RNA sequencing workflow integrates the analysis of RNA targets with quantification of amplifiable RNA. After reverse transcription of the sample, the cDNA is quantified for preanalytical quality control (QC) and advanced to targeted enrichment and tagging. The resultant libraries are purified, quantified, and pooled for next-generation sequencing analysis. Sequence data are analyzed using a custom bioinformatics pipeline. TNA, total nucleic acid.
Figure 2Next-generation sequencing (NGS) detection sensitivity and on-target reads as a function of formalin-fixed, paraffin-embedded (FFPE) RNA functional yields. A: Fusion-positive libraries (98 of the total 601) evaluated by NGS. Blue points represent true-positive calls, and orange points represent false-negative calls. No false-negative calls are observed above 200 cDNA functional copy input and 5% fraction of fusion-positive input from admixtures. B: Percentage of total NGS reads that passed filter (mapping to intended targets) over a range of functional copy inputs for FFPE RNA (purple points) and cell line RNA (green points). Dashed lines in A and B represents the minimum threshold of 200 functional copies. C: A total of 100 FFPE samples binned into groups, indicating the amount of RNA mass (in nanograms) required to meet the cDNA functional copy minimum input of 200. Solid gray line represents the cumulative number of samples that meet the copy number requirement to pass quality control.
Figure 3Identification of positive targets at 800 functional copies of input using low-positive cell line samples. A: Fusions were detected at 1% positive sample mixture for all replicates. B:MET exon 14 skipping events were detected at 1% positive sample mixture for all replicates. C: Corresponding imbalances were detected at 15% positive sample mixture for all replicates. Blue points represent true-positive calls, and orange points represent false-negative calls.
Figure 4Quantitative measures of mRNA expression markers are consistent across replicate libraries. A: Libraries co-clustered by source sample type; x axis represents the hierarchical clustering by gene, and y axis represents the hierarchical clustering by library. Yellow shading in graph indicates higher levels of expression, whereas blue shading indicates lower relative levels of expression. Along the y axis, nontemplate control (black shading); HL-60 cell line DNA with (blue shading) and without (dark green shading) synthetic template spike-ins; and formalin-fixed, paraffin-embedded (FFPE) positive for EML4-ALK (beige and red shading) along with a cell line mixture positive for MET e14 (yellow shading) with the tissue mix (light green shading) and FFPE negative for any calls (purple shading) are also shown. B: All 23 expression targets from two replicate libraries from a single sample were prepared by two operators and sequenced on different next-generation sequencing runs. Dashed line represents the line of best fit for linear regression.
Targeted RNA-Seq QC Summary of Sample Libraries Assessed by Five Laboratories in a Multisite Study
| Study group | No. of test samples | ||
|---|---|---|---|
| Pass | At risk | Fail | |
| Site 1 | 30/30 | 0/30 | 0/30 |
| Site 2 | 56/58 | 0/58 | 2/58 |
| Site 3 | 54/64 | 10/64 | 0/64 |
| Site 4 | 59/64 | 5/64 | 0/64 |
| Site 5 | 11/20 | 9/20 | 0/20 |
| Total | 210/236 | 24/236 | 2/236 |
| Positive controls | 14/14 | 0/14 | 0/14 |
| Negative controls | 0/14 | 0/14 | 14/14 |
Eighty-nine percent of libraries passed quality control, 10% of libraries were at risk, and 1% of libraries failed quality control across all five sites (excluding controls). At risk libraries coincided with poor next-generation sequencing cluster density, and failed libraries were consistent with errors in library generation. All positive and negative control libraries yielded the expected results.
Summary of RNA Variant Call Accuracy in the Multisite Study Using the Targeted RNA Sequencing System
| Accuracy | No. |
|---|---|
| Fusion and splice variants | |
| Total libraries | 248 |
| True-positive calls | 130 |
| True-negative calls | 118 |
| False-negative calls | 0 |
| False-positive calls | 0 |
| Imbalance | |
| Total libraries | 224 |
| True-positive calls | 84 |
| True-negative calls | 140 |
| False-negative calls | 0 |
| False-positive calls | 0 |
Fusion and splice-variant calls and 3′/5′ imbalance calls are shown for sample next-generation sequencing libraries that met the defined inclusion and exclusion criteria.
Method Comparisons for the Detection of Non–Small Cell Lung Cancer Fusions
| Sample ID | FISH | IHC (ALK only) | NanoString nCounter calls | Targeted RNA sequencing calls | ||
|---|---|---|---|---|---|---|
| Fusion | Imbalance | Fusion | Imbalance | |||
| Sample 2 | (−) | (−) | (−) | (−) | (−) | (−) |
| Sample 3 | (−) | SLC34A2(4)-ROS1(32) | (−) | SLC34A2(4)-ROS1(32) | (−)∗ | |
| Sample 10 | (−) | EZR(10)-ROS1(34) | ROS1 | EZR(10)-ROS1(34) | ROS1 | |
| Sample 11 | ALK | EML4(6)-ALK(20) | (−) | EML4(6)-ALK(20) | (−) | |
| ERL02 | NA | NA | EML4(6)-ALK(20) | ALK | EML4(6)-ALK(20) | ALK |
| ERL10 | NA | NA | SLC34A2(4)-ROS1(32) | ROS1 | SLC34A2(4)-ROS1(32) | ROS1 |
| ERL14 | NA | NA | EML4(6)-ALK(20) | ALK | EML4(6)-ALK(20) | ALK |
| ERL15 | NA | NA | EML4(6)-ALK(20) | ALK | EML4(6)-ALK(20) | ALK |
| ERL16 | NA | NA | EML4(6)-ALK(20) | ALK | EML4(6)-ALK(20) | ALK |
| ERL17 | NA | NA | EML4(6)-ALK(20) | ALK | EML4(6)-ALK(20) | ALK |
| ERL18 | NA | NA | EML4(6)-ALK(20) | ALK | EML4(6)-ALK(20) | ALK |
| ERL19 | NA | NA | EML4(6)-ALK(20) | ALK | EML4(6)-ALK(20) | ALK |
| ERL20 | NA | NA | Unknown ALK fusion | ALK | EML4(6)-ALK(20) | ALK |
Residual clinical samples are denoted as sample, whereas the subset of test samples from the multisite precision studies are denoted as ERL. An additional seven clinical samples and six samples from the multisite study were negative for fusions and imbalances across all assays and are not shown in the table.
FISH, fluorescence in situ hybridization; IHC, immunohistochemistry; NA, not applicable.
Evidence of imbalance below threshold.