Chunxiao Fu1, Michal Marczyk2,3, Michael Samuels4, Alexander J Trevarton1, Jiaxin Qu5, Rosanna Lau1, Lili Du1, Todd Pappas5, Bruno V Sinn6, Rebekah E Gould1, Lajos Pusztai2, Christos Hatzis2, W Fraser Symmans7. 1. Departments of Pathology and Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA. 2. Yale Cancer Center, Yale School of Medicine, New Haven, CT, USA. 3. Department of Data Science and Engineering, Silesian University of Technology, Gliwice, Poland. 4. The Jackson Laboratory, Farmington, CT, USA. 5. Delphi Diagnostics, Austin, TX, USA. 6. Institute of Pathology, Charité Universitätsmedizin Berlin, Berlin, Germany. 7. Departments of Pathology and Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA. fsymmans@mdanderson.org.
Abstract
BACKGROUND: Our objective was to assess whether modifications to a customized targeted RNA sequencing (RNAseq) assay to include unique molecular identifiers (UMIs) that collapse read counts to their source mRNA counts would improve quantification of transcripts from formalin-fixed paraffin-embedded (FFPE) tumor tissue samples. The assay (SET4) includes signatures that measure hormone receptor and PI3-kinase related transcriptional activity (SETER/PR and PI3Kges), and measures expression of selected activating point mutations and key breast cancer genes. METHODS: Modifications included steps to introduce eight nucleotides-long UMIs during reverse transcription (RT) in bulk solution, followed by polymerase chain reaction (PCR) of labeled cDNA in droplets, with optimization of the polymerase enzyme and reaction conditions. We used Lin's concordance correlation coefficient (CCC) to measure concordance, including precision (Rho) and accuracy (Bias), and nonparametric tests (Wilcoxon, Levene's) to compare the modified (NEW) SET4 assay to the original (OLD) SET4 assay and to whole transcriptome RNAseq using RNA from matched fresh frozen (FF) and FFPE samples from 12 primary breast cancers. RESULTS: The modified (NEW) SET4 assay measured single transcripts (p< 0.001) and SETER/PR (p=0.002) more reproducibly in technical replicates from FFPE samples. The modified SET4 assay was more precise for measuring single transcripts (Rho 0.966 vs 0.888, p< 0.01) but not multigene expression signatures SETER/PR (Rho 0.985 vs 0.968) or PI3Kges (Rho 0.985 vs 0.946) in FFPE, compared to FF samples. It was also more precise than wtRNAseq of FFPE for measuring transcripts (Rho 0.986 vs 0.934, p< 0.001) and SETER/PR (Rho 0.993 vs 0.915, p=0.004), but not PI3Kges (Rho 0.988 vs 0.945, p=0.051). Accuracy (Bias) was comparable between protocols. Two samples carried a PIK3CA mutation, and measurements of transcribed mutant allele fraction was similar in FF and FFPE samples and appeared more precise with the modified SET4 assay. Amplification efficiency (reads per UMI) was consistent in FF and FFPE samples, and close to the theoretically expected value, when the library size exceeded 400,000 aligned reads. CONCLUSIONS: Modifications to the targeted RNAseq protocol for SET4 assay significantly increased the precision of UMI-based and reads-based measurements of individual transcripts, multi-gene signatures, and mutant transcript fraction, particularly with FFPE samples.
BACKGROUND: Our objective was to assess whether modifications to a customized targeted RNA sequencing (RNAseq) assay to include unique molecular identifiers (UMIs) that collapse read counts to their source mRNA counts would improve quantification of transcripts from formalin-fixed paraffin-embedded (FFPE) tumor tissue samples. The assay (SET4) includes signatures that measure hormone receptor and PI3-kinase related transcriptional activity (SETER/PR and PI3Kges), and measures expression of selected activating point mutations and key breast cancer genes. METHODS: Modifications included steps to introduce eight nucleotides-long UMIs during reverse transcription (RT) in bulk solution, followed by polymerase chain reaction (PCR) of labeled cDNA in droplets, with optimization of the polymerase enzyme and reaction conditions. We used Lin's concordance correlation coefficient (CCC) to measure concordance, including precision (Rho) and accuracy (Bias), and nonparametric tests (Wilcoxon, Levene's) to compare the modified (NEW) SET4 assay to the original (OLD) SET4 assay and to whole transcriptome RNAseq using RNA from matched fresh frozen (FF) and FFPE samples from 12 primary breast cancers. RESULTS: The modified (NEW) SET4 assay measured single transcripts (p< 0.001) and SETER/PR (p=0.002) more reproducibly in technical replicates from FFPE samples. The modified SET4 assay was more precise for measuring single transcripts (Rho 0.966 vs 0.888, p< 0.01) but not multigene expression signatures SETER/PR (Rho 0.985 vs 0.968) or PI3Kges (Rho 0.985 vs 0.946) in FFPE, compared to FF samples. It was also more precise than wtRNAseq of FFPE for measuring transcripts (Rho 0.986 vs 0.934, p< 0.001) and SETER/PR (Rho 0.993 vs 0.915, p=0.004), but not PI3Kges (Rho 0.988 vs 0.945, p=0.051). Accuracy (Bias) was comparable between protocols. Two samples carried a PIK3CA mutation, and measurements of transcribed mutant allele fraction was similar in FF and FFPE samples and appeared more precise with the modified SET4 assay. Amplification efficiency (reads per UMI) was consistent in FF and FFPE samples, and close to the theoretically expected value, when the library size exceeded 400,000 aligned reads. CONCLUSIONS: Modifications to the targeted RNAseq protocol for SET4 assay significantly increased the precision of UMI-based and reads-based measurements of individual transcripts, multi-gene signatures, and mutant transcript fraction, particularly with FFPE samples.
Entities:
Keywords:
Assay development; Breast cancer; Targeted RNAseq; Unique molecular identifiers
Authors: Sherene Loi; Benjamin Haibe-Kains; Samira Majjaj; Francoise Lallemand; Virginie Durbecq; Denis Larsimont; Ana M Gonzalez-Angulo; Lajos Pusztai; W Fraser Symmans; Alberto Bardelli; Paul Ellis; Andrew N J Tutt; Cheryl E Gillett; Bryan T Hennessy; Gordon B Mills; Wayne A Phillips; Martine J Piccart; Terence P Speed; Grant A McArthur; Christos Sotiriou Journal: Proc Natl Acad Sci U S A Date: 2010-05-17 Impact factor: 11.205
Authors: Stephen J Luen; Rebecca Asher; Chee Khoon Lee; Peter Savas; Roswitha Kammler; Patrizia Dell'Orto; Olivia Maria Biasi; David Demanse; Lellean JeBailey; Sinead Dolan; Wolfgang Hackl; Beat Thuerlimann; Giuseppe Viale; Marco Colleoni; Meredith M Regan; Sherene Loi Journal: JAMA Oncol Date: 2018-10-01 Impact factor: 31.777
Authors: Bruno V Sinn; Chunxiao Fu; Rosanna Lau; Jennifer Litton; Tsung-Heng Tsai; Rashmi Murthy; Alda Tam; Eleni Andreopoulou; Yun Gong; Ravi Murthy; Rebekah Gould; Ya Zhang; Tari A King; Agnes Viale; Victor Andrade; Dilip Giri; Roberto Salgado; Ioanna Laios; Christos Sotiriou; Esmeralda C Marginean; Danielle N Kwiatkowski; Rachel M Layman; Daniel Booser; Christos Hatzis; V Vicente Valero; W Fraser Symmans Journal: NPJ Breast Cancer Date: 2019-05-30
Authors: Michal Marczyk; Chunxiao Fu; Rosanna Lau; Lili Du; Alexander J Trevarton; Bruno V Sinn; Rebekah E Gould; Lajos Pusztai; Christos Hatzis; W Fraser Symmans Journal: BMC Cancer Date: 2019-12-05 Impact factor: 4.430
Authors: Mark Johnson; Irena Zaretskaya; Yan Raytselis; Yuri Merezhuk; Scott McGinnis; Thomas L Madden Journal: Nucleic Acids Res Date: 2008-04-24 Impact factor: 16.971