Literature DB >> 31651965

Comparative Tumor RNA Sequencing Analysis for Difficult-to-Treat Pediatric and Young Adult Patients With Cancer.

Olena M Vaske1,2, Isabel Bjork2, Sofie R Salama2,3, Holly Beale1,2, Avanthi Tayi Shah4, Lauren Sanders2, Jacob Pfeil2, Du L Lam2, Katrina Learned2, Ann Durbin2, Ellen T Kephart2, Rob Currie2, Yulia Newton2, Teresa Swatloski2, Duncan McColl2, John Vivian2, Jingchun Zhu2, Alex G Lee4, Stanley G Leung4, Aviv Spillinger4, Heng-Yi Liu4, Winnie S Liang5, Sara A Byron5, Michael E Berens6, Adam C Resnick7, Norman Lacayo8, Sheri L Spunt8, Arun Rangaswami8, Van Huynh9, Lilibeth Torno9, Ashley Plant9, Ivan Kirov9, Keri B Zabokrtsky9, S Rod Rassekh10, Rebecca J Deyell10, Janessa Laskin11, Marco A Marra12,13, Leonard S Sender9, Sabine Mueller14,15,16, E Alejandro Sweet-Cordero4, Theodore C Goldstein2,17, David Haussler2,3.   

Abstract

Importance: Pediatric cancers are epigenetic diseases; therefore, considering tumor gene expression information is necessary for a complete understanding of the tumorigenic processes. Objective: To evaluate the feasibility and utility of incorporating comparative gene expression information into the precision medicine framework for difficult-to-treat pediatric and young adult patients with cancer. Design, Setting, and Participants: This cohort study was conducted as a consortium between the University of California, Santa Cruz (UCSC) Treehouse Childhood Cancer Initiative and clinical genomic trials. RNA sequencing (RNA-Seq) data were obtained from the following 4 clinical sites and analyzed at UCSC: British Columbia Children's Hospital (n = 31), Lucile Packard Children's Hospital at Stanford University (n = 80), CHOC Children's Hospital and Hyundai Cancer Institute (n = 46), and the Pacific Pediatric Neuro-Oncology Consortium (n = 24). The study dates were January 1, 2016, to March 22, 2017. Exposures: Participants underwent tumor RNA-Seq profiling as part of 4 separate clinical trials at partner hospitals. The UCSC either downloaded RNA-Seq data from a partner institution for analysis in the cloud or provided a Docker pipeline that performed the same analysis at a partner institution. The UCSC then compared each participant's tumor RNA-Seq profile with more than 11 000 uniformly analyzed tumor profiles from pediatric and young adult patients with cancer, downloaded from public data repositories. These comparisons were used to identify genes and pathways that are significantly overexpressed in each patient's tumor. Results of the UCSC analysis were presented to clinical partners. Main Outcomes and Measures: Feasibility of a third-party institution (UCSC Treehouse Childhood Cancer Initiative) to obtain tumor RNA-Seq data from patients, conduct comparative analysis, and present analysis results to clinicians; and proportion of patients for whom comparative tumor gene expression analysis provided useful clinical and biological information.
Results: Among 144 samples from children and young adults (median age at diagnosis, 9 years; range, 0-26 years; 72 of 118 [61.0%] male [26 patients sex unknown]) with a relapsed, refractory, or rare cancer treated on precision medicine protocols, RNA-Seq-derived gene expression was potentially useful for 99 of 144 samples (68.8%) compared with DNA mutation information that was potentially useful for only 34 of 74 samples (45.9%). Conclusions and Relevance: This study's findings suggest that tumor RNA-Seq comparisons may be feasible and highlight the potential clinical utility of incorporating such comparisons into the clinical genomic interpretation framework for difficult-to-treat pediatric and young adult patients with cancer. The study also highlights for the first time to date the potential clinical utility of harmonized publicly available genomic data sets.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31651965      PMCID: PMC6822083          DOI: 10.1001/jamanetworkopen.2019.13968

Source DB:  PubMed          Journal:  JAMA Netw Open        ISSN: 2574-3805


Introduction

We present a framework for comparative RNA sequencing (RNA-Seq) analysis of pediatric tumors across multiple precision medicine studies. Our framework uses public genomic data sets of more than 11 000 tumor RNA-Seq samples that we consolidated and released to the community. We describe an application of our framework and the data compendium to the analysis of 144 tumors from children and young adults with a relapsed, refractory, or rare cancer, studied on 4 separate precision medicine trials in the United States and Canada. While genomic profiling of tumors is becoming the standard of care in oncology, many tumors, especially in children, do not harbor actionable DNA aberrations. Tumor gene expression information may increase the number of actionable aberrations detected in tumors, and its utility is being evaluated in adults (eg, the WINTHER trial[1]). Results of several studies suggested the possible clinical utility of RNA-Seq for children. The Michigan Oncology Sequencing Center's Peds-MiOncoSeq study[2] evaluated 92 patients with relapsed or refractory tumors using a combination of whole-exome sequencing (WES) and RNA-Seq and reported that 46% of samples had actionable findings, including 36% of this subset that had gene fusions with a known or suspected role in tumorigenesis identified through RNA-Seq analysis. In another study[3] of 59 children, most with relapsed or refractory cancers, analysis revealed actionable findings, including RNA fusions, in 51% of cases. The Individualized Therapy for Relapsed Malignancies in Childhood (INFORM) consortium[4] studied 57 patients with WES, low-coverage whole-genome sequencing, RNA-Seq, methylation, and gene expression microarrays and reported a 50% rate of actionable findings that included overexpression of druggable oncogenes. Several patients whose tumors exhibited oncogene overexpression were placed on targeted therapies against these alterations.[4] Finally, the Precision in Pediatric Sequencing (PIPseq) program[5] profiled 65 patients using a combination of tumor or normal WES and tumor RNA-Seq. Tumor RNA-Seq identified therapeutic targets in 23% of the patients; these targets included overexpression of druggable oncogenes, defined based on comparisons of tumor RNA-Seq expression with the RNA-Seq expression levels in a panel of normal tissues. While results of these studies suggested that RNA-Seq expression may be clinically beneficial, they did not provide reproducible methods that could be applied across different precision medicine trials. Our group recently developed a reproducible and scalable approach for performing outlier analysis for pediatric patients with cancer by using large publicly available cancer RNA-Seq data sets.[6] The objective of the present study was to evaluate the feasibility and potential utility of our approach for cancer samples collected prospectively from multiple precision medicine trials in difficult-to-treat pediatric and young adult patients with cancer.

Methods

Study Design

Among 144 tumors from children and young adults, this cohort study was conducted as a consortium of the following 4 clinical sites: British Columbia Children’s Hospital (BCCH), Vancouver, British Columbia, Canada; Lucile Packard Children’s Hospital at Stanford University (LPCH), Stanford, California; CHOC Children’s Hospital and Hyundai Cancer Institute, Orange, California; and the Pacific Pediatric Neuro-Oncology Consortium (PNOC), San Francisco, California. During the period from January 1, 2016, to March 22, 2017, the University of California, Santa Cruz (UCSC) obtained and processed tumor RNA-Seq data, as well as deidentified clinical and molecular information, for 181 tumors from 161 children and young adults with a relapsed, refractory, or rare cancer treated on precision medicine protocols. Tumor RNA-Seq data were obtained from the following 4 clinical sites: BCCH (n = 31), LPCH (n = 80), CHOC (n = 46), and PNOC (n = 24). Each clinical site had its own precision medicine protocol in place, and UCSC Treehouse Childhood Cancer Initiative served as a third-party institution conducting secondary analysis of each site’s tumor RNA-Seq data. This study followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guideline. The BCCH study was approved by the University of British Columbia Research Ethics Committee. The LPCH protocol “Clinical Implementation of Genomic Analysis in Pediatric Malignancies” was approved by the Stanford University Institutional Review Board. The CHOC study “Pilot Project: Molecular Profiles of Newly Diagnosed, Refractory and Recurrent Childhood, Adolescent, and Young Adult Cancers” was approved by the CHOC Children’s Hospital and Hyundai Cancer Institute Institutional Review Board. The PNOC-003 protocol has been previously described.[7] The UCSC Treehouse Childhood Cancer Initiative protocol was approved by the UCSC Institutional Review Board. Because this study involved the sharing of deidentified data, UCSC was not required by our institutional review board to obtain informed consent from study participants; however, clinical partners obtained written informed consent from their participants as per their individual study protocols. All study participants were informed that their deidentified data would be shared with research partners, including UCSC.

Statistical Analysis

Comparative RNA-Seq Analysis

All RNA-Seq data (11 340 compendium samples and 144 samples from clinical partners) were first uniformly processed using the RNA-Seq pipeline version 3.2 developed by the UCSC Computational Genomics Lab[8] (eMethods in the Supplement). The UCSC either downloaded RNA-Seq data from a partner institution for analysis in the cloud or provided a Docker pipeline composed of gene-level expression calculation, which was run at the partner institution; gene expression outlier analysis and identification of druggable genes and pathways was then run on each of the 144 samples at UCSC.

Gene Expression Outlier Analysis

Gene-level transcript per million data were used to perform gene expression outlier analysis[9] to identify transcripts significantly enriched in each patient’s tumor compared with either all 11 340 tumors or tumor types identified as most similar (pan-disease analysis). For pan-cancer analysis, we used the filtered set of 27 084 genes; for pan-disease analysis, we used the unfiltered set of 58 581 unique GENCODE Human Release 23 genes (eMethods in the Supplement) to make sure we did not miss genes whose expression is specific to certain tumor subtypes.

Identification of Druggable Overexpressed Genes and Gene Sets

We obtained the following 3 lists of overexpressed genes: one list from pan-disease outlier analysis, a second list from pan-cancer outlier analysis, and a third list from overlapping genes in pan-disease and pan-cancer lists. For each list, we identified potential druggable genes and statistically enriched pathways.

Drug-Gene Interaction Analysis

We used the Drug-Gene Interaction Database to assess which of the overexpressed genes can be considered actionable by available therapies.[10] The database programmatically searches through publications and other curated databases for reported associations between human genes and available inhibitors. To refine our findings to only existing cancer therapies, we set the Drug-Gene Interaction Database to query for drug-gene interactions among the following 4 curated cancer databases (all part of the Drug-Gene Interaction Database[10]): CIViC, Cancer Commons, My Cancer Genome, and My Cancer Genome Clinical Trial. The Drug-Gene Interaction Database does not contain all known drug-gene interactions, nor does it guarantee a gene’s druggability. As a result, we performed additional literature searches and consulted published clinical cancer genomic studies. We prioritized studies, such as INFORM,[4] in which gene expression information was considered in assessing the actionability of each gene. The 92 genes for which overexpression was considered directly or indirectly actionable in this study are listed in eTable 1 in the Supplement.

Gene Set Overlap Analysis

In parallel to identifying druggable genes, we used the Molecular Signature Database[11] to identify overexpressed cancer pathways in the tumor sample. Gene set overlap analysis computes statistically significant pathways by evaluating the overlap between the input gene list of overexpressed genes and the gene sets from the Molecular Signature Database[11] collections “Hallmark Gene Sets” and “Canonical Pathways.” In this analysis, for each input gene list, we looked at the first 100 reported gene sets that have the false discovery rate (false discovery rate q value) below 0.05.

DNA Mutation Analysis

DNA mutation data were obtained from the following platforms: Foundation Medicine gene panel (LPCH), whole-genome sequencing as part of the Personalized Onco-Genomics Program (POG) (BCCH), NantOmics whole-genome sequencing (CHOC), or Ashion Analytics whole-exome sequencing (PNOC). We used the National Cancer Institute (NCI) Pediatric Molecular Analysis for Therapeutic Choice (hereinafter the NCI Pediatric MATCH) considerations to curate the mutation data reported by the DNA platforms and to classify samples into treatment arms based on the DNA aberrations.[12]

Results

Patient Characteristics

To evaluate the feasibility of comparative RNA-Seq analysis across multiple precision medicine studies, we obtained RNA-Seq data from 181 samples from 161 pediatric and young adult patients (age range, 0-29 years; 65 of 108 [60.2%] male) with a relapsed, refractory, or rare cancer treated at the following 4 clinical sites: BCCH (n = 31), LPCH (n = 80), CHOC (n = 46), and PNOC (n = 24). The age at diagnosis was available for 126 individuals: the median age at diagnosis was 9 years, and the range was 0 to 26 years. Among 144 tumor samples, 46 were from female patients, while 72 were male patients; sex was not reported for 26 samples. RNA sequencing quality control analysis (eMethods in the Supplement) was applied to all 181 samples; of these, 144 samples from 128 patients were of sufficient quality for further analysis. For each case, gene-level transcript per million measurements were computed[8] from tumor RNA-Seq data, which were used in 2 types of analyses to identify expression features of potential clinical relevance (Figure 1).
Figure 1.

Treehouse Workflow

The components in brown are performed by the University of California, Santa Cruz bioinformatics team, while the components in gray are performed by the clinical partners. Calculation of gene-level expression profiles can occur at the University of California, Santa Cruz or at a partner site through the use of portable software. Both the University of California, Santa Cruz and clinical partners participate in research discussions about cases. RNA-Seq indicates RNA sequencing.

Treehouse Workflow

The components in brown are performed by the University of California, Santa Cruz bioinformatics team, while the components in gray are performed by the clinical partners. Calculation of gene-level expression profiles can occur at the University of California, Santa Cruz or at a partner site through the use of portable software. Both the University of California, Santa Cruz and clinical partners participate in research discussions about cases. RNA-Seq indicates RNA sequencing.

Reference Compendium for Tumor Comparisons

To provide a robust reference for tumor comparisons and gene expression outlier detection, we assembled a compendium of 11 340 uniformly analyzed adult, pediatric, and young adult tumor profiles (eTable 2 and eFigure 1 in the Supplement). Of 11 340 samples in the compendium, 1859 (16.4%) were from pediatric, adolescent, and young adult patients with cancer who were younger than 30 years.

Gene Expression Outlier Analysis

Gene expression outlier analysis is a promising method for identifying druggable overexpressed oncogenes in adult tumors.[9,13] We performed gene expression outlier analysis against similar tumors (pan-disease analysis) and against all cancers in our compendium (pan-cancer analysis) (eMethods in the Supplement). The gene expression outliers were analyzed for the presence of genes whose products could be targeted by small molecules directly or indirectly by targeting the downstream signaling pathway (eTable 1 in the Supplement). This list is based on a similar list prepared by the INFORM study[4] and contains 37 genes whose protein products can be targeted directly and 55 genes whose products cannot be targeted but that function in a pathway that can be targeted by a therapy. We hypothesized that aberrant gene dosage of these directly or indirectly actionable genes could be detected by gene expression outlier analysis. We also sought to assess whether multiple members of the same pathways were highly expressed in concert in the same tumor. Of 144 high-quality RNA-Seq data sets, 99 (68.8%) harbored outlier gene expression of 1 of 92 actionable genes. In 75 samples, both an actionable gene and the corresponding pathway were overexpressed using outlier analysis. The most common gene expression outlier was FLT3 (OMIM 136351), overexpressed in 16 samples, all from hematopoietic tumors. This was followed by BTK (OMIM 300300) and CDK6 (OMIM 603368), overexpressed in 14 samples each. While BTK was overexpressed in 14 hematopoietic tumors, CDK6 was overexpressed in both hematopoietic and nonhematopoietic tumors, including neuroblastoma and glioma. The most common gene expression outlier in nonhematopoietic tumors was PTCH1 (OMIM 601309), overexpressed in 11 samples from craniopharyngioma, neurofibroma, sarcoma, glioma, medulloblastoma, and osteosarcoma. The most common overrepresented gene set was receptor tyrosine kinases, overexpressed in 55 samples from all diagnostic categories (Figure 2). Among these, FLT3 was most commonly overexpressed, followed by FGFR1 (OMIM 136350) and PDGFRA (OMIM 173490). While FGFR1 was overexpressed in a variety of nonhematopoietic tumor types, PDGFRA was exclusively overexpressed in brain tumors, and FLT3 was exclusively overexpressed in acute leukemias. Of the 92 actionable genes, 47 were overexpressed in 2 or more samples (Figure 3). For the remaining 45 of the 144 samples (31.3%), our comparative RNA-Seq analysis did not identify any actionable outliers (eTable 3 in the Supplement). An example of Treehouse analysis is provided in eFigure 2 in the Supplement.
Figure 2.

Actionable Gene Expression Outliers Identified Through Comparative RNA Sequencing Analysis of the Cohort

The details of findings in each sample are listed in eTable 3 in the Supplement. BCR indicates B-cell receptor; CNS, central nervous system tumors; HEME, hematopoietic tumors; HSP, heat-shock proteins; JAK/STAT, Janus kinase and signal transducer and activator of transcription signaling pathway; NBL, neuroblastomas; PI3K/AKT/mTOR, phosphatidylinositol-3-kinase (PI3K)/AKT and the mammalian target of rapamycin (mTOR) signaling pathway; RAS/RAF/MEK, mitogen-activated protein kinase RAS/RAF/MEK/ERK pathway; RTK, receptor tyrosine kinases; SHH, sonic hedgehog; and SRC, sarcomas.

Figure 3.

Recurrent Actionable Gene Expression Outliers

Recurrent actionable gene expression outliers (y-axis), colored by gene sets as in Figure 2B, organized by disease (x-axis). Filled black squares denote outliers identified using the pan-cancer analysis approach, while unfilled white squares denote outliers identified by the pan-disease analysis approach. CNS indicates central nervous system tumors; HEME, hematopoietic tumors; NBL, neuroblastoma; and SRC, sarcoma.

Actionable Gene Expression Outliers Identified Through Comparative RNA Sequencing Analysis of the Cohort

The details of findings in each sample are listed in eTable 3 in the Supplement. BCR indicates B-cell receptor; CNS, central nervous system tumors; HEME, hematopoietic tumors; HSP, heat-shock proteins; JAK/STAT, Janus kinase and signal transducer and activator of transcription signaling pathway; NBL, neuroblastomas; PI3K/AKT/mTOR, phosphatidylinositol-3-kinase (PI3K)/AKT and the mammalian target of rapamycin (mTOR) signaling pathway; RAS/RAF/MEK, mitogen-activated protein kinase RAS/RAF/MEK/ERK pathway; RTK, receptor tyrosine kinases; SHH, sonic hedgehog; and SRC, sarcomas.

Recurrent Actionable Gene Expression Outliers

Recurrent actionable gene expression outliers (y-axis), colored by gene sets as in Figure 2B, organized by disease (x-axis). Filled black squares denote outliers identified using the pan-cancer analysis approach, while unfilled white squares denote outliers identified by the pan-disease analysis approach. CNS indicates central nervous system tumors; HEME, hematopoietic tumors; NBL, neuroblastoma; and SRC, sarcoma.

Comparison of RNA-Seq Findings With DNA Mutation Analysis

A small number of childhood tumors contain DNA alterations that may forecast response to molecularly targeted therapies.[14] Children’s Oncology Group NCI Pediatric MATCH[12] is a nationwide basket trial for children and adolescents with relapsed or refractory solid tumors evaluating the use of DNA analysis to match patients to therapies. We had mutation data available for 74 of the 144 samples in our cohort; 52 of 74 were solid tumors. Of 74 solid tumor and leukemia samples, 34 (45.9%) had an actionable abnormality as defined by the NCI Pediatric MATCH study[12] detected by DNA analysis. Fifty-five of 74 samples (74.3%) had an actionable gene expression outlier (eTable 3 in the Supplement) detected by RNA-Seq, 28 (37.8%) had abnormalities detected by both DNA and RNA analysis, 6 (8.1%) had only DNA abnormalities, and 13 (17.6%) had no DNA or RNA abnormalities. Remarkably, 27 samples (36.5%) had only a gene expression dosage abnormality, highlighting the potential utility of comparative RNA-Seq for nominating molecular targets for patients with no DNA findings (Figure 4 and Figure 5).
Figure 4.

Comparison of DNA and RNA Analysis Results

DNA and RNA analysis results were reviewed for 74 samples with both types of data available.

Figure 5.

Utility of RNA Sequencing (RNA-Seq) Analysis

A, RNA-Seq analysis can be used as additional support for DNA aberrations when a single mutated gene is itself highly expressed or downstream genes are highly expressed as a result of the mutation. B, With multiple mutated genes, RNA-Seq analysis can be used to prioritize among them based on high expression of the mutated gene itself or downstream targets. C, If DNA aberration is not expressed, nor are downstream genes, RNA-Seq analysis can be used to deprioritize DNA abnormalities with no evidence of effectiveness at the level of RNA. D, RNA-Seq analysis can reveal an abnormality in the absence of DNA mutation.

Comparison of DNA and RNA Analysis Results

DNA and RNA analysis results were reviewed for 74 samples with both types of data available.

Utility of RNA Sequencing (RNA-Seq) Analysis

A, RNA-Seq analysis can be used as additional support for DNA aberrations when a single mutated gene is itself highly expressed or downstream genes are highly expressed as a result of the mutation. B, With multiple mutated genes, RNA-Seq analysis can be used to prioritize among them based on high expression of the mutated gene itself or downstream targets. C, If DNA aberration is not expressed, nor are downstream genes, RNA-Seq analysis can be used to deprioritize DNA abnormalities with no evidence of effectiveness at the level of RNA. D, RNA-Seq analysis can reveal an abnormality in the absence of DNA mutation. To assess the consistency of DNA and RNA findings, we reviewed 28 samples that had both types of findings. In 11 of 28 samples, at least 1 of the genes with a targetable DNA mutation was identified as a gene expression outlier, suggesting that actionable DNA mutations are often associated with the overexpression of the mutated gene. In 17 of 28 samples, however, none of the genes with a targetable DNA abnormality were identified as a gene expression outlier. Because we do not necessarily expect all mutant genes to be abnormally expressed themselves, we then reviewed the 17 samples to see if there was expression support of the DNA abnormality downstream of the mutated gene. DNA analysis of 2 acute lymphoblastic leukemia samples (TH01_0122_S01 and TH01_0130_S01) revealed a PAX5 (OMIM 167414)–JAK2 (OMIM 147796) fusion, which was previously shown to activate Janus kinase and signal transducer and activator of transcription (JAK/STAT) signaling and promote a progenitor phenotype in leukemia cells.[15] Our comparative gene expression analysis did not reveal the overexpression of the JAK/STAT pathway in these tumors but instead identified overexpression of phosphatidylinositol-3-kinase (PI3K)/AKT and the mammalian target of rapamycin (mTOR) (PI3K/AKT/mTOR) signaling pathway and B-cell receptor signaling pathways in both tumors and overexpression of FLT3 in TH01_0130_S01. The overexpression of PI3K/AKT/mTOR and B-cell receptor signaling pathway genes may be indicative of a progenitor B-cell state assumed by the leukemia cells.[16] Similarly, another acute lymphoblastic leukemia sample (TH01_0129_S01) harbored a BCR–ABL (OMIM 151410) fusion. RNA sequencing revealed outlier expression of PI3K/AKT/mTOR and B-cell receptor signaling pathways; PI3K/AKT/mTOR activation is known to be downstream of the BCR-ABL fusion signaling,[17] suggesting that this overexpression is consistent with the DNA finding of the gene fusion. DNA analysis of 5 leukemia samples (TH01_0124_S01, TH01_0134_S01, TH03_0010_S01, TH03_0010_S02, and TH03_0011_S01) identified an activating mutation in NRAS (OMIM 164790). Activation of NRAS has been associated with proliferation and self-renewal in leukemia via the activation of MEK and mTOR signaling pathways.[18] Our RNA-Seq analysis revealed overexpression of cell cycle or BCL2 (OMIM 603167)–MDM2 (OMIM 164785) pathways in TH01_0134_S01, TH03_0010_S01, TH03_0010_S02, and TH03_0011_S01; these pathways are downstream of activated RAS signaling, and their overexpression is thus consistent with the activating NRAS mutation. Notably, TH01_0124_S01 harbored subclonal activating mutations in both KRAS (OMIM 190070) and NRAS (20.6% and 29.1% mutant allele frequency based on RNA-Seq, respectively). While gene expression analysis revealed overexpression of FLT3, outlier expression associated with pathways downstream of activated RAS signaling was not found. These findings may represent either discordance between the DNA and RNA analysis or intratumor heterogeneity in this leukemia sample, already suspected based on the presence of 2 subclonal RAS mutations. DNA analysis of a diffuse intrinsic pontine glioma (DIPG), TH02_0092_S01, revealed copy number gains of KDR (OMIM 191306), KIT (OMIM 164920), and PDGFRA, located on 4q12. Notably, while KIT and PDGFRA were highly expressed but not meeting the outlier threshold (84th and 93rd percentiles in the compendium), KDR was expressed at a much lower level, in the 54th percentile. Therefore, considering expression information alongside the copy number information may be useful for prioritizing druggable targets within copy number amplicons.[19] In another DIPG sample, TH02_0091_S01 with a BRAF (OMIM 164757) p.V600E mutation, gene expression analysis revealed outlier expression of CSF1R (OMIM 164770). Recent work in melanoma showed that overexpression of CSF1R can occur in melanomas with activating BRAF or MAPK mutations and is associated with resistance to BRAF inhibitors.[20] Because the interaction of these 2 pathways in DIPG is not known, we did not consider these concordant DNA and RNA findings. An atypical teratoid rhabdoid tumor (TH03_0016_S01) and myoepithelial carcinoma (TH03_0113_S01) harbored loss of SMARCB1 (OMIM 601607) (INI1) through a frameshift mutation or protein loss of unknown mechanism detected by immunohistochemistry, respectively. Comparative gene expression analysis of both tumors revealed outlier expression of FGFR1, a promising target in rhabdoid tumors deficient in SMARCB1 (INI1).[21] Gene expression analysis of DIPG tumor TH02_0087_S01 with a loss-of-function mutation of PIK3R1 (OMIM 171833) activating the PI3K/AKT/mTOR pathway revealed overexpression of the JAK/STAT pathway. While it is unknown whether PI3K/AKT/mTOR and JAK/STAT pathways interact in DIPG, these pathways may be coactivated as a result of PI3K mutations in meningiomas.[22] Because the interaction of these 2 pathways in DIPG is not known, we did not count this sample as having concordant DNA and RNA findings. Comparative gene expression analysis of a malignant peripheral nerve sheath tumor TH06_0645_S01 and neurofibroma TH06_0646_S01 with loss of NF1 (OMIM 162200) revealed overexpression of sonic hedgehog signaling present in this tumor type.[23] We also identified overexpression of receptor tyrosine kinases ERBB3 (OMIM 190151) and EGFR (OMIM 131550) in these tumors. Finally, in a glioma TH03_0290_S01 with a BRAF p.V600E mutation, the mutation was not expressed in the RNA. In an additional case (TH01_0131_S01), an activating JAK2 mutation was supported by only a few reads, with more than 100 total read coverage in both the DNA and RNA, suggesting that the mutation may represent a subclonal event or a technical artifact. Overall, our review of 17 samples with mutated genes not themselves overexpressed by RNA-Seq analysis revealed that in 12 of the 17 samples the overexpressed genes and pathways were consistent with the detected DNA mutations, even though the mutant genes themselves were not overexpressed. In the remaining 5 samples, outlier expression was not consistent with an activating mutation detected in the sample (including the lack of a BRAF p.V600E mutant allele in the RNA in TH03_0290_S01; ambiguous evidence in TH02_0087_S01, TH01_0124_S01, and TH02_0091_S01; and possible technical issues in TH01_0131_S01).

Discussion

DNA sequencing is increasingly integrated in clinical trials to identify new molecular targets for children with incurable cancers. However, molecular targets are found for only a small number of patients, and the yield is much lower than that of similar adult cancer trials.[24] Studies focusing on pediatric cancers have shown that the percentage of patients with potentially actionable findings increases to 40% to 50% when RNA-Seq data are considered alongside DNA mutation information.[4] Herein, we described a framework for including RNA-Seq–derived gene expression information into precision medicine studies. Most notably, we show for the first time to date that such a framework can be used consistently across separate precision medicine clinical trials. To our knowledge, our work represents the first report of a translational cancer genomic analysis in which prospective patient data are analyzed by a third-party computational group, with results returned to clinicians and researchers. We found that this comparative analysis is feasible and can produce new information of potential clinical relevance in 68.8% of samples. In 36.5% of samples (27 of 74), druggable overexpressed genes and pathways were identified based on RNA analysis alone and were not apparent in the tumor DNA analysis. Our work suggests that direct investigations of the clinical utility and effectiveness of tumor RNA-Seq–derived gene expression information will be valuable, and the next phase of our project will focus on defining the incremental benefit of this approach. The findings from our work also suggest that open sharing of cancer genomic data can benefit each pediatric and young adult patient with cancer so that every family’s struggle contributes to the advancement of clinical care for the families that follow.

Clinical Implications

Although this study was not designed to assess clinical consequences, we noted associations of comparative RNA-Seq analysis findings and clinical features. For example, our analysis of a high-risk neuroblastoma sample revealed outlier expression of the ALK (OMIM 105590) kinase and CDK6 kinase (eFigure 2 in the Supplement). The outlier expression of CDK6, as well as several other cell cycle genes, was consistent with a known DNA amplification of CDK6 in this sample; however, the potential activation of ALK (OMIM 191175) was not evident before the RNA analysis. In another example, a 2-year-old boy with multifocal stage 4 hepatoblastoma metastatic to the lungs, was initially treated in the Childhood Liver Tumour Strategy Group of the International Society of Paediatric Oncology (SIOPEL-4) study,[25] followed by surgery, 2 cycles of HEP0731 regimen T protocol, then salvage therapy with 3 cycles of vincristine, irinotecan, and temozolomide and 1 cycle of gemcitabine-oxaliplatin with bevacizumab. The patient had disease progression despite these therapies. Pathological analysis showed well and poorly differentiated hepatoblastoma with fetal and embryonal elements, and immunostaining showed retention of INI1 staining and diffuse nuclear and cytoplasmic β-catenin. Foundation Medicine testing revealed the p.G34V variant in CTNNB1, previously reported in hepatocellular carcinoma as an activating mutation.[26] Comparative RNA-Seq analysis of the liver sample (TH03_0004_S04) uncovered gene expression similar to the proliferation subtype of hepatocellular carcinoma[27,28] as well as outlier expression of HSP90B1, interleukin 6, and 4 other members of the JAK/STAT pathway. The overexpression of HSP90B was previously noted in hepatocellular carcinoma.[29] The proliferative subtype of hepatocellular carcinoma is characterized by increased proliferation, high levels of serum α-fetoprotein (AFP), and chromosomal instability[27]; tumors with chromosomal instability are potentially sensitive to Aurora kinase inhibitors.[30] Consistent with the similarity of the tumor to the proliferative subtype of hepatocellular carcinoma, the patient with the TH03_0004_S04 tumor had a response to the pan-kinase inhibitor pazopanib hydrochloride, with activity against Aurora kinase A.[31] Based on the present study, after initiation of this treatment, the patient had a decline in his AFP levels from 14 036 to 1052 ng/mL at 7 weeks after initiation of the therapy (to convert AFP level to micrograms per liter, multiply by 1.0). At 10 weeks into this therapy, restaging studies showed progressive disease, and the patient was switched to therapy with ruxolitinib phosphate, without objective response by AFP levels or by imaging criteria.

Limitations

Our study has some limitations. The heterogeneous nature of the patients analyzed in this study (all types of relapsed, refractory, and rare cancers) made drawing general statements difficult. The study was not designed to directly evaluate clinical utility of comparative RNA-Seq analysis, and clinical follow-up data on these patients were not readily available.

Conclusions

Our experience suggests that it is feasible to include RNA-Seq–derived gene expression analysis in precision medicine studies and that this analysis can be harmonized across studies. We showed that RNA-Seq–derived gene expression was potentially useful for 68.8% of 144 samples compared with DNA mutation information, which was potentially useful for only 45.9% of 74 samples. Our study also highlights for the first time to date the potential clinical utility of harmonized publicly available genomic data sets. Open sharing and combined analysis of tumor RNA-Seq data from pediatric and young adult patients treated on separate clinical trials represent a feasible approach and can produce useful clinical and biological information for individual patients.
  31 in total

1.  Focal gains of VEGFA and molecular classification of hepatocellular carcinoma.

Authors:  Derek Y Chiang; Augusto Villanueva; Yujin Hoshida; Judit Peix; Philippa Newell; Beatriz Minguez; Amanda C LeBlanc; Diana J Donovan; Swan N Thung; Manel Solé; Victoria Tovar; Clara Alsinet; Alex H Ramos; Jordi Barretina; Sasan Roayaie; Myron Schwartz; Samuel Waxman; Jordi Bruix; Vincenzo Mazzaferro; Azra H Ligon; Vesna Najfeld; Scott L Friedman; William R Sellers; Matthew Meyerson; Josep M Llovet
Journal:  Cancer Res       Date:  2008-08-15       Impact factor: 12.701

2.  Molecular signatures database (MSigDB) 3.0.

Authors:  Arthur Liberzon; Aravind Subramanian; Reid Pinchback; Helga Thorvaldsdóttir; Pablo Tamayo; Jill P Mesirov
Journal:  Bioinformatics       Date:  2011-05-05       Impact factor: 6.937

3.  Target and Agent Prioritization for the Children's Oncology Group-National Cancer Institute Pediatric MATCH Trial.

Authors:  Carl E Allen; Theodore W Laetsch; Rajen Mody; Meredith S Irwin; Megan S Lim; Peter C Adamson; Nita L Seibel; D Williams Parsons; Y Jae Cho; Katherine Janeway
Journal:  J Natl Cancer Inst       Date:  2017-05-01       Impact factor: 13.506

Review 4.  Targeting PI3K/AKT/mTOR network for treatment of leukemia.

Authors:  Jessika Bertacchini; Nazanin Heidari; Laura Mediani; Silvano Capitani; Mohammad Shahjahani; Ahmad Ahmadzadeh; Najmaldin Saki
Journal:  Cell Mol Life Sci       Date:  2015-02-25       Impact factor: 9.261

5.  A pilot precision medicine trial for children with diffuse intrinsic pontine glioma-PNOC003: A report from the Pacific Pediatric Neuro-Oncology Consortium.

Authors:  Sabine Mueller; Payal Jain; Winnie S Liang; Lindsay Kilburn; Cassie Kline; Nalin Gupta; Eshini Panditharatna; Suresh N Magge; Bo Zhang; Yuankun Zhu; John R Crawford; Anu Banerjee; Kellie Nazemi; Roger J Packer; Claudia K Petritsch; Nathalene Truffaux; Alison Roos; Sara Nasser; Joanna J Phillips; David Solomon; Annette Molinaro; Angela J Waanders; Sara A Byron; Michael E Berens; John Kuhn; Javad Nazarian; Michael Prados; Adam C Resnick
Journal:  Int J Cancer       Date:  2019-04-03       Impact factor: 7.396

6.  The role of the Janus-faced transcription factor PAX5-JAK2 in acute lymphoblastic leukemia.

Authors:  Dagmar Schinnerl; Klaus Fortschegger; Maximilian Kauer; João R M Marchante; Reinhard Kofler; Monique L Den Boer; Sabine Strehl
Journal:  Blood       Date:  2014-12-16       Impact factor: 22.113

7.  Evolution of an adenocarcinoma in response to selection by targeted kinase inhibitors.

Authors:  Steven Jm Jones; Janessa Laskin; Yvonne Y Li; Obi L Griffith; Jianghong An; Mikhail Bilenky; Yaron S Butterfield; Timothee Cezard; Eric Chuah; Richard Corbett; Anthony P Fejes; Malachi Griffith; John Yee; Montgomery Martin; Michael Mayo; Nataliya Melnyk; Ryan D Morin; Trevor J Pugh; Tesa Severson; Sohrab P Shah; Margaret Sutcliffe; Angela Tam; Jefferson Terry; Nina Thiessen; Thomas Thomson; Richard Varhol; Thomas Zeng; Yongjun Zhao; Richard A Moore; David G Huntsman; Inanc Birol; Martin Hirst; Robert A Holt; Marco A Marra
Journal:  Genome Biol       Date:  2010-08-09       Impact factor: 13.583

8.  Apg-2 has a chaperone-like activity similar to Hsp110 and is overexpressed in hepatocellular carcinomas.

Authors:  Kazuhisa Gotoh; Kohsuke Nonoguchi; Hiroaki Higashitsuji; Yoshiyuki Kaneko; Toshiharu Sakurai; Yasuhiko Sumitomo; Katsuhiko Itoh; John R Subjeck; Jun Fujita
Journal:  FEBS Lett       Date:  2004-02-27       Impact factor: 4.124

9.  Integrated analysis of gene expression and copy number identified potential cancer driver genes with amplification-dependent overexpression in 1,454 solid tumors.

Authors:  Keiichi Ohshima; Keiichi Hatakeyama; Takeshi Nagashima; Yuko Watanabe; Kaori Kanto; Yuki Doi; Tomomi Ide; Yuji Shimoda; Tomoe Tanabe; Sumiko Ohnami; Shumpei Ohnami; Masakuni Serizawa; Koji Maruyama; Yasuto Akiyama; Kenichi Urakami; Masatoshi Kusuhara; Tohru Mochizuki; Ken Yamaguchi
Journal:  Sci Rep       Date:  2017-04-04       Impact factor: 4.379

10.  Integrative Clinical Sequencing in the Management of Refractory or Relapsed Cancer in Youth.

Authors:  Rajen J Mody; Yi-Mi Wu; Robert J Lonigro; Xuhong Cao; Sameek Roychowdhury; Pankaj Vats; Kevin M Frank; John R Prensner; Irfan Asangani; Nallasivam Palanisamy; Jonathan R Dillman; Raja M Rabah; Laxmi Priya Kunju; Jessica Everett; Victoria M Raymond; Yu Ning; Fengyun Su; Rui Wang; Elena M Stoffel; Jeffrey W Innis; J Scott Roberts; Patricia L Robertson; Gregory Yanik; Aghiad Chamdin; James A Connelly; Sung Choi; Andrew C Harris; Carrie Kitko; Rama Jasty Rao; John E Levine; Valerie P Castle; Raymond J Hutchinson; Moshe Talpaz; Dan R Robinson; Arul M Chinnaiyan
Journal:  JAMA       Date:  2015-09-01       Impact factor: 56.272

View more
  17 in total

Review 1.  Clinical applications of artificial intelligence and machine learning in cancer diagnosis: looking into the future.

Authors:  Muhammad Javed Iqbal; Zeeshan Javed; Haleema Sadia; Ijaz A Qureshi; Asma Irshad; Rais Ahmed; Kausar Malik; Shahid Raza; Asif Abbas; Raffaele Pezzani; Javad Sharifi-Rad
Journal:  Cancer Cell Int       Date:  2021-05-21       Impact factor: 5.722

Review 2.  Transcriptome analysis provides critical answers to the "variants of uncertain significance" conundrum.

Authors:  Mackenzie D Postel; Julie O Culver; Charité Ricker; David W Craig
Journal:  Hum Mutat       Date:  2022-05-18       Impact factor: 4.700

3.  CPDR: An R Package of Recommending Personalized Drugs for Cancer Patients by Reversing the Individual's Disease-Related Signature.

Authors:  Ruzhen Chen; Xun Wang; Xinru Deng; Lanhui Chen; Zhongyang Liu; Dong Li
Journal:  Front Pharmacol       Date:  2022-06-20       Impact factor: 5.988

4.  Bayesian Framework for Detecting Gene Expression Outliers in Individual Samples.

Authors:  John Vivian; Jordan M Eizenga; Holly C Beale; Olena M Vaske; Benedict Paten
Journal:  JCO Clin Cancer Inform       Date:  2020-02

5.  Whole genome, transcriptome and methylome profiling enhances actionable target discovery in high-risk pediatric cancer.

Authors:  Marie Wong; Chelsea Mayoh; Loretta M S Lau; David S Ziegler; Paul G Ekert; Mark J Cowley; Dong-Anh Khuong-Quang; Mark Pinese; Amit Kumar; Paulette Barahona; Emilie E Wilkie; Patricia Sullivan; Rachel Bowen-James; Mustafa Syed; Iñigo Martincorena; Federico Abascal; Alexandra Sherstyuk; Noemi A Bolanos; Jonathan Baber; Peter Priestley; M Emmy M Dolman; Emmy D G Fleuren; Marie-Emilie Gauthier; Emily V A Mould; Velimir Gayevskiy; Andrew J Gifford; Dylan Grebert-Wade; Patrick A Strong; Elodie Manouvrier; Meera Warby; David M Thomas; Judy Kirk; Katherine Tucker; Tracey O'Brien; Frank Alvaro; Geoffry B McCowage; Luciano Dalla-Pozza; Nicholas G Gottardo; Heather Tapp; Paul Wood; Seong-Lin Khaw; Jordan R Hansford; Andrew S Moore; Murray D Norris; Toby N Trahair; Richard B Lock; Vanessa Tyrrell; Michelle Haber; Glenn M Marshall
Journal:  Nat Med       Date:  2020-10-05       Impact factor: 53.440

Review 6.  Cancer microenvironment and genomics: evolution in process.

Authors:  Stanley P Leong; Isaac P Witz; Orit Sagi-Assif; Sivan Izraely; Jonathan Sleeman; Brian Piening; Bernard A Fox; Carlo B Bifulco; Rachel Martini; Lisa Newman; Melissa Davis; Lauren M Sanders; David Haussler; Olena M Vaske; Marlys Witte
Journal:  Clin Exp Metastasis       Date:  2021-05-10       Impact factor: 5.150

7.  Brain Tumor Biobank Development for Precision Medicine: Role of the Neurosurgeon.

Authors:  Emilie Darrigues; Benjamin W Elberson; Annick De Loose; Madison P Lee; Ebonye Green; Ashley M Benton; Ladye G Sink; Hayden Scott; Murat Gokden; John D Day; Analiz Rodriguez
Journal:  Front Oncol       Date:  2021-04-26       Impact factor: 6.244

8.  African Americans and European Americans exhibit distinct gene expression patterns across tissues and tumors associated with immunologic functions and environmental exposures.

Authors:  Urminder Singh; Kyle M Hernandez; Bruce J Aronow; Eve Syrkin Wurtele
Journal:  Sci Rep       Date:  2021-05-10       Impact factor: 4.379

9.  Synthetic lethality-mediated precision oncology via the tumor transcriptome.

Authors:  Joo Sang Lee; Nishanth Ulhas Nair; Gal Dinstag; Lesley Chapman; Youngmin Chung; Kun Wang; Sanju Sinha; Hongui Cha; Dasol Kim; Alexander V Schperberg; Ajay Srinivasan; Vladimir Lazar; Eitan Rubin; Sohyun Hwang; Raanan Berger; Tuvik Beker; Ze'ev Ronai; Sridhar Hannenhalli; Mark R Gilbert; Razelle Kurzrock; Se-Hoon Lee; Kenneth Aldape; Eytan Ruppin
Journal:  Cell       Date:  2021-04-14       Impact factor: 66.850

10.  Hydra: A mixture modeling framework for subtyping pediatric cancer cohorts using multimodal gene expression signatures.

Authors:  Jacob Pfeil; Lauren M Sanders; Ioannis Anastopoulos; A Geoffrey Lyle; Alana S Weinstein; Yuanqing Xue; Andrew Blair; Holly C Beale; Alex Lee; Stanley G Leung; Phuong T Dinh; Avanthi Tayi Shah; Marcus R Breese; W Patrick Devine; Isabel Bjork; Sofie R Salama; E Alejandro Sweet-Cordero; David Haussler; Olena Morozova Vaske
Journal:  PLoS Comput Biol       Date:  2020-04-10       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.