Literature DB >> 25392694

Learning dysregulated pathways in cancers from differential variability analysis.

Bahman Afsari¹, Donald Geman², Elana J Fertig³.

Abstract

Analysis of gene sets can implicate activity in signaling pathways that is responsible for cancer initiation and progression, but is not discernible from the analysis of individual genes. Multiple methods and software packages have been developed to infer pathway activity from expression measurements for set of genes targeted by that pathway. Broadly, three major methodologies have been proposed: over-representation, enrichment, and differential variability. Both over-representation and enrichment analyses are effective techniques to infer differentially regulated pathways from gene sets with relatively consistent differentially expressed (DE) genes. Specifically, these algorithms aggregate statistics from each gene in the pathway. However, they overlook multivariate patterns related to gene interactions and variations in expression. Therefore, the analysis of differential variability of multigene expression patterns can be essential to pathway inference in cancers. The corresponding methodologies and software packages for such multivariate variability analysis of pathways are reviewed here. We also introduce a new, computationally efficient algorithm, expression variation analysis (EVA), which has been implemented along with a previously proposed algorithm, Differential Rank Conservation (DIRAC), in an open source R package, gene set regulation (GSReg). EVA inferred similar pathways as DIRAC at reduced computational costs. Moreover, EVA also inferred different dysregulated pathways than those identified by enrichment analysis.

Entities: Chemical Disease Gene Species

Keywords: gene expression; gene set analysis; multivariate analysis; variability analysis

Year: 2014 PMID： 25392694 PMCID： PMC4218688 DOI： 10.4137/CIN.S14066

Source DB: PubMed Journal: Cancer Inform ISSN： 1176-9351

Introduction

Cellular signaling generates a chain of protein–protein interactions, often terminating in the activation of transcription factors. Such signaling in molecular pathways induces and advances many human cancers. In principle, targeting the specific signaling pathways responsible for individual malignancies would yield an effective treatment. However, identifying the key signaling pathways relies on first inferring the signaling activity in that tumor. Ideally, coordinated changes in the phosphorylation state in network proteins could be measured to directly implicate specific signaling pathways in a malignancy, and the technology to measure such protein states is rapidly advancing. In the meantime, however, many algorithms use the existing transcriptional data to infer differentially regulated pathways. The accuracy of such inference relies in large part on the sets of genes annotated to each pathway (reviewed in Ref. 1–6). In analyses of gene expression data, it is essential to select sets of genes whose expression is altered because of pathway activation. For example, the TRANScription FACtor (TRANSFAC) database2 assembles experimentally validated sets of genes resulting from transcription factor activation. Using these data with set statistics to infer coordinated changes in targets of transcription factors downstream of cell signaling pathways has been an effective substitute for directly inferring differential pathway signaling (eg, Ochs et al.3, and Fertig et al.4). Regardless of the measurement technology, inference of signaling pathways thus requires statistical techniques to be able to account for changes in multiple molecular species. Historically, analysis of differential pathway regulation from transcriptional data has been divided into two major classes of methodologies (reviewed in Irizzary et al.5): over-representation methods and enrichment methods. Over-representation methods compare sets of genes annotated to pathways to a list of those genes that are significantly differentially expressed (DE) between two phenotypes. Enrichment methods employ a “soft” version of over-representation based on a summary statistic to characterize the level of differential expression of genes in the pathway relative to a null distribution. These methods have been extended to infer pathway members or networks from transcriptional data (eg, Tarca et al.6). Both over-representation and enrichment methods for detecting differential pathway regulation are robust at inferring consistent up- or down-regulation of pathway genes. However, alterations in cell signaling pathways may be associated with complex changes in gene expression because of pathway interactions.7 Moreover, expression in individual genes is highly variable in human tumors8,9 in part because of the distinct evolution of individual tumors from the same cancer subtype. Thus, individual genes may contribute differently to alterations in the same pathway. As a result, pathways that are dysregulated in human tumors may exhibit complex, multivariate changes in variability that are not captured by the aggregation of statistics of individual genes in over-representation or enrichment analyses. Here, we review more recent methods for detecting differential regulation based directly on multivariate measures of pathway variability. Specifically, we focus on Differential Rank Conservation (DIRAC),10 and a more computationally efficient alternative algorithm, expression variation analysis (EVA). We also introduce a new R package, gene set regulation (GSReg),11 that implements these algorithms to facilitate inference of pathway dysregulation.

Pathway Analysis Methodologies

In this section, we briefly review algorithms for pathway analysis from transcriptional data. Currently, all such algorithms identify significantly perturbed pathways by applying gene set statistics to compare gene expression of pathway targets in one phenotype to gene expression of pathway targets in another phenotype. As a result, they rely critically on the numerous curated databases that annotate genes to pathways.1 Regardless of the pathway targets, algorithms for pathway analysis can be divided into three major classes: over-representation, enrichment, and differential variability analyses. We list software that implements each technique in Table 1 and refer the reader to Irizarry et al.5, Khatri et al.12, and Maciejewski.13 for more reviews and comparisons of over-representation and enrichment analyses. An overview is provided in Figure 1.

Table 1

Examples of software available for gene set analysis, divided into three major families of algorithms: over-representation, enrichment, and differential variability analyses.

ANALYSIS FAMILY	METHODS	AVAILABILITY	REFERENCE
Over-representation	GeneMAPP	http://www.genmapp.org/	19,20
	GoMiner	http://discover.nci.nih.gov/gominer/	21
	GatiGo	http://bioinfo.cipf.es/babelomicswiki/tool:fatigo	22
	Gostat	http://gostat.wehi.edu.au/	23
	FunAssociate	http://llama.mshri.on.ca/funcassociate/	24
	GOToolBox	http://genome.crg.es/GOToolBox/	25
	GeneMergeGOEAST	http://www.oeb.harvard.edu/faculty/hartl/old_site/lab/publications/Genemerge.html	26
	ClueGo	http://omicslab.genetics.ac.cn/GOEAST/	27
	FunSpec	http://www.ici.upmc.fr/cluego/	28
	Go:TermFinder	http://funspec.med.utoronto.ca/	29
	WebGestalt	http://go.princeton.edu/cgi-bin/GoTermFinder	30
	agriGo	http://bioinfo.vanderbilt.edu/webgestalt/	31
		http://bioinfo.cau.edu.cn/agriGo/	32

Enrichment	GSEA	http://www.broadinstitute.org/gsea	16
	SAFE	Bioconductor (safe)	33
	LIMMA	Bioconductor (LIMMA)	34
	DAVID	http://david.abcc.ncifcrf.gov/list.jsp	35
	TopGO	Bioconductor (topGo)	36
	Gage	Bioconductor (gage)	37
	sigPathway	Bioconductor (sigPathway)	38

Differential variability	DIRAC	Bioconductor (GS-Reg)	10
	EVA	Bioconductor (GS-Reg)	39
	GINEA	No implementation	40
	IB-GSA	No implementation	18
	MAVTgsa	CRAN	41
	synergy	http://www.biomedcentral.com/content/supplementary/1752-0509-2-10-s3.pdf	42

Figure 1

Pathway analysis methodologies from gene expression: (A) Over-representation analysis first performs a statistical test for each gene by comparing expression values in phenotypes to identify a set of significantly DE genes, obtaining a gene count N. The procedure then counts the number of DE genes that are also annotated to a specified pathway (N) and calculates a P-value for enrichment of that pathway by testing if N is unusually high relative to N and N (the number of genes in the pathway). (B) Enrichment analysis first assigns an individual DE score to each of the genes annotated to a pathway, and aggregates these into a pathway score Z. A similar score is computed for a null distribution, Z. For example, this null distribution may be defined empirically from the DE score for alternative sets of genes or permuted sample labels. Enrichment analysis forms a pathway statistic by comparing the distribution of DE scores in Z to that of DE scores in Z. (C) Differential variability analysis defines a statistic to measure variability of the expression of pathway genes for samples from a given phenotype, denoted by V1 and V2 for phenotypes 1 and 2, respectively. If the variability between two phenotypes is significantly high (ie, |V1 − V2| >> 0), the pathway is identified as dysregulated.

The first methodology, over-representation analysis, assesses similarity between the set of all DE genes and the set of genes annotated to a pathway (Fig. 1A), and was introduced in Khatri et al.14 First, significantly DE genes between specified phenotypes are identified. For example, a set of DE genes may be defined by computing a Wilcoxon test to compare expression in two phenotypes for each gene measured and selecting significant genes as those having a false discovery rate below a threshold value of 5%. Then a gene set statistic is calculated for each pathway by applying a statistical test (eg, Fisher’s test) that compares each set of pathway genes to the set of DE genes. Pathways whose members are significantly enriched for DE genes are called significant. The methods listed in top of the Table 1 are examples from this family. Whereas over-representation analysis compares discrete sets of genes, the second methodology – enrichment analysis – formulates set statistics that summarize the overall level of differential expression for the pathway genes between the phenotypes. The first method of this class was gene set enrichment analysis (GSEA).15 Generally, enrichment analysis calculates the differential activity of genes across phenotypes using a differential expression statistic (eg, a t- or Z-statistic). Then the differential activity of a pathway is calculated by applying another statistic (eg, Kolmogrov–Smirnov test, sum, mean, maxmean statistic, etc.) to compare the differential expression statistic for genes in the pathway to a null distribution of differential expression statistics, often defined from alternative sets of genes or permuting sample labels. The algorithms in the middle of Table 1 represent examples from this family accompanied by the software implementing them. A full review of these algorithms is provided in Khatri.12 Although they use different statistics, both over- representation and enrichment methods infer coordinated, average expression changes between phenotypes in sets of genes annotated to a pathway. Because they do not rely on a hard threshold, enrichment methods are more sensitive than over-representation methods at inferring coordinated expression changes in sets of genes. However, they may yield many more false positives. Regardless of their relative advantages, the false-positive rate of both tests may be dependent upon the number of genes in the set.41 Moreover, both methods perform the best when changes in genes annotated to a pathway are consistent and relatively homogeneous in each phenotype; for example, sharply different expression values for a given gene are seen in most samples. However, tumor pathway dysregulation based on interactions among multiple genes may cause differential variability in gene expression between phenotypes. Therefore, the third family of the methods, differential variability analysis, is a multivariate approach that assesses variability within a pathway for a given phenotype and then compares these measures across phenotypes. This emerging methodology, pioneered in Eddy et al.10, and Zhang et al.38, has been extended to a broad set of algorithms summarized in Table 1, and is the focus of the remainder of this paper.

Differential Variability Analysis

Differential variability analysis first measures the level of variability in gene expression in a pathway for a given phenotype and then compares these levels for different phenotypes to determine differential pathway regulation. For example, different pathway genes may have expression outliers in distinct tumor samples relative to normal controls, captured with methods such as Open Grid Services Architecture (OGSA).42 Such distinct alterations in individual tumors may also increase variability of expression in individual genes, motivating approaches that apply over-representation or enrichment analysis to variability statistics for individual genes.43 Nonetheless, in general, alterations in expression may depend strongly on interactions among the genes in the pathway. Consequently, new algorithms employing multivariate statistics are emerging in order to model such complex shifts in variability from one phenotype to another. Zhang et al.38 developed one of the first methods of this type. Their algorithm calculates correlations between all pairs of genes within a pathway given a phenotype as a measure of pairwise interactions, and then a z-score for the difference in pairwise interactions between two phenotypes. To summarize the change in the correlation pattern, the algorithm applies a “maxmean” statistic to compute the maximum of the mean of positive and negative z-scores corresponding to all gene pairs in the pathway and then ranks pathways by this maxmean statistic. Watkinson et al.40 extended this algorithm by defining synergy between pairs of genes, using an information-theoretic approach. Recently, Liu et al.37 have developed a more sophisticated analysis of variability called gene interaction enrichment and network analysis (GIENA). Instead of correlation, they consider four possible statistics on the expression of two genes: their sum, difference, maximum, and minimum. These operations are assumed to correspond to gene pair cooperation, competition, redundancy, and dependency, respectively. Thereafter, pathway activity is summarized by applying a maxmean statistic over all pairs of interactions within the gene set, similar to Zhang et al.38 In contrast to both Liu et al.37, and Zhang et al.38, Ochs et al.42 provide a formulation for pathway analysis based upon outliers to account for pathway dysregulation and tumor heterogeneity, thereby utilizing a simpler algorithm that does not rely on selecting a variability statistic. Regrettably, none of the algorithms described above provide a robust software package to facilitate application to new data. They also rely on continuous, normalized gene expression measurements. We have previously shown that rank-based techniques (ie, methods that depend only on the relative ordering of expression values) (i) are more robust to the preprocessing and normalization of data44 than techniques relying on normalized gene expression, (ii) are competitive with the best classification methods in discriminating among phenotypes (eg, Geman et al.45), and (iii) can be far simpler to explain and interpret in biological terms.46,47 Therefore, Eddy et al proposed DIRAC10 as an ordering-based method for differential variability analysis. Given a pathway and a phenotype, DIRAC generates a binary template (one component for each pair of genes) for the ordering of the expression values for the genes in the pathway, and then calculates the average “distance” between training samples and the template as the measure of the pathway variability of the phenotype. The “distance” used in DIRAC involves the Hamming distance over the pairwise comparisons. Permutation tests are used to estimate P-values associated with differences in this variability score between phenotypes, and pathways with significant P-values are identified as perturbed. Consistent with increased complexity in more advanced stages of diseases,10 they found that most dysregulated pathways have higher variability in phenotypes with worse prognosis. Although DIRAC is effective in inferring dysregulated pathways, the permutation test on which it is based is computationally inefficient, and becomes infeasible when applied to large numbers of pathways and samples. Therefore, we propose an alternative approach called EVA.36 Given a phenotype and pathway, EVA measures the average distance between two randomly chosen expression profiles for the phenotype. More specifically, the EVA variability statistic is the expected Kendall-τ distance48 between the rank vectors corresponding to two independent copies of expression profiles over the pathway. Kendall-τ is a distance that quantifies the difference between the orderings of two vectors. In this case, the permutation distance is defined for the rank vectors of gene expression profiles for pathway genes. The Kendall-τ distance between the two gene expression profiles is essentially the number of disagreeing comparisons between all pairs of genes in the pathway, analogous to the change of rank in DIRAC. To estimate a variability statistic from samples in each phenotype, the EVA algorithm then averages the Kendall-τ distance between each pair of samples from that phenotype. These variability statistics are then compared between two phenotypes for each pathway to estimate pathway dysregulation between phenotypes. The P-values for pathway deregulation are computed analytically from the difference between the empirical Kendall-τ statistics using an approximation for the asymptotic distribution from the theory of U-statistics described in detail in Afsari et al.36 A general description about U-statistics can be found in Van der Vaart.49

GSReg Package

We develop GSReg R package to perform differential variability analysis using DIRAC and EVA, available through Bioconductor. Here, we demonstrate our software by reproducing the results from the DIRAC paper and replicating these results with EVA. Since the original data of DIRAC paper was in Matlab format, we provide the data in a complementary R package, GSBenchMark,50 also available through Bioconductor. Figure 2 shows the results of variability pathway analysis comparing head and neck squamous cell carcinoma samples to matched normal controls.51 Figure 2 compares variability statistics of pathways in tumors (y-axis) to normal controls (x-axis), revealing that most of the dysregulated pathways have higher variability in tumor samples than normal samples. This was the general trend found for DIRAC10 (Fig. 2A) and persists for EVA (Fig. 2B). In total, DIRAC found 48 dysregulated pathways and EVA discovered 64; there are 45 pathways in common and 68 in total. The general trend that most of the dysregulated pathways have higher variability in the phenotype with poor prognosis remains true for EVA in other datasets compared in DIRAC and provided in GSBenchMark (results not shown).

Figure 2

Comparison of dysregulated pathways identified by (A) DIRAC and (B) EVA in comparing head and neck squamous cell carcinoma samples (y-axis) and normal samples (x-axis). Hence, the pathways shown above the line are those with significantly (P-value <0.05) higher variability in tumor than normal samples, and those below the line have significantly higher variability in normal samples.

DIRAC and EVA have been shown mathematically similar.36 The main advantages of the EVA are efficiency in calculation and a more straightforward interpretation that does not involve a “template” but rather is simply the average distance between two samples. To illustrate the computational advantage, for the head and neck cancer data, using a Lenovo Think-Pad with Core™ i7–3720QM Intel CPU at 2.6 GHz and only 1000 permutations of phenotype labels, the DIRAC analysis required 207 seconds while the EVA analytical computation only took 0.3 seconds. Figure 3 compares the corresponding P-values of the differential variability measure generated by DIRAC and EVA. These P-values are highly correlated, with a 0.88 Pearson correlation coefficient (P-value <2 × 10−16). Taken together, these results indicate that EVA can be used as a more efficient alternative for DIRAC analysis.

Figure 3

P-value comparison of DIRAC and EVA: Each circle represents a pathway. x-axis and y-axis represent DIRAC and EVA P-values, respectively.

To illustrate the difference between the outcomes of EVA and enrichment analysis, we chose a well-known enrichment method, the Wilcoxon gene set test implemented in Linear Models for Microarray Data (LIMMA).31 For these analyses, we apply the Benjamini–Hotchberg procedure52 to account for multiple hypothesis testing, which was not feasible in the previous comparison with the DIRAC analysis because of the relatively coarse resolution of P-values from the computationally intensive permutation test. In the case of the head and neck squamous tumors, both LIMMA and EVA infer a similar number of differentially regulated pathways (11 and 21, respectively). However, consistent with the test statistic, the significant pathways from EVA have consistently higher variability in the tumor group than those identified with the enrichment statistic. On the other hand, if we apply LIMMA on the univariate F-test statistic for the difference of variances, LIMMA does not identify any pathway as dysregulated. This shows that analyses based on differential variability and enrichment may result in different outcomes.

Conclusion

Cancer is known to be the result of the perturbations in signaling pathways. Many algorithms have been proposed to identify and analyze these perturbations from transcriptional data. We reviewed three major families of pathway analysis methods, each having different criteria for calling a pathway perturbed: over-representation of DE genes, enrichment of large DE statistics in pathway genes, and significant difference in variability of gene expression. This last class of methods is particularly adept at inferring dysregulated pathways with differential variability in multivariate gene expression patterns. Here, we implemented one such variability analysis algorithm, DIRAC,10 and a novel, more efficient alternative EVA in an R package GSReg. For future work, methods that incorporate more information about biological mechanism may enhance interpretation and reproducibility of learned dysregulated pathways. Also, methods that can assess variability across more than two phenotypes are needed to infer dysregulated pathways in distinct tumor subtypes. Moreover, existing methods for gene set analysis either detect the differential expression or differential variability to identify differential regulation across phenotypes. A more versatile methodology might be a combination of both types of pathway analyses. These combinations may be implemented by using the Kendall-τ distance to compare two independent samples, but from two different phenotypes. Thus, extending the sample comparisons in EVA would provide an algorithm to compare pathway variability within phenotypes with pathway variability between phenotypes.

41 in total

1. GeneMerge--post-genomic analysis, data mining, and hypothesis testing.

Authors: Cristian I Castillo-Davis; Daniel L Hartl
Journal: Bioinformatics Date: 2003-05-01 Impact factor: 6.937

2. Selection and validation of differentially expressed genes in head and neck cancer.

Authors: M A Kuriakose; W T Chen; Z M He; A G Sikora; P Zhang; Z Y Zhang; W L Qiu; D F Hsu; C McMunn-Coffran; S M Brown; E M Elango; M D Delacure; F A Chen
Journal: Cell Mol Life Sci Date: 2004-06 Impact factor: 9.261

3. Analyzing gene expression data in terms of gene sets: methodological issues.

Authors: Jelle J Goeman; Peter Bühlmann
Journal: Bioinformatics Date: 2007-02-15 Impact factor: 6.937

4. A novel signaling pathway impact analysis.

Authors: Adi Laurentiu Tarca; Sorin Draghici; Purvesh Khatri; Sonia S Hassan; Pooja Mittal; Jung-Sun Kim; Chong Jai Kim; Juan Pedro Kusanovic; Roberto Romero
Journal: Bioinformatics Date: 2008-11-05 Impact factor: 6.937

5. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors: Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal: Proc Natl Acad Sci U S A Date: 2005-09-30 Impact factor: 11.205

6. Detection of treatment-induced changes in signaling pathways in gastrointestinal stromal tumors using transcriptomic data.

Authors: Michael F Ochs; Lori Rink; Chi Tarn; Sarah Mburu; Takahiro Taguchi; Burton Eisenberg; Andrew K Godwin
Journal: Cancer Res Date: 2009-11-10 Impact factor: 12.701

7. agriGO: a GO analysis toolkit for the agricultural community.

Authors: Zhou Du; Xin Zhou; Yi Ling; Zhenhai Zhang; Zhen Su
Journal: Nucleic Acids Res Date: 2010-04-30 Impact factor: 16.971

8. Identification of gene interactions associated with disease from gene expression data using synergy networks.

Authors: John Watkinson; Xiaodong Wang; Tian Zheng; Dimitris Anastassiou
Journal: BMC Syst Biol Date: 2008-01-30

9. Identifying gene interaction enrichment for gene expression data.

Authors: Jigang Zhang; Jian Li; Hong-Wen Deng
Journal: PLoS One Date: 2009-11-30 Impact factor: 3.240

10. GAGE: generally applicable gene set enrichment for pathway analysis.

Authors: Weijun Luo; Michael S Friedman; Kerby Shedden; Kurt D Hankenson; Peter J Woolf
Journal: BMC Bioinformatics Date: 2009-05-27 Impact factor: 3.169

22 in total

1. A Novel Functional Splice Variant of AKT3 Defined by Analysis of Alternative Splice Expression in HPV-Positive Oropharyngeal Cancers.

Authors: Theresa Guo; Akihiro Sakai; Bahman Afsari; Michael Considine; Ludmila Danilova; Alexander V Favorov; Srinivasan Yegnasubramanian; Dylan Z Kelley; Emily Flam; Patrick K Ha; Zubair Khan; Sarah J Wheelan; J Silvio Gutkind; Elana J Fertig; Daria A Gaykalova; Joseph Califano
Journal: Cancer Res Date: 2017-07-21 Impact factor: 12.701

2. Characterization of Alternative Splicing Events in HPV-Negative Head and Neck Squamous Cell Carcinoma Identifies an Oncogenic DOCK5 Variant.

Authors: Chao Liu; Theresa Guo; Guorong Xu; Akihiro Sakai; Shuling Ren; Takahito Fukusumi; Mizuo Ando; Sayed Sadat; Yuki Saito; Zubair Khan; Kathleen M Fisch; Joseph Califano
Journal: Clin Cancer Res Date: 2018-06-26 Impact factor: 12.531

3. Extracting the Strongest Signals from Omics Data: Differentially Expressed Pathways and Beyond.

Authors: Galina Glazko; Yasir Rahmatallah; Boris Zybailov; Frank Emmert-Streib
Journal: Methods Mol Biol Date: 2017

4. Single-Cell RNA-Seq Analysis of Retinal Development Identifies NFI Factors as Regulating Mitotic Exit and Late-Born Cell Specification.

Authors: Brian S Clark; Genevieve L Stein-O'Brien; Fion Shiau; Gabrielle H Cannon; Emily Davis-Marcisak; Thomas Sherman; Clayton P Santiago; Thanh V Hoang; Fatemeh Rajaii; Rebecca E James-Esposito; Richard M Gronostajski; Elana J Fertig; Loyal A Goff; Seth Blackshaw
Journal: Neuron Date: 2019-05-22 Impact factor: 17.173

5. Integrated Analysis of Whole-Genome ChIP-Seq and RNA-Seq Data of Primary Head and Neck Tumor Samples Associates HPV Integration Sites with Open Chromatin Marks.

Authors: Dylan Z Kelley; Emily L Flam; Evgeny Izumchenko; Ludmila V Danilova; Hildegard A Wulf; Theresa Guo; Dzov A Singman; Bahman Afsari; Alyza M Skaist; Michael Considine; Jane A Welch; Elena Stavrovskaya; Justin A Bishop; William H Westra; Zubair Khan; Wayne M Koch; David Sidransky; Sarah J Wheelan; Joseph A Califano; Alexander V Favorov; Elana J Fertig; Daria A Gaykalova
Journal: Cancer Res Date: 2017-09-25 Impact factor: 12.701

6. Differential Variation Analysis Enables Detection of Tumor Heterogeneity Using Single-Cell RNA-Sequencing Data.

Authors: Emily F Davis-Marcisak; Thomas D Sherman; Pranay Orugunta; Genevieve L Stein-O'Brien; Sidharth V Puram; Evanthia T Roussos Torres; Alexander C Hopkins; Elizabeth M Jaffee; Alexander V Favorov; Bahman Afsari; Loyal A Goff; Elana J Fertig
Journal: Cancer Res Date: 2019-07-23 Impact factor: 12.701

7. Effects of β-catenin on differentially expressed genes in multiple myeloma.

Authors: Hui Chen; Wei Chai; Bin Li; Ming Ni; Guo-Qiang Zhang; Hua-Wei Liu; Zhuo Zhang; Ji-Ying Chen; Yong-Gang Zhou; Yan Wang
Journal: J Huazhong Univ Sci Technolog Med Sci Date: 2015-07-31

8. MiRImpact, a new bioinformatic method using complete microRNA expression profiles to assess their overall influence on the activity of intracellular molecular pathways.

Authors: Alina V Artcibasova; Mikhail B Korzinkin; Maksim I Sorokin; Peter V Shegay; Alex A Zhavoronkov; Nurshat Gaifullin; Boris Y Alekseev; Nikolay V Vorobyev; Denis V Kuzmin; Аndrey D Kaprin; Nikolay M Borisov; Anton A Buzdin
Journal: Cell Cycle Date: 2016 Impact factor: 4.534

9. Splice Expression Variation Analysis (SEVA) for inter-tumor heterogeneity of gene isoform usage in cancer.

Authors: Bahman Afsari; Theresa Guo; Michael Considine; Liliana Florea; Luciane T Kagohara; Genevieve L Stein-O'Brien; Dylan Kelley; Emily Flam; Kristina D Zambo; Patrick K Ha; Donald Geman; Michael F Ochs; Joseph A Califano; Daria A Gaykalova; Alexander V Favorov; Elana J Fertig
Journal: Bioinformatics Date: 2018-06-01 Impact factor: 6.937

10. Subtype prediction in pediatric acute myeloid leukemia: classification using differential network rank conservation revisited.

Authors: Askar Obulkasim; Maarten Fornerod; Michel C Zwaan; Dirk Reinhardt; Marry M van den Heuvel-Eibrink
Journal: BMC Bioinformatics Date: 2015-09-23 Impact factor: 3.169