Literature DB >> 29180805

A fast approach to detect gene-gene synergy.

Pengwei Xing1,2, Yuan Chen1,2, Jun Gao3, Lianyang Bai4, Zheming Yuan5,6.   

Abstract

Selecting informative genes, including individually discriminant genes and synergic genes, from expression data has been useful for medical diagnosis and prognosis. Detecting synergic genes is more difficult than selecting individually discriminant genes. Several efforts have recently been made to detect gene-gene synergies, such as dendrogram-based I(X 1; X 2; Y) (mutual information), doublets (gene pairs) and MIC(X 1; X 2; Y) based on the maximal information coefficient. It is unclear whether dendrogram-based I(X 1; X 2; Y) and doublets can capture synergies efficiently. Although MIC(X 1; X 2; Y) can capture a wide range of interaction, it has a high computational cost triggered by its 3-D search. In this paper, we developed a simple and fast approach based on abs conversion type (i.e. Z = |X 1 - X 2|) and t-test, to detect interactions in simulation and real-world datasets. Our results showed that dendrogram-based I(X 1; X 2; Y) and doublets are helpless for discovering pair-wise gene interactions, our approach can discover typical pair-wise synergic genes efficiently. These synergic genes can reach comparable accuracy to the individually discriminant genes using the same number of genes. Classifier cannot learn well if synergic genes have not been converted properly. Combining individually discriminant and synergic genes can improve the prediction performance.

Entities:  

Mesh:

Year:  2017        PMID: 29180805      PMCID: PMC5703944          DOI: 10.1038/s41598-017-16748-w

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Selection of informative genes, including individually discriminant genes and synergic genes, from expression data has been useful for medical diagnosis and prognosis. Individual gene ranking techniques such as t-test[1] etc. can typically produce a “list of genes” that are correlated with disease[2]. However, they cannot provide insights into the interaction of these genes. According to information theory, the pair-wise interactions I (X 1; X 2; Y)[3] is defined aswhere I is the symbol for mutual information, I (X 1; Y) is the individual effect of gene X 1 relative to phenotype Y, I (X 2; Y) is the individual effect of gene X 2 relative to Y, and I (X 1, X 2; Y) is the joint effect of X 1 and X 2 relative to Y. A positive value of I (X 1; X 2; Y) indicates synergy, while a negative value of I (X 1; X 2; Y) indicates redundancy. Figure 1 illustrates four typical pair-wise synergies examples from Watkinson et al.[4] (Fig. 1A,B) and Chen et al.[5] (Fig. 1C,D). Figure 1A–C are generated by simulated data, and Fig. 1D is generated by real-world data. As an example, when the RSG9 or DIAPH2 is evaluated individually and separately, neither of these two genes is correlated with cancer. Therefore, genes RGS9 and DIAPH2 would not be present in the output of any “individual gene ranking” techniques. However, when the pair-wise interactions is evaluated, the genes RGS9 -DIAPH2 are sufficient to distinguish cancer from normal samples (Fig. 1D).
Figure 1

Four typical pair-wise synergies examples. Red and green dots represent cancer and normal samples, respectively.

Four typical pair-wise synergies examples. Red and green dots represent cancer and normal samples, respectively. Detecting synergic genes is more difficult than selecting individually discriminant genes. Several efforts have recently been made to detect gene–gene synergies. These efforts often fall into one of the two strategies. One is the non-conversion strategy, which uses formula (1) directly to measure I(X 1; X 2; Y)[4] or uses the maximal information coefficient directly to measure MIC(X 1; X 2; Y)[5]. The way to discretize continuous variable is the key to estimate the value of mutual information. Binarization, such as the dendrogram-based[4] technique, simplifies the estimation, and provides simple logical functions in the connection of the genes. However, it may result in information loss and estimation error. Although MIC(X 1; X 2; Y)[5] can capture a wide range of interactions, it has a high computational cost triggered by its 3-D search. The other is the conversion strategy, such as doublets [6] and top scoring pair (TSP)[7]. They employ a new variable Z derived from the combinations between X 1 and X 2 (e.g. for the sum type of doublets, Z = X 1 + X 2) to measure I (Z; Y) instead of I(X 1; X 2; Y). This strategy is low computational cost, due to the search space reduced from 3-D to 2-D. However, it is unclear whether this conversion strategy can capture synergies[8] efficiently. Inspecting Fig. 1A–C, we found that they share the same pattern and can be characterized by the same function, Y = |X 1 − X 2|. The only difference between them is the value ranges of independent variables. Although Doublets [6] included sum, diff, mul and sign conversion types (TSP is similar to sign), it, unfortunately, ignored abs conversion type. In this work, we developed a simple and fast approach based on abs conversion type and t-test, to discover pair-wise synergic genes that are related to cancer. Furthermore, we validated these synergic genes by using classification performance with simulation and real-world datasets. Our results show that these synergic genes can enhance the individually discriminant model and improve the prediction performance. We also demonstrated that these synergic genes should be converted into new variables (Z) prior to be used as input features for classifiers, especially for many pairs of synergistic genes.

Datasets and Methods

Datasets

Four binary class datasets are involved in this work. The reference, sample size, number of genes in each dataset, and the number of samples in each class are summarized in Table 1. All gene expression data have been normalized by using the RMA method[9].
Table 1

Four binary class gene expression datasets.

DatasetsSample sizeNumber of genesReference
Prostate 1102(52, 50)12600Singh, D(2002)[11]
Lung cancer187 (97, 90)22,215Spira, A(2007)[17]; GSE4115
Prostate 2424 (264, 160)20,280Penney, K(2015)[18]; GSE62872
Cardiovascular disease378 (138, 240)22,277Ellsworth, D(2014)[19]; GSE46097
Four binary class gene expression datasets.

Conversion types and pair-wise gene rank

Suppose that a dataset has n samples and m genes, and can be denoted as {Y , X ij}, i = 1,2,…,n; j = 1,2,…,m. X ij represents the expression value of the j th gene (G j) in the i th sample; and Y i represents the class label of i th sample. Y i ∈ {0, 1}, 0 denotes cancerous and 1 denotes normal tissue samples. Rank-based methods[7] are robust to quantization effects and to overcome background differences between gene pairs. Therefore, let R denote the rank of the i th sample in the j th gene, we replace the expression values X ij by their ranks R and get a new data matrix {Y , R ij}. For two genes G and G , Doublets [6] lists four conversion types. We add a new conversion type:Here, i = 1,2,…,n; p = 1,2,…, m; q = 1,2,…, m; p ≠ q; s = 1,2,…, m(m−1)/2. Again, we get a new data matrix {Y , Z }. For each converted feature Z , we use the t-score, instead of I (Z; Y), to rank the association between Z and Y, since Y ∈ {0, 1}. The individually discriminant genes are also ranked by t- score.

Support Vector Machine Classifier and performance evaluation

Each gene pairs and each individually discriminant genes are ranked by t- score based on all samples. The Top N gene pairs and/or the Top N individually discriminant genes are selected as input features. Support Vector Machine (SVM) Classifier is available at http://www.csie.ntu.edu.tw/~cjlin/libsvm/ [10]. We simply use the average accuracy of five-fold cross-validation (CV) to evaluate the classifier performance as the datasets involved in this paper have balanced numbers of positive and negative samples.Here TP, TN, FP, FN denote true positives, true negatives, false positives and false negatives respectively.

Results and Discussion

Comparing gene pairs selected by different methods

Figure 2 illustrates the scatterplot of the top-two gene pairs selected by abs conversion type and six reference methods in Prostate1 dataset[11]. In Fig. 2A,B,M and N, although the top-two synergic genes selected by abs conversion type and MIC(X 1; X 2; Y) are different, they share the same pattern: each individual gene is unrelated to cancer by individual gene evaluation, but the pair-wise genes are sufficient to distinguish the cancer from normal samples. Figure 2C–L are the top-two gene pairs selected from sum, diff, mul, sign and dendrogram-based I(X 1; X2; Y) methods. As an example (Fig. 2C), the higher the gene PWP2 expression level, the more likely to suffer cancer. The gene MNAT1 showed similar pattern as PWP2. Thus, these two genes (PWP2 and MNAT1) are related with cancer directly. However, they are individually discriminant rather than synergic genes. In a word, only abs conversion type and MIC(X 1; X 2; Y) can capture typical pair-wise synergies, dendrogram-based I(X 1; X 2; Y) and doublets are helpless for discovering pair-wise gene interactions.
Figure 2

Top2 gene pairs selected by different methods in Prostate1 dataset. Red and green dots represent cancer and control, respectively. Gene expression levels are represented by the ranked values. K and L are from dendrogram-based I(X 1; X 2; Y)[4], M and N are from MIC(X 1; X 2; Y)[5].

Top2 gene pairs selected by different methods in Prostate1 dataset. Red and green dots represent cancer and control, respectively. Gene expression levels are represented by the ranked values. K and L are from dendrogram-based I(X 1; X 2; Y)[4], M and N are from MIC(X 1; X 2; Y)[5]. We then compared the overlaps among the informative genes selected by Ind, Sum, Diff, Mul, Sign and Abs methods (Table 2). Clearly, a considerable number of similar informative genes can be detected by the first five methods. On the contrary, the informative genes selected by Abs method have little overlap with the informative genes selected by the others.
Table 2

Overlaps among the informative genes selected by different methods in the Prostate1 dataset.

Ind(100) Sum(98) Diff(94) Mul(70) Sign(128) Abs(132)
Ind(100)
Sum(98)35
Diff(94)3641
Mul(70)232021
Sign(128)25283018
Abs(132)10000

Ind(100): The Top 100 individually discriminant genes selected by t-test. Sum (98): The Top 100 gene pairs selected by Sum conversion type and t-test, 98 genes reserved after removing repeated genes; the others as well.

Overlaps among the informative genes selected by different methods in the Prostate1 dataset. Ind(100): The Top 100 individually discriminant genes selected by t-test. Sum (98): The Top 100 gene pairs selected by Sum conversion type and t-test, 98 genes reserved after removing repeated genes; the others as well. Given the top10 pair-wise synergic genes (16 genes) selected by abs conversion type, Fig. 3 contains the heat maps generated by these genes with different conversion type. Only the heat maps with abs conversion type (Fig. 3A) and diff conversion type (Fig. 3C) can distinguish cancer from normal samples. In diff conversion type, the Z values are medium in cancer samples, but they are either low or high in normal samples, and vice versa. Therefore, the pair-wise synergic genes converted by diff will receive low t-scores and cannot be highlighted.
Figure 3

The heat maps generated by the same top10 synergic genes which were selected by abs conversion type. Each row corresponds to a pair of genes (A–E) or a gene (F), and each column corresponds to a sample. Gene expression levels are represented by the ranked values, and normalized to [−1, 1].

The heat maps generated by the same top10 synergic genes which were selected by abs conversion type. Each row corresponds to a pair of genes (A–E) or a gene (F), and each column corresponds to a sample. Gene expression levels are represented by the ranked values, and normalized to [−1, 1]. To answer whether the synergic genes selected by abs conversion type have any biological relevance to cancer, we further validated the top10 gene pairs (16 genes) according to UniHI[12] database (http://www.unihi.org/) and PubMed (Table 3). UniHI is an enhanced database for retrieval and interactive analysis of human molecular interaction networks. In Top10 gene pairs, so far we have found two gene pairs (PARP1-HMGB1 and CCHCR1-GRAP) that are associated with interaction in UniHI. The interaction between PARP1 and HMGB1 has been verified by Dara et al. (2007)[13], the activation of PARP1 induces release of the pro-inflammatory mediator HMGB1 from the nucleus[13-15]. Of the 16 genes, 15 of them have been reported to relate to cancer. Four of them have been reported to relate to prostate cancer directly. Although LINC01278 has not yet been reported to relate to cancer, abs conversion type suggests that it is an important informative gene. LINC01278 occurred three times in the top 10 gene pairs (Table 3), and should be given proper attention.
Table 3

The top10 synergic genes selected by abs conversion type in Prostate1 dataset.

Pair-wise synergic GenesRelated carcinoma and Ref.
ZNF324–EPHB4 Breast cancer[20] Prostate cancer[21]
TAB1–LINC01278 Breast cancer[22] Unreported
CDH22–LINC01278 Colorectal cancer[23] Unreported
KLF7–EXT1 Oral carcinoma[24] Cartilage-capped tumor[25]
SIPA1L3–LINC01278 Breast cancer[26] Unreported
KLF7–DDR2 Oral carcinoma[24] Lung cancer[27]
MMP23A–DIP2C Bladder cancer[28] – Breast and lung cancer[29]
CARM1–EPHB4 Prostate cancers[30] Prostate cancer[21]
CCHCR1–GRAP Skin cancer[31] Medullary thyroid carcinoma[32]
PARP1–HMGB1 Prostate cancer[33] Prostate cancer[13]
The top10 synergic genes selected by abs conversion type in Prostate1 dataset.

Classifier cannot learn well if synergic genes have not been converted properly

Although we get the pair-wise synergic genes based on abs conversion type, Fig. 3F suggests that the no conversion feature (X or R) cannot distinguish cancer from normal samples. It also indicates that the input features for classifiers should be conversion feature Z (Fig. 3A). Therefore, we conducted an experiment to further validate this hypothesis. Ten simulation datasets were generated according to Table 4; their prediction accuracy of 5 fold cross-validation is listed in Table 5.
Table 4

Ten simulation datasets and their input features.

DatasetFunctionNo converted input featuresConverted input features
1 Y = |X 1 − X 2| = Z 1 {X 1, X 2}{Z 1}
2 Y = |X 1 − X 2| + |X 3 − X 4| = Z 1 + Z 2 {X 1, X 2, X 3, X 4}{Z 1, Z 2}
10 Y = |X 1 − X 2| + |X 3 − X 4| + … + |X 19 − X 20| = Z 1 + Z 2 + … + Z 10 {X 1, X 2, X 3, X 4,…, X 19, X 20}{Z 1, Z 2,…, Z 10}

Here, X is assigned with random values between 0 and 1, and Y is binarized with the median. Sample size for each dataset is 200.

Table 5

Prediction accuracy with converted and not converted input features.

DatasetSVM-RBFa SVM-linearb SVM-polyc SVM-sigd RFANNsDT
Con. No con. Con. No con. Con. No con. Con. No con. Con. No con. Con. No con. Con. No con.
10.9850.9850.9900.6051.000.560.9900.5401.000.8651.000.9750.9950.895
20.9700.9050.9750.6000.9850.6400.9950.4550.9600.7950.9900.9300.9650.785
30.9850.8600.9750.4650.9800.5750.9750.5000.8600.7800.9950.9100.9000.705
40.9600.8100.9250.5150.9850.4000.9800.4200.8500.6550.9850.8250.8650.695
50.9700.7900.9100.5350.9650.5500.9800.4600.8100.6150.9950.7800.8400.600
60.9450.8150.8600.5000.9850.47509800.4850.7700.6200.9900.7700.7950.615
70.9400.7150.9050.5300.9800.5000.9800.5350.8650.6100.9850.6700.7950.585
80.9700.6750.9550.4100.9700.4550.9550.4550.7600.5450.9950.6950.7600.610
90.9550.6600.8850.5150.9600.4600.9550.4350.7900.5100.9900.6650.7700.580
100.9550.6550.8600.4800.9550.5250.9750.5250.7350.5200.9600.6000.7500.625

Here, a: SVM with radial basis function (RBF) kernel; b: SVM with linear kernel; c: SVM with polynomial kernel; d: SVM with sigmoid kernel. RF: Random Forest; ANNs: artificial neuron network; DT: Decision Tree; Con: the converted input features; No con: the not converted input features.

Ten simulation datasets and their input features. Here, X is assigned with random values between 0 and 1, and Y is binarized with the median. Sample size for each dataset is 200. Prediction accuracy with converted and not converted input features. Here, a: SVM with radial basis function (RBF) kernel; b: SVM with linear kernel; c: SVM with polynomial kernel; d: SVM with sigmoid kernel. RF: Random Forest; ANNs: artificial neuron network; DT: Decision Tree; Con: the converted input features; No con: the not converted input features. For the less input features (e.g dataset1 and dataset2) (Table 5), all of the seven models perform well by applying with the converted features, whereas only two models (SVM-RBF and ANNs) perform well by applying with the not- converted features. For the larger input features (e.g dataset9 and dataset10) (Table 5), although four models (SVM-RBF, SVM-poly, SVM-sig and ANNs) still perform well by applying with the converted features, none of these seven models perform well by applying with the not converted features. Thus, we can conclude that pair-wise synergic genes should be converted into new variables (Z) prior to be used as input features for classifiers, especially for many pairs of synergistic genes. This is a surprising and important discovery. Suppose phenotype Y is determined by individually discriminant genes X 1 and X 2, and pair-wise synergic genes X 3–X 4 and X 5–X 6. In other words, the true genetic model is , and the true optimal subset is {X 1, X 2, X 3, X 4, X 5, X 6}, X 7–X 1000 are genes unrelated to Y. Now we get the dataset {Y, X 1, X 2,…, X 1000} and want to construct a genomic prediction model[16] based on machine learning, but don’t know the true genetic model. Even the individual discriminant genes X 1 and X 2 can be highlighted by t-test, and the synergic genes X 3, X 4, X 5 and X 6 can be highlighted by Abs conversion type or MIC(X 1; X 2; Y), classifier cannot learn well when the input features space is {X 1, X 2, X 3, X 4, X 5, X 6}. It means that learning machine can never tell us the true optimal subset, if synergic genes have not been converted properly. This indicates the complexity of genomic prediction, also provides a new explain for “missing heritability” in GWAS study.

Combining individually discriminant and synergic genes can improve prediction performance

To further validate the reliability of synergic genes selected by abs conversion type, we also evaluated the prediction performance of individually discriminant and synergic genes with three more recent and larger publicly available datasets (Lung, Prostate2 and Cardiovascular) (see Table 1). Meantime, the label randomization tests were performed. The top individually discriminant genes are selected by t-test, the top synergic genes are selected by abs conversion type + t-test. Here, we take the individually discriminant genes and/or converted synergic genes as the input features for the SVM-RBF classifier. Table 6 illustrates the prediction of accuracy in different schemes of input features. The results show that: 1) By using the individually discriminant genes as input features alone, the average accuracies for Top10_Ind, Top20_Ind and Top40_Ind are 77.30%, 78.74% and 80.36%, respectively. By using the synergic genes as input features alone, the average accuracies for Top5_Syn, Top10_Syn and Top20_Syn are 75.58%, 81.67% and 84.63%, respectively. These indicate that the synergic genes receive comparable accuracy to the individually discriminant genes using the same number of genes. 2) When the input features involves 20 genes, the average accuracies for Top20_Ind, Top10_Syn and Top10_Ind + Top5_Syn are 78.74%, 81.67%, and 83.74%, respectively. When the input features involves 40 genes, the average accuracies for Top40_Ind, Top20_Syn and Top20_Ind + Top10_Syn are 80.36%, 84.63%, and 85.75%, respectively. These indicate that combining individually discriminant and synergic genes, rather than only using the individually discriminant genes or the synergic genes, can receive better prediction accuracies. 3) The classification performances of the label randomization tests drop to random, it validate the reliability of synergic genes selected by abs conversion type.
Table 6

Prediction accuracies of 5-fold CV in different schemes of input features (%).

Input featuresLungProstate2CardiovascularAverage
Top10_Ind 74.41 (43.81)84.20 (64.39)73.29 (63.22)77.30 (57.14)
Top20_Ind 76.49 (43.31)85.13 (61.08)74.59 (61.65)78.74 (55.35)
Top40_Ind 75.93 (46.02)84.20 (61.09)80.96 (62.95)80.36 (56.69)
Top5_Syn 76.54 (47.03)74.52 (62.25)75.67 (62.99)75.58 (57.42)
Top10_Syn 84.44 (50.28)76.18 (55.90)84.40 (61.38)81.67 (55.85)
Top20_Syn 83.98 (47.06)80.20 (62.96)89.70 (62.17)84.63 (57.40)
Top10_Ind + Top5_Syn 82.33 (48.17)86.34 (62.27)82.55 (63.22)83.74 (57.89)
Top20_Ind + Top10_Syn 83.91 (40.11)86.31 (57.54)87.04 (62.44)85.75 (53.36)

Ind represents the individually discriminant genes, Syn represents the synergic genes. A number in parentheses indicates the result of label randomization test.

Prediction accuracies of 5-fold CV in different schemes of input features (%). Ind represents the individually discriminant genes, Syn represents the synergic genes. A number in parentheses indicates the result of label randomization test. The minimum number of individually discriminant and synergic genes required in the optimal subset remains to be determined by the further research. We also compared the prediction performance of the 5 conversion types (Table 7). The results show that the genes selected by Abs conversion type have more powerful ability to improve prediction performance for the individually discriminant model than the genes selected by the other conversion types.
Table 7

Prediction accuracies of 5-fold CV in different conversion types (%).

FeaturesLungProstate2CardiovascularAverage
Top20_Ind 76.4985.1374.5978.73
Top10_Sum 80.6881.6178.8380.37
Top10_Diff 83.3785.8476.9782.06
Top10_Mul 80.8181.6179.0980.50
Top10_Sign 78.0884.6879.3880.71
Top10_Abs 84.4476.1884.4081.67
Top10_Sum + Top20_Ind 79.7085.1480.4281.75
Top10_Diff + Top20_Ind 82.3384.4483.3383.37
Top10_Mul + Top20_Ind 78.11 86.55 79.6481.43
Top10_Sign + Top20_Ind 81.3584.4376.2180.66
Top10_Abs + Top20_Ind 83.91 86.31 87.04 85.75

Top20_Ind: The Top20 individually discriminant genes selected by t-test. Top10_Sum: the Top10 gene pairs selected by Sum conversion types + t-test, the others as well.

Prediction accuracies of 5-fold CV in different conversion types (%). Top20_Ind: The Top20 individually discriminant genes selected by t-test. Top10_Sum: the Top10 gene pairs selected by Sum conversion types + t-test, the others as well.

Conclusion

In this paper, we propose a fast approach based on the combination of abs conversion type and t-test, to detect gene–gene synergy. We find that dendrogram-based I(X 1; X 2; Y) and doublets are helpless for discovering pair-wise gene interactions, and the synergic genes selected by our method and the MIC(X 1; X 2; Y) method are consistent with the typical pair-wise synergy. However, MIC(X 1; X 2; Y) has a higher computational cost. For example, the running time of the entire process on Prostate1 dataset (12,600 × 12,599/2 gene pairs) by MIC(X 1; X 2; Y) method is approximately 20 hours (Intel Core i5-4590@3.3 GHz), whereas it is only 47 minutes by our method. Experiments on simulated and real-world data showed that combining the individually discriminant genes selected by t-test and the synergic genes selected by our methods can improve prediction performance. These synergic genes should be converted into new variables (Z) prior to be used as input features for classifiers.
  30 in total

1.  Dual roles of PARP-1 promote cancer growth and progression.

Authors:  Matthew J Schiewer; Jonathan F Goodwin; Sumin Han; J Chad Brenner; Michael A Augello; Jeffry L Dean; Fengzhi Liu; Jamie L Planck; Preethi Ravindranathan; Arul M Chinnaiyan; Peter McCue; Leonard G Gomella; Ganesh V Raj; Adam P Dicker; Jonathan R Brody; John M Pascal; Margaret M Centenera; Lisa M Butler; Wayne D Tilley; Felix Y Feng; Karen E Knudsen
Journal:  Cancer Discov       Date:  2012-09-19       Impact factor: 39.397

2.  EphB4 expression and biological significance in prostate cancer.

Authors:  Guangbin Xia; S Ram Kumar; Rizwan Masood; Sutao Zhu; Ramchandra Reddy; Valery Krasnoperov; David I Quinn; Susan M Henshall; Robert L Sutherland; Jacek K Pinski; Siamak Daneshmand; Maurizio Buscarini; John P Stein; Chen Zhong; Daniel Broek; Pradip Roy-Burman; Parkash S Gill
Journal:  Cancer Res       Date:  2005-06-01       Impact factor: 12.701

3.  Mutations in the DDR2 kinase gene identify a novel therapeutic target in squamous cell lung cancer.

Authors:  Peter S Hammerman; Martin L Sos; Alex H Ramos; Chunxiao Xu; Amit Dutt; Wenjun Zhou; Lear E Brace; Brittany A Woods; Wenchu Lin; Jianming Zhang; Xianming Deng; Sang Min Lim; Stefanie Heynck; Martin Peifer; Jeffrey R Simard; Michael S Lawrence; Robert C Onofrio; Helga B Salvesen; Danila Seidel; Thomas Zander; Johannes M Heuckmann; Alex Soltermann; Holger Moch; Mirjam Koker; Frauke Leenders; Franziska Gabler; Silvia Querings; Sascha Ansén; Elisabeth Brambilla; Christian Brambilla; Philippe Lorimier; Odd Terje Brustugun; Aslaug Helland; Iver Petersen; Joachim H Clement; Harry Groen; Wim Timens; Hannie Sietsma; Erich Stoelben; Jürgen Wolf; David G Beer; Ming Sound Tsao; Megan Hanna; Charles Hatton; Michael J Eck; Pasi A Janne; Bruce E Johnson; Wendy Winckler; Heidi Greulich; Adam J Bass; Jeonghee Cho; Daniel Rauh; Nathanael S Gray; Kwok-Kin Wong; Eric B Haura; Roman K Thomas; Matthew Meyerson
Journal:  Cancer Discov       Date:  2011-06       Impact factor: 39.397

4.  Overexpression of high mobility group (HMG) B1 and B2 proteins directly correlates with the progression of squamous cell carcinoma in skin.

Authors:  Ashok Sharma; Ruma Ray; Moganty R Rajeswari
Journal:  Cancer Invest       Date:  2008-10       Impact factor: 2.176

5.  Gene expression correlates of clinical prostate cancer behavior.

Authors:  Dinesh Singh; Phillip G Febbo; Kenneth Ross; Donald G Jackson; Judith Manola; Christine Ladd; Pablo Tamayo; Andrew A Renshaw; Anthony V D'Amico; Jerome P Richie; Eric S Lander; Massimo Loda; Philip W Kantoff; Todd R Golub; William R Sellers
Journal:  Cancer Cell       Date:  2002-03       Impact factor: 31.743

6.  Genomic subtypes of breast cancer identified by array-comparative genomic hybridization display distinct molecular and clinical characteristics.

Authors:  Göran Jönsson; Johan Staaf; Johan Vallon-Christersson; Markus Ringnér; Karolina Holm; Cecilia Hegardt; Haukur Gunnarsson; Rainer Fagerholm; Carina Strand; Bjarni A Agnarsson; Outi Kilpivaara; Lena Luts; Päivi Heikkilä; Kristiina Aittomäki; Carl Blomqvist; Niklas Loman; Per Malmström; Håkan Olsson; Oskar Th Johannsson; Adalgeir Arason; Heli Nevanlinna; Rosa B Barkardottir; Ake Borg
Journal:  Breast Cancer Res       Date:  2010-06-24       Impact factor: 6.466

7.  Identification of gene interactions associated with disease from gene expression data using synergy networks.

Authors:  John Watkinson; Xiaodong Wang; Tian Zheng; Dimitris Anastassiou
Journal:  BMC Syst Biol       Date:  2008-01-30

8.  Computational analysis of the synergy among multiple interacting genes.

Authors:  Dimitris Anastassiou
Journal:  Mol Syst Biol       Date:  2007-02-13       Impact factor: 11.429

9.  UniHI 7: an enhanced database for retrieval and interactive analysis of human molecular interaction networks.

Authors:  Ravi Kiran Reddy Kalathur; José Pedro Pinto; Miguel A Hernández-Prieto; Rui S R Machado; Dulce Almeida; Gautam Chaurasia; Matthias E Futschik
Journal:  Nucleic Acids Res       Date:  2013-11-08       Impact factor: 16.971

10.  Informative gene selection and the direct classification of tumors based on relative simplicity.

Authors:  Yuan Chen; Lifeng Wang; Lanzhi Li; Hongyan Zhang; Zheming Yuan
Journal:  BMC Bioinformatics       Date:  2016-01-20       Impact factor: 3.169

View more
  2 in total

Review 1.  Data analysis methods for defining biomarkers from omics data.

Authors:  Chao Li; Zhenbo Gao; Benzhe Su; Guowang Xu; Xiaohui Lin
Journal:  Anal Bioanal Chem       Date:  2021-12-24       Impact factor: 4.142

2.  High dimensional model representation of log-likelihood ratio: binary classification with expression data.

Authors:  Ali Foroughi Pour; Maciej Pietrzak; Lori A Dalton; Grzegorz A Rempała
Journal:  BMC Bioinformatics       Date:  2020-04-25       Impact factor: 3.169

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.