MOTIVATION: The identification of gene regulatory modules is an important yet challenging problem in computational biology. While many computational methods have been proposed to identify regulatory modules, their initial success is largely compromised by a high rate of false positives, especially when applied to human cancer studies. New strategies are needed for reliable regulatory module identification. RESULTS: We present a new approach, namely multilevel support vector regression (ml-SVR), to systematically identify condition-specific regulatory modules. The approach is built upon a multilevel analysis strategy designed for suppressing false positive predictions. With this strategy, a regulatory module becomes ever more significant as more relevant gene sets are formed at finer levels. At each level, a two-stage support vector regression (SVR) method is utilized to help reduce false positive predictions by integrating binding motif information and gene expression data; a significant analysis procedure is followed to assess the significance of each regulatory module. To evaluate the effectiveness of the proposed strategy, we first compared the ml-SVR approach with other existing methods on simulation data and yeast cell cycle data. The resulting performance shows that the ml-SVR approach outperforms other methods in the identification of both regulators and their target genes. We then applied our method to breast cancer cell line data to identify condition-specific regulatory modules associated with estrogen treatment. Experimental results show that our method can identify biologically meaningful regulatory modules related to estrogen signaling and action in breast cancer. AVAILABILITY AND IMPLEMENTATION: The ml-SVR MATLAB package can be downloaded at http://www.cbil.ece.vt.edu/software.htm.
MOTIVATION: The identification of gene regulatory modules is an important yet challenging problem in computational biology. While many computational methods have been proposed to identify regulatory modules, their initial success is largely compromised by a high rate of false positives, especially when applied to humancancer studies. New strategies are needed for reliable regulatory module identification. RESULTS: We present a new approach, namely multilevel support vector regression (ml-SVR), to systematically identify condition-specific regulatory modules. The approach is built upon a multilevel analysis strategy designed for suppressing false positive predictions. With this strategy, a regulatory module becomes ever more significant as more relevant gene sets are formed at finer levels. At each level, a two-stage support vector regression (SVR) method is utilized to help reduce false positive predictions by integrating binding motif information and gene expression data; a significant analysis procedure is followed to assess the significance of each regulatory module. To evaluate the effectiveness of the proposed strategy, we first compared the ml-SVR approach with other existing methods on simulation data and yeast cell cycle data. The resulting performance shows that the ml-SVR approach outperforms other methods in the identification of both regulators and their target genes. We then applied our method to breast cancer cell line data to identify condition-specific regulatory modules associated with estrogen treatment. Experimental results show that our method can identify biologically meaningful regulatory modules related to estrogen signaling and action in breast cancer. AVAILABILITY AND IMPLEMENTATION: The ml-SVR MATLAB package can be downloaded at http://www.cbil.ece.vt.edu/software.htm.
Authors: N Brünner; B Boysen; S Jirus; T C Skaar; C Holst-Hansen; J Lippman; T Frandsen; M Spang-Thomsen; S A Fuqua; R Clarke Journal: Cancer Res Date: 1997-08-15 Impact factor: 12.701
Authors: A Sala; B Saitta; P De Luca; M N Cervellera; I Casella; R E Lewis; R Watson; C Peschle Journal: Oncogene Date: 1999-02-11 Impact factor: 9.867
Authors: R Clarke; N Brünner; B S Katzenellenbogen; E W Thompson; M J Norman; C Koppi; S Paik; M E Lippman; R B Dickson Journal: Proc Natl Acad Sci U S A Date: 1989-05 Impact factor: 11.205
Authors: P T Spellman; G Sherlock; M Q Zhang; V R Iyer; K Anders; M B Eisen; P O Brown; D Botstein; B Futcher Journal: Mol Biol Cell Date: 1998-12 Impact factor: 4.138
Authors: Anne Boulay; Madlaina Breuleux; Christine Stephan; Caroline Fux; Cathrin Brisken; Maryse Fiche; Markus Wartmann; Michael Stumm; Heidi A Lane; Nancy E Hynes Journal: Cancer Res Date: 2008-05-15 Impact factor: 12.701
Authors: Robert Clarke; Ayesha N Shajahan; Yue Wang; John J Tyson; Rebecca B Riggins; Louis M Weiner; William T Bauman; Jianhua Xuan; Bai Zhang; Caroline Facey; Harini Aiyer; Katherine Cook; F Edward Hickman; Iman Tavassoly; Anael Verdugo; Chun Chen; Alan Zwart; Anni Wärri; Leena A Hilakivi-Clarke Journal: Horm Mol Biol Clin Investig Date: 2011-03
Authors: S Mounika Inavolu; J Renbarger; M Radovich; V Vasudevaraja; G H Kinnebrew; S Zhang; L Cheng Journal: CPT Pharmacometrics Syst Pharmacol Date: 2017-03-07
Authors: Jiangning Song; Hao Tan; Andrew J Perry; Tatsuya Akutsu; Geoffrey I Webb; James C Whisstock; Robert N Pike Journal: PLoS One Date: 2012-11-29 Impact factor: 3.240