MOTIVATION: High-throughput molecular data provide a wealth of information that can be integrated into network analysis. Several approaches exist that identify functional modules in the context of integrated biological networks. The objective of this study is 2-fold: first, to assess the accuracy and variability of identified modules and second, to develop an algorithm for deriving highly robust and accurate solutions. RESULTS: In a comparative simulation study accuracy and robustness of the proposed and established methodologies are validated, considering various sources of variation in the data. To assess this variation, we propose a jackknife resampling procedure resulting in an ensemble of optimal modules. A consensus approach summarizes the ensemble into one final module containing maximally robust nodes and edges. The resulting consensus module identifies and visualizes robust and variable regions by assigning support values to nodes and edges. Finally, the proposed approach is exemplified on two large gene expression studies: diffuse large B-cell lymphoma and acute lymphoblastic leukemia.
MOTIVATION: High-throughput molecular data provide a wealth of information that can be integrated into network analysis. Several approaches exist that identify functional modules in the context of integrated biological networks. The objective of this study is 2-fold: first, to assess the accuracy and variability of identified modules and second, to develop an algorithm for deriving highly robust and accurate solutions. RESULTS: In a comparative simulation study accuracy and robustness of the proposed and established methodologies are validated, considering various sources of variation in the data. To assess this variation, we propose a jackknife resampling procedure resulting in an ensemble of optimal modules. A consensus approach summarizes the ensemble into one final module containing maximally robust nodes and edges. The resulting consensus module identifies and visualizes robust and variable regions by assigning support values to nodes and edges. Finally, the proposed approach is exemplified on two large gene expression studies: diffuse large B-cell lymphoma and acute lymphoblastic leukemia.
Authors: Daniela Beisser; Markus A Grohme; Joachim Kopka; Marcus Frohme; Ralph O Schill; Steffen Hengherr; Thomas Dandekar; Gunnar W Klau; Marcus Dittrich; Tobias Müller Journal: BMC Syst Biol Date: 2012-06-19
Authors: Nicolas Alcaraz; Markus List; Richa Batra; Fabio Vandin; Henrik J Ditzel; Jan Baumbach Journal: Nucleic Acids Res Date: 2017-09-19 Impact factor: 16.971
Authors: Christian W Remmele; Christian H Luther; Johannes Balkenhol; Thomas Dandekar; Tobias Müller; Marcus T Dittrich Journal: Front Microbiol Date: 2015-08-04 Impact factor: 5.640
Authors: Alexey A Sergushichev; Alexander A Loboda; Abhishek K Jha; Emma E Vincent; Edward M Driggers; Russell G Jones; Edward J Pearce; Maxim N Artyomov Journal: Nucleic Acids Res Date: 2016-04-20 Impact factor: 16.971
Authors: Nicolas Alcaraz; Markus List; Martin Dissing-Hansen; Marc Rehmsmeier; Qihua Tan; Jan Mollenhauer; Henrik J Ditzel; Jan Baumbach Journal: F1000Res Date: 2016-06-28