Hiroyuki Toh1, Katsuhisa Horimoto. 1. Department of Bioinformatics, Biomolecular Engineering Research Institute 6-2-3, Furuedai, Suita, Osaka 565-0874, Japan. toh@beri.co.jp
Abstract
MOTIVATION: Recent advances in DNA microarray technologies have made it possible to measure the expression levels of thousands of genes simultaneously under different conditions. The data obtained by microarray analyses are called expression profile data. One type of important information underlying the expression profile data is the 'genetic network,' that is, the regulatory network among genes. Graphical Gaussian Modeling (GGM) is a widely utilized method to infer or test relationships among a plural of variables. RESULTS: In this study, we developed a method combining the cluster analysis with GGM for the inference of the genetic network from the expression profile data. The expression profile data of 2467 Saccharomyces cerevisiae genes measured under 79 different conditions (Eisen et al., PROC: Natl Acad. Sci. USA, 95, 14683-14868, 1998) were used for this study. At first, the 2467 genes were classified into 34 clusters by a cluster analysis, as a preprocessing for GGM. Then, the expression levels of the genes in each cluster were averaged for each condition. The averaged expression profile data of 34 clusters were subjected to GGM, and a partial correlation coefficient matrix was obtained as a model of the genetic network of S. cerevisiae. The accuracy of the inferred network was examined by the agreement of our results with the cumulative results of experimental studies.
MOTIVATION: Recent advances in DNA microarray technologies have made it possible to measure the expression levels of thousands of genes simultaneously under different conditions. The data obtained by microarray analyses are called expression profile data. One type of important information underlying the expression profile data is the 'genetic network,' that is, the regulatory network among genes. Graphical Gaussian Modeling (GGM) is a widely utilized method to infer or test relationships among a plural of variables. RESULTS: In this study, we developed a method combining the cluster analysis with GGM for the inference of the genetic network from the expression profile data. The expression profile data of 2467 Saccharomyces cerevisiae genes measured under 79 different conditions (Eisen et al., PROC: Natl Acad. Sci. USA, 95, 14683-14868, 1998) were used for this study. At first, the 2467 genes were classified into 34 clusters by a cluster analysis, as a preprocessing for GGM. Then, the expression levels of the genes in each cluster were averaged for each condition. The averaged expression profile data of 34 clusters were subjected to GGM, and a partial correlation coefficient matrix was obtained as a model of the genetic network of S. cerevisiae. The accuracy of the inferred network was examined by the agreement of our results with the cumulative results of experimental studies.
Authors: Joseph H Nadeau; Lindsay C Burrage; Joe Restivo; Yoh-Han Pao; Gary Churchill; Brian D Hoit Journal: Genome Res Date: 2003-09 Impact factor: 9.043
Authors: Tiina Blomster; Jarkko Salojärvi; Nina Sipari; Mikael Brosché; Reetta Ahlfors; Markku Keinänen; Kirk Overmyer; Jaakko Kangasjärvi Journal: Plant Physiol Date: 2011-10-17 Impact factor: 8.340
Authors: Timothy R Lezon; Jayanth R Banavar; Marek Cieplak; Amos Maritan; Nina V Fedoroff Journal: Proc Natl Acad Sci U S A Date: 2006-11-30 Impact factor: 11.205