Selecting high-dimensional mixed graphical models using minimal AIC or BIC forests.

David Edwards, Gabriel C G de Abreu, Rodrigo Labouriau.

Abstract

BACKGROUND: Chow and Liu showed that the maximum likelihood tree for multivariate discrete distributions may be found using a maximum weight spanning tree algorithm, for example Kruskal's algorithm. The efficiency of the algorithm makes it tractable for high-dimensional problems.
RESULTS: We extend Chow and Liu's approach in two ways: first, to find the forest optimizing a penalized likelihood criterion, for example AIC or BIC, and second, to handle data with both discrete and Gaussian variables. We apply the approach to three datasets: two from gene expression studies and the third from a genetics of gene expression study. The minimal BIC forest supplements a conventional analysis of differential expression by providing a tentative network for the differentially expressed genes. In the genetics of gene expression context the method identifies a network approximating the joint distribution of the DNA markers and the gene expression levels.
CONCLUSIONS: The approach is generally useful as a preliminary step towards understanding the overall dependence structure of high-dimensional discrete and/or continuous data. Trees and forests are unrealistically simple models for biological systems, but can provide useful insights. Uses include the following: identification of distinct connected components, which can be analysed separately (dimension reduction); identification of neighbourhoods for more detailed analyses; as initial models for search algorithms with a larger search space, for example decomposable models or Bayesian networks; and identification of interesting features, such as hub nodes.
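
The forest search described in the abstract can be illustrated with a minimal sketch for the purely discrete case. It assumes the standard penalized edge weight 2·N·I(u,v) − λ·(r_u − 1)(r_v − 1), where I(u,v) is the empirical mutual information, r_u and r_v are the numbers of levels of the two variables, and λ is log N for BIC or 2 for AIC. Edges with non-positive weight are discarded, and Kruskal's algorithm is run on the rest. The function names (`mutual_information`, `min_bic_forest`) and the toy data are hypothetical and are not taken from the paper or its software; the mixed discrete/Gaussian case needs different weights and is not covered here.

```python
import math
from collections import Counter
from itertools import combinations


def mutual_information(x, y):
    """Empirical mutual information (in nats) between two discrete sequences."""
    n = len(x)
    cx, cy, cxy = Counter(x), Counter(y), Counter(zip(x, y))
    return sum(
        (c / n) * math.log(c * n / (cx[a] * cy[b]))
        for (a, b), c in cxy.items()
    )


def min_bic_forest(columns, penalty=None):
    """Return the edges (u, v, weight) of a maximum-weight forest.

    columns: dict mapping variable name -> list of discrete observations.
    penalty: cost per free parameter; defaults to log(N) (BIC), use 2 for AIC.
    """
    names = list(columns)
    n = len(columns[names[0]])
    if penalty is None:
        penalty = math.log(n)

    # Penalized weight: likelihood-ratio gain minus penalty times added parameters.
    candidates = []
    for u, v in combinations(names, 2):
        df = (len(set(columns[u])) - 1) * (len(set(columns[v])) - 1)
        w = 2 * n * mutual_information(columns[u], columns[v]) - penalty * df
        if w > 0:  # an edge that does not improve the criterion is never added
            candidates.append((w, u, v))

    # Kruskal's algorithm with a small union-find structure.
    parent = {name: name for name in names}

    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]  # path halving
            a = parent[a]
        return a

    forest = []
    for w, u, v in sorted(candidates, reverse=True):
        ru, rv = find(u), find(v)
        if ru != rv:  # the edge joins two components, so no cycle is created
            parent[ru] = rv
            forest.append((u, v, w))
    return forest


# Hypothetical toy data: b copies a, d copies c, and a is independent of c.
data = {
    "a": [0, 1] * 20,
    "b": [0, 1] * 20,
    "c": [0, 0, 1, 1] * 10,
    "d": [0, 0, 1, 1] * 10,
}
forest = min_bic_forest(data)
print(sorted((u, v) for u, v, _ in forest))
```

On this toy input the cross-pair weights are negative, so the result is a forest with two connected components rather than a single spanning tree, which is the behaviour the abstract describes as useful for splitting a problem into separately analysable parts.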

Year: 2010    PMID: 20064242    PMCID: PMC2823705    DOI: 10.1186/1471-2105-11-18

Source DB: PubMed    Journal: BMC Bioinformatics    ISSN: 1471-2105    Impact factor: 3.169


