Literature DB >> 19261718

MIST: Maximum Information Spanning Trees for dimension reduction of biological data sets.

Bracken M King1, Bruce Tidor.   

Abstract

MOTIVATION: The study of complex biological relationships is aided by large and high-dimensional data sets whose analysis often involves dimension reduction to highlight representative or informative directions of variation. In principle, information theory provides a general framework for quantifying complex statistical relationships for dimension reduction. Unfortunately, direct estimation of high-dimensional information theoretic quantities, such as entropy and mutual information (MI), is often unreliable given the relatively small sample sizes available for biological problems. Here, we develop and evaluate a hierarchy of approximations for high-dimensional information theoretic statistics from associated low-order terms, which can be more reliably estimated from limited samples. Due to a relationship between this metric and the minimum spanning tree over a graph representation of the system, we refer to these approximations as MIST (Maximum Information Spanning Trees).
RESULTS: The MIST approximations are examined in the context of synthetic networks with analytically computable entropies and using experimental gene expression data as a basis for the classification of multiple cancer types. The approximations result in significantly more accurate estimates of entropy and MI, and also correlate better with biological classification error than direct estimation and another low-order approximation, minimum-redundancy-maximum-relevance (mRMR). AVAILABILITY: Software to compute the entropy approximations described here is available as Supplementary Material. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2009        PMID: 19261718      PMCID: PMC2672626          DOI: 10.1093/bioinformatics/btp109

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  17 in total

1.  A systems model of signaling identifies a molecular basis set for cytokine-induced apoptosis.

Authors:  Kevin A Janes; John G Albeck; Suzanne Gaudet; Peter K Sorger; Douglas A Lauffenburger; Michael B Yaffe
Journal:  Science       Date:  2005-12-09       Impact factor: 47.728

2.  Information-based clustering.

Authors:  Noam Slonim; Gurinder Singh Atwal; Gasper Tkacik; William Bialek
Journal:  Proc Natl Acad Sci U S A       Date:  2005-12-13       Impact factor: 11.205

3.  Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy.

Authors:  Hanchuan Peng; Fuhui Long; Chris Ding
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2005-08       Impact factor: 6.226

4.  Genetic test bed for feature selection.

Authors:  Ashish Choudhary; Marcel Brun; Jianping Hua; James Lowey; Ed Suh; Edward R Dougherty
Journal:  Bioinformatics       Date:  2006-01-20       Impact factor: 6.937

5.  JEDA: Joint entropy diversity analysis. An information-theoretic method for choosing diverse and representative subsets from combinatorial libraries.

Authors:  Melissa R Landon; Scott E Schaus
Journal:  Mol Divers       Date:  2006-09-21       Impact factor: 2.943

6.  The art and practice of systems biology in medicine: mapping patterns of relationships.

Authors:  J van der Greef; S Martin; P Juhasz; A Adourian; T Plasterer; E R Verheij; R N McBurney
Journal:  J Proteome Res       Date:  2007-03-21       Impact factor: 4.466

7.  Information-theoretic inference of large transcriptional regulatory networks.

Authors:  Patrick E Meyer; Kevin Kontos; Frederic Lafitte; Gianluca Bontempi
Journal:  EURASIP J Bioinform Syst Biol       Date:  2007

8.  Reveal, a general reverse engineering algorithm for inference of genetic network architectures.

Authors:  S Liang; S Fuhrman; R Somogyi
Journal:  Pac Symp Biocomput       Date:  1998

9.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays.

Authors:  U Alon; N Barkai; D A Notterman; K Gish; S Ybarra; D Mack; A J Levine
Journal:  Proc Natl Acad Sci U S A       Date:  1999-06-08       Impact factor: 11.205

10.  Gene expression correlates of clinical prostate cancer behavior.

Authors:  Dinesh Singh; Phillip G Febbo; Kenneth Ross; Donald G Jackson; Judith Manola; Christine Ladd; Pablo Tamayo; Andrew A Renshaw; Anthony V D'Amico; Jerome P Richie; Eric S Lander; Massimo Loda; Philip W Kantoff; Todd R Golub; William R Sellers
Journal:  Cancer Cell       Date:  2002-03       Impact factor: 31.743

View more
  23 in total

1.  Entropy-enthalpy transduction caused by conformational shifts can obscure the forces driving protein-ligand binding.

Authors:  Andrew T Fenley; Hari S Muddana; Michael K Gilson
Journal:  Proc Natl Acad Sci U S A       Date:  2012-11-13       Impact factor: 11.205

Review 2.  Spatiotemporal positioning of multipotent modules in diverse biological networks.

Authors:  Yinying Chen; Zhong Wang; Yongyan Wang
Journal:  Cell Mol Life Sci       Date:  2014-01-11       Impact factor: 9.261

3.  Entropy Hotspots for the Binding of Intrinsically Disordered Ligands to a Receptor Domain.

Authors:  Jie Shi; Qingliang Shen; Jae-Hyun Cho; Wonmuk Hwang
Journal:  Biophys J       Date:  2020-04-08       Impact factor: 4.033

4.  Microscopic insights into the NMR relaxation-based protein conformational entropy meter.

Authors:  Vignesh Kasinath; Kim A Sharp; A Joshua Wand
Journal:  J Am Chem Soc       Date:  2013-09-25       Impact factor: 15.419

5.  Sloppy models, parameter uncertainty, and the role of experimental design.

Authors:  Joshua F Apgar; David K Witmer; Forest M White; Bruce Tidor
Journal:  Mol Biosyst       Date:  2010-06-17

6.  Efficient calculation of molecular configurational entropies using an information theoretic approximation.

Authors:  Bracken M King; Nathaniel W Silver; Bruce Tidor
Journal:  J Phys Chem B       Date:  2012-02-22       Impact factor: 2.991

7.  Comparing Conformational Ensembles Using the Kullback-Leibler Divergence Expansion.

Authors:  Christopher L McClendon; Lan Hua; Abriela Barreiro; Matthew P Jacobson
Journal:  J Chem Theory Comput       Date:  2012-04-13       Impact factor: 6.006

8.  Designing Well-Structured Cyclic Pentapeptides Based on Sequence-Structure Relationships.

Authors:  Diana P Slough; Sean M McHugh; Ashleigh E Cummings; Peng Dai; Bradley L Pentelute; Joshua A Kritzer; Yu-Shan Lin
Journal:  J Phys Chem B       Date:  2018-03-28       Impact factor: 2.991

9.  Synergistic drug-cytokine induction of hepatocellular death as an in vitro approach for the study of inflammation-associated idiosyncratic drug hepatotoxicity.

Authors:  Benjamin D Cosgrove; Bracken M King; Maya A Hasan; Leonidas G Alexopoulos; Paraskevi A Farazi; Bart S Hendriks; Linda G Griffith; Peter K Sorger; Bruce Tidor; Jinghai J Xu; Douglas A Lauffenburger
Journal:  Toxicol Appl Pharmacol       Date:  2009-04-09       Impact factor: 4.219

10.  β-Branched Amino Acids Stabilize Specific Conformations of Cyclic Hexapeptides.

Authors:  Ashleigh E Cummings; Jiayuan Miao; Diana P Slough; Sean M McHugh; Joshua A Kritzer; Yu-Shan Lin
Journal:  Biophys J       Date:  2019-01-03       Impact factor: 4.033

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.