
LARGE-SCALE MULTIPLE INFERENCE OF COLLECTIVE DEPENDENCE WITH APPLICATIONS TO PROTEIN FUNCTION.

Robert Jernigan, Kejue Jia, Zhao Ren, Wen Zhou.

Abstract

Measuring the dependence among k ≥ 3 random variables and drawing inference from such higher-order dependence are scientifically important yet challenging. Motivated by protein coevolution with multivariate categorical features, we consider an information-theoretic measure of higher-order dependence. The proposed collective dependence is a symmetrization of differential interaction information, which generalizes the mutual information of a pair of random variables. We show that the collective dependence can be easily estimated and facilitates a test of the dependence of k ≥ 3 random variables. After carefully exploring the null space of collective dependence, we devise a Classification-Assisted Large scaLe inference procedure to DEtect significant k-COllective DEpendence among d ≥ k random variables, with the false discovery rate controlled. The finite-sample performance of our method is examined via simulations. We apply this method to multiple protein sequence alignment data to study residue (position) coevolution in two protein families, the elongation factor P family and the zinc knuckle family. We identify novel functional triplets of amino acid residues, whose contributions to protein function are further investigated. These findings confirm that the collective dependence yields additional information, beyond that of pairwise measures, that is important for understanding protein coevolution.
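
To make the information-theoretic ingredients concrete, the following is a minimal sketch of plug-in (empirical) estimates of mutual information and of three-way interaction information for categorical data such as alignment columns. It is not the authors' collective dependence, whose symmetrization is not defined in this abstract, and it is not their testing procedure. The function names and the sign convention (the co-information form, written as an alternating sum of joint entropies) are assumptions made for illustration only; other references use the opposite sign.

```python
from collections import Counter
from math import log2


def entropy(*columns):
    """Plug-in joint entropy (in bits) of one or more categorical columns.

    Each column is a sequence of hashable symbols of equal length,
    e.g. the residues observed at one alignment position.
    """
    n = len(columns[0])
    counts = Counter(zip(*columns))  # joint frequency table
    return -sum((c / n) * log2(c / n) for c in counts.values())


def mutual_information(x, y):
    """I(X;Y) = H(X) + H(Y) - H(X,Y)."""
    return entropy(x) + entropy(y) - entropy(x, y)


def interaction_information(x, y, z):
    """Three-way interaction information, co-information convention (assumed):

    H(X) + H(Y) + H(Z) - H(X,Y) - H(X,Z) - H(Y,Z) + H(X,Y,Z),
    which equals I(X;Y) - I(X;Y|Z).
    """
    return (entropy(x) + entropy(y) + entropy(z)
            - entropy(x, y) - entropy(x, z) - entropy(y, z)
            + entropy(x, y, z))


if __name__ == "__main__":
    # Toy data: Z is the XOR of two balanced, independent bits.
    x = [0, 0, 1, 1]
    y = [0, 1, 0, 1]
    z = [a ^ b for a, b in zip(x, y)]
    print(mutual_information(x, y))          # pairwise dependence
    print(interaction_information(x, y, z))  # three-way dependence
```

In the XOR toy example every pair of variables is independent, so all pairwise mutual informations vanish, while the three-way quantity is nonzero (about one bit in magnitude). This is the kind of dependence that pairwise measures cannot detect. Plug-in estimates are biased in small samples, so any real analysis would need a correction or a calibrated test.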


Keywords:  Collective dependence; false discovery rate; information theoretic measure; multiple testing; protein coevolution; structural biology

Year: 2021    Date: 2021-07-12    PMID: 35910493    PMCID: PMC9337751    DOI: 10.1214/20-aoas1431

Source DB: PubMed    Journal: Ann Appl Stat    ISSN: 1932-6157    Impact factor: 1.959


