Literature DB >> 28392970

Generalized Hierarchical Sparse Model for Arbitrary-Order Interactive Antigenic Sites Identification in Flu Virus Data.

Lei Han1, Yu Zhang2, Xiu-Feng Wan3, Tong Zhang4.   

Abstract

Recent statistical evidence has shown that a regression model by incorporating the interactions among the original covariates/features can significantly improve the interpretability for biological data. One major challenge is the exponentially expanded feature space when adding high-order feature interactions to the model. To tackle the huge dimensionality, hierarchical sparse models (HSM) are developed by enforcing sparsity under heredity structures in the interactions among the covariates. However, existing methods only consider pairwise interactions, making the discovery of important high-order interactions a non-trivial open problem. In this paper, we propose a generalized hierarchical sparse model (GHSM) as a generalization of the HSM models to tackle arbitrary-order interactions. The GHSM applies the ℓ1 penalty to all the model coefficients under a constraint that given any covariate, if none of its associated kth-order interactions contribute to the regression model, then neither do its associated higher-order interactions. The resulting objective function is non-convex with a challenge lying in the coupled variables appearing in the arbitrary-order hierarchical constraints and we devise an efficient optimization algorithm to directly solve it. Specifically, we decouple the variables in the constraints via both the general iterative shrinkage and thresholding (GIST) and the alternating direction method of multipliers (ADMM) methods into three subproblems, each of which is proved to admit an efficiently analytical solution. We evaluate the GHSM method in both synthetic problem and the antigenic sites identification problem for the influenza virus data, where we expand the feature space up to the 5th-order interactions. Empirical results demonstrate the effectiveness and efficiency of the proposed methods and the learned high-order interactions have meaningful synergistic covariate patterns in the influenza virus antigenicity.

Entities:  

Keywords:  Analytical Solution; Antigenic Sites Identification; Heredity Structure; Hierarchical Sparsity; High-Order Interaction

Year:  2016        PMID: 28392970      PMCID: PMC5382970          DOI: 10.1145/2939672.2939786

Source DB:  PubMed          Journal:  KDD        ISSN: 2154-817X


  13 in total

1.  Coevolving protein residues: maximum likelihood identification and relationship to structure.

Authors:  D D Pollock; W R Taylor; N Goldman
Journal:  J Mol Biol       Date:  1999-03-19       Impact factor: 5.469

2.  Probing three-way interactions in moderated multiple regression: development and application of a slope difference test.

Authors:  Jeremy F Dawson; Andreas W Richter
Journal:  J Appl Psychol       Date:  2006-07

3.  Simultaneous amino acid substitutions at antigenic sites drive influenza A hemagglutinin evolution.

Authors:  Arthur Chun-Chieh Shih; Tzu-Chang Hsiao; Mei-Shang Ho; Wen-Hsiung Li
Journal:  Proc Natl Acad Sci U S A       Date:  2007-03-29       Impact factor: 11.205

4.  Convex Modeling of Interactions with Strong Heredity.

Authors:  Asad Haris; Daniela Witten; Noah Simon
Journal:  J Comput Graph Stat       Date:  2015-08-12       Impact factor: 2.302

5.  Gene-environment interaction analysis of serotonin system markers with adolescent depression.

Authors:  T C Eley; K Sugden; A Corsico; A M Gregory; P Sham; P McGuffin; R Plomin; I W Craig
Journal:  Mol Psychiatry       Date:  2004-10       Impact factor: 15.992

6.  Mapping the antigenic and genetic evolution of influenza virus.

Authors:  Derek J Smith; Alan S Lapedes; Jan C de Jong; Theo M Bestebroer; Guus F Rimmelzwaan; Albert D M E Osterhaus; Ron A M Fouchier
Journal:  Science       Date:  2004-06-24       Impact factor: 47.728

7.  Genetic-environmental interaction in the genesis of aggressivity and conduct disorders.

Authors:  R J Cadoret; W R Yates; E Troughton; G Woodworth; M A Stewart
Journal:  Arch Gen Psychiatry       Date:  1995-11

8.  Using sequence data to infer the antigenicity of influenza virus.

Authors:  Hailiang Sun; Jialiang Yang; Tong Zhang; Li-Ping Long; Kun Jia; Guohua Yang; Richard J Webby; Xiu-Feng Wan
Journal:  MBio       Date:  2013-07-02       Impact factor: 7.867

9.  Sequence-based antigenic change prediction by a sparse learning method incorporating co-evolutionary information.

Authors:  Jialiang Yang; Tong Zhang; Xiu-Feng Wan
Journal:  PLoS One       Date:  2014-09-04       Impact factor: 3.240

10.  Co-evolution positions and rules for antigenic variants of human influenza A/H3N2 viruses.

Authors:  Jhang-Wei Huang; Chwan-Chuen King; Jinn-Moon Yang
Journal:  BMC Bioinformatics       Date:  2009-01-30       Impact factor: 3.169

View more
  5 in total

1.  Graph-guided multi-task sparse learning model: a method for identifying antigenic variants of influenza A(H3N2) virus.

Authors:  Lei Han; Lei Li; Feng Wen; Lei Zhong; Tong Zhang; Xiu-Feng Wan
Journal:  Bioinformatics       Date:  2019-01-01       Impact factor: 6.937

2.  Variations outside the conserved motifs of PB1 catalytic active site may affect replication efficiency of the RNP complex of influenza A virus.

Authors:  Kaitlyn Waters; Hamilton J Wan; Lei Han; Jianli Xue; Matthew Ykema; Yizhi J Tao; Xiu-Feng Wan
Journal:  Virology       Date:  2021-04-09       Impact factor: 3.513

Review 3.  Antigenic characterization of influenza and SARS-CoV-2 viruses.

Authors:  Yang Wang; Cynthia Y Tang; Xiu-Feng Wan
Journal:  Anal Bioanal Chem       Date:  2021-12-14       Impact factor: 4.142

4.  Triple reassortment increases compatibility among viral ribonucleoprotein genes of contemporary avian and human influenza A viruses.

Authors:  Kaitlyn Waters; Cheng Gao; Matthew Ykema; Lei Han; Lynden Voth; Yizhi Jane Tao; Xiu-Feng Wan
Journal:  PLoS Pathog       Date:  2021-10-07       Impact factor: 6.823

5.  Multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza A(H1N1) virus using mutations and variations in glycosylation of Hemagglutinin.

Authors:  Lei Li; Deborah Chang; Lei Han; Xiaojian Zhang; Joseph Zaia; Xiu-Feng Wan
Journal:  BMC Bioinformatics       Date:  2020-05-11       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.