Literature DB >> 34935928

Gene set analysis with graph embedded kernel association test.

Jialin Qu1, Yuehua Cui1.   

Abstract

MOTIVATION: Kernel-based association test (KAT) has been a popular approach to evaluate the association of expressions of a gene set (e.g., pathway) with a phenotypic trait. KATs rely on kernel functions which capture the sample similarity across multiple features, to capture potential linear or nonlinear relationship among features in a gene set. When calculating the kernel functions, no network graphical information about the features is considered. While genes in a functional group (e.g., a pathway) are not independent in general due to regulatory interactions, incorporating regulatory network (or graph) information can potentially increase the power of KAT. In this work, we propose a graph-embedded kernel association test, termed gKAT. gKAT incorporates prior pathway knowledge when constructing a kernel function into hypothesis testing.
RESULTS: We apply a diffusion kernel to capture any graph structures in a gene set, then incorporate such information to build a kernel function for further association test. We illustrate the geometric meaning of the approach. Through extensive simulation studies, we show that the proposed gKAT algorithm can improve testing power compared to the one without considering graph structures. Application to a real data set further demonstrate the utility of the method. AVAILABILITY: The R code used for the analysis can be accessed at https://github.com/JialinQu/gKAT. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) (2021). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Year:  2021        PMID: 34935928      PMCID: PMC8896609          DOI: 10.1093/bioinformatics/btab851

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  22 in total

1.  A powerful and flexible multilocus association test for quantitative traits.

Authors:  Lydia Coulter Kwee; Dawei Liu; Xihong Lin; Debashis Ghosh; Michael P Epstein
Journal:  Am J Hum Genet       Date:  2008-02       Impact factor: 11.025

2.  Integrative analysis of genetical genomics data incorporating network structures.

Authors:  Bin Gao; Xu Liu; Hongzhe Li; Yuehua Cui
Journal:  Biometrics       Date:  2019-04-29       Impact factor: 2.571

3.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

4.  Expansion of the BioCyc collection of pathway/genome databases to 160 genomes.

Authors:  Peter D Karp; Christos A Ouzounis; Caroline Moore-Kochlacs; Leon Goldovsky; Pallavi Kaipa; Dag Ahrén; Sophia Tsoka; Nikos Darzentas; Victor Kunin; Núria López-Bigas
Journal:  Nucleic Acids Res       Date:  2005-10-24       Impact factor: 16.971

5.  Reactome: a knowledgebase of biological pathways.

Authors:  G Joshi-Tope; M Gillespie; I Vastrik; P D'Eustachio; E Schmidt; B de Bono; B Jassal; G R Gopinath; G R Wu; L Matthews; S Lewis; E Birney; L Stein
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

6.  STRING v10: protein-protein interaction networks, integrated over the tree of life.

Authors:  Damian Szklarczyk; Andrea Franceschini; Stefan Wyder; Kristoffer Forslund; Davide Heller; Jaime Huerta-Cepas; Milan Simonovic; Alexander Roth; Alberto Santos; Kalliopi P Tsafou; Michael Kuhn; Peer Bork; Lars J Jensen; Christian von Mering
Journal:  Nucleic Acids Res       Date:  2014-10-28       Impact factor: 16.971

7.  Gene set analysis methods: a systematic comparison.

Authors:  Ravi Mathur; Daniel Rotroff; Jun Ma; Ali Shojaie; Alison Motsinger-Reif
Journal:  BioData Min       Date:  2018-05-31       Impact factor: 2.522

8.  PIMKL: Pathway-Induced Multiple Kernel Learning.

Authors:  Matteo Manica; Joris Cadow; Roland Mathis; María Rodríguez Martínez
Journal:  NPJ Syst Biol Appl       Date:  2019-03-05

9.  Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models.

Authors:  Dawei Liu; Debashis Ghosh; Xihong Lin
Journal:  BMC Bioinformatics       Date:  2008-06-24       Impact factor: 3.169

10.  Adaptive diffusion kernel learning from biological networks for protein function prediction.

Authors:  Liang Sun; Shuiwang Ji; Jieping Ye
Journal:  BMC Bioinformatics       Date:  2008-03-25       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.