Literature DB >> 34643213

Disease category-specific annotation of variants using an ensemble learning framework.

Zhen Cao1,2, Yanting Huang3, Ran Duan4, Peng Jin5, Zhaohui S Qin3,6, Shihua Zhang1,2,7,8.   

Abstract

Understanding the impact of non-coding sequence variants on complex diseases is an essential problem. We present a novel ensemble learning framework-CASAVA, to predict genomic loci in terms of disease category-specific risk. Using disease-associated variants identified by GWAS as training data, and diverse sequencing-based genomics and epigenomics profiles as features, CASAVA provides risk prediction of 24 major categories of diseases throughout the human genome. Our studies showed that CASAVA scores at a genomic locus provide a reasonable prediction of the disease-specific and disease category-specific risk prediction for non-coding variants located within the locus. Taking MHC2TA and immune system diseases as an example, we demonstrate the potential of CASAVA in revealing variant-disease associations. A website (http://zhanglabtools.org/CASAVA) has been built to facilitate easily access to CASAVA scores.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  complex disease; disease category; ensemble learning; functional annotation; non-coding variant

Mesh:

Year:  2022        PMID: 34643213     DOI: 10.1093/bib/bbab438

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  1 in total

1.  scEpiLock: A Weakly Supervised Learning Framework for cis-Regulatory Element Localization and Variant Impact Quantification for Single-Cell Epigenetic Data.

Authors:  Yanwen Gong; Shushrruth Sai Srinivasan; Ruiyi Zhang; Kai Kessenbrock; Jing Zhang
Journal:  Biomolecules       Date:  2022-06-23
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.