| Literature DB >> 33906263 |
Sanguo Zhang1, Xiaonan Hu1, Ziye Luo2, Yu Jiang3, Yifan Sun2, Shuangge Ma2,4.
Abstract
Heterogeneity is a hallmark of many complex diseases. There are multiple ways of defining heterogeneity, among which the heterogeneity in genetic regulations, for example, gene expressions (GEs) by copy number variations (CNVs), and methylation, has been suggested but little investigated. Heterogeneity in genetic regulations can be linked with disease severity, progression, and other traits and is biologically important. However, the analysis can be very challenging with the high dimensionality of both sides of regulation as well as sparse and weak signals. In this article, we consider the scenario where subjects form unknown subgroups, and each subgroup has unique genetic regulation relationships. Further, such heterogeneity is "guided" by a known biomarker. We develop a multivariate sparse fusion (MSF) approach, which innovatively applies the penalized fusion technique to simultaneously determine the number and structure of subgroups and regulation relationships within each subgroup. An effective computational algorithm is developed, and extensive simulations are conducted. The analysis of heterogeneity in the GE-CNV regulations in melanoma and GE-methylation regulations in stomach cancer using the TCGA data leads to interesting findings.Entities:
Keywords: biomarker; genetic regulations; heterogeneity analysis; multivariate sparse fusion
Mesh:
Year: 2021 PMID: 33906263 PMCID: PMC8277716 DOI: 10.1002/sim.9006
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.497