| Literature DB >> 28326099 |
Lin Chen1, Gouri Mukerjee2, Ruslan Dorfman2, Seyed M Moghadas1.
Abstract
Much effort has been devoted to assess disease risk based on large-scale protein-protein network and genotype-phenotype associations. However, the challenge of risk prediction for complex diseases remains unaddressed. Here, we propose a framework to quantify the risk based on a Voronoi tessellation network analysis, taking into account the disease association scores of both genes and variants. By integrating ClinVar, SNPnexus, and DISEASES databases, we introduce a gene-variant map that is based on the pairwise disease-associated gene-variant scores. This map is clustered using Voronoi tessellation and network analysis with a threshold obtained from fitting the background Voronoi cell density distribution. We define the relative risk of disease that is inferred from the scores of the data points within the related clusters on the gene-variant map. We identify autoimmune-associated clusters that may interact at the system-level. The proposed framework can be used to determine the clusters that are specific to a subtype or contribute to multiple subtypes of complex diseases.Entities:
Keywords: Voronoi tessellation; cluster analysis; data analysis; disease risk assessment; gene-variant scores
Year: 2017 PMID: 28326099 PMCID: PMC5339255 DOI: 10.3389/fgene.2017.00029
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Gene-variant map of breast cancer, generated based on the disease association scores at both gene and variant levels. The x-axis represents the Z-score of genes with breast cancer association. The y-axis represents the ratio of 1-SIFT scores of variants associated with breast cancer to that of all variants for each gene.
Figure 2Identification of the threshold for clustering based on the Chi-square model fitting to the background distribution of the normalized Voronoi cell density of the autoimmune associated gene-variant map. The red curve represents the fit by the Chi-square distribution to the cumulative distribution of the background (80% of the normalized Voronoi cell density) of autoimmune associated data points (blue circle). The threshold (dashed line) was determined at 3.96, and at the significance level of 90%. The subplot shows the identified threshold (dashed line) on the cumulative distribution of the normalized Voronoi cell density of the entire data points associated with autoimmune.
Figure 3Autoimmune associated clusters identified by the Voronoi tessellation network analysis. (A) Voronoi tessellation of data points, in which four detected clusters are highlighted in colored cells. (B) Genes of data points belonging to the same clusters identified by Voronoi tessellation in (A).