| Literature DB >> 35327392 |
Adrian Garcia-Moreno1, Raul López-Domínguez1,2, Juan Antonio Villatoro-García1,2, Alberto Ramirez-Mena1, Ernesto Aparicio-Puerta3,4, Michael Hackenberg3,4, Alberto Pascual-Montano5, Pedro Carmona-Saez1,2.
Abstract
Statistical methods for enrichment analysis are important tools to extract biological information from omics experiments. Although these methods have been widely used for the analysis of gene and protein lists, the development of high-throughput technologies for regulatory elements demands dedicated statistical and bioinformatics tools. Here, we present a set of enrichment analysis methods for regulatory elements, including CpG sites, miRNAs, and transcription factors. Statistical significance is determined via a power weighting function for target genes and tested by the Wallenius noncentral hypergeometric distribution model to avoid selection bias. These new methodologies have been applied to the analysis of a set of miRNAs associated with arrhythmia, showing the potential of this tool to extract biological information from a list of regulatory elements. These new methods are available in GeneCodis 4, a web tool able to perform singular and modular enrichment analysis that allows the integration of heterogeneous information.Entities:
Keywords: enrichment analysis; functional analysis; gene set analysis; regulation; web tool
Year: 2022 PMID: 35327392 PMCID: PMC8945021 DOI: 10.3390/biomedicines10030590
Source DB: PubMed Journal: Biomedicines ISSN: 2227-9059
Count of cardiac excitability associated terms in each type of analysis and annotation database in the top 20.
| Approaches | GO BP | PharmGKB | HPO | MNDR |
|---|---|---|---|---|
| Wallenius target-genes | 7 | 9 | 5 | - |
| Transformed DBs | 20 | 19 | 17 | - |
| miRNAs-based DBs | - | - | - | 15 |
| Hypergeometric target-genes | 4 | 6 | 8 | - |
Figure 1Network plot with genes hidden in the GO BP results of the use case with Wallenius strategy. Three clusters can be observed from top to button, the first related to cell cycle regulation, the second to gene regulation and the last one contains cardiac excitability biological processes.
Figure 2GeneCodis4 bars chart plot of the MNDR database in the direct annotation of miRNAs strategy. 15 out of 20 are heart-related disorders.
Figure 3Show the top 20 terms enriched with the transformed database strategy. Only five different miRNAs cause the enrichment of these terms, being four antiarrhythmics associated with all of them.