| Literature DB >> 25914887 |
Michael A Newton1, Zhishi Wang2.
Abstract
An important data analysis task in statistical genomics involves the integration of genome-wide gene-level measurements with preexisting data on the same genes. A wide variety of statistical methodologies and computational tools have been developed for this general task. We emphasize one particular distinction among methodologies, namely whether they process gene sets one at a time (uniset) or simultaneously via some multiset technique. Owing to the complexity of collections of gene sets, the multiset approach offers some advantages, as it naturally accommodates set-size variations and among-set overlaps. However, this approach presents both computational and inferential challenges. After reviewing some statistical issues that arise in uniset analysis, we examine two model-based multiset methods for gene list data.Entities:
Keywords: gene set enrichment; role model; statistical genomics
Year: 2015 PMID: 25914887 PMCID: PMC4405258 DOI: 10.1146/annurev-statistics-010814-020335
Source DB: PubMed Journal: Annu Rev Stat Appl ISSN: 2326-8298 Impact factor: 5.810