Literature DB >> 29088314

An exact test for comparing a fixed quantitative property between gene sets.

Matthew M Parks1.   

Abstract

Motivation: A significant difference in the distribution of a feature between two gene sets can provide insight into function or regulation. This statistical setting differs from much of hypothesis testing theory because the genome is often considered to be effectively fixed, finite and entirely known in commonly studied organisms, such as human. The Mann-Whitney U test is commonly employed in this scenario despite the assumptions of the test not being met, leading to unreliable and generally underpowered results. Permutation tests are also commonly employed for this purpose, but are computationally burdensome and are not tractable for obtaining small P values or for multiple comparisons.
Results: We present an exact test for the null hypothesis that gene set membership is independent of the quantitative gene feature of interest. We derive an analytic expression for the randomization distribution of the median of the quantitative feature under the null hypothesis. Efficient implementation permits calculation of precise P values of arbitrary magnitude and makes thousands of simultaneous tests of transcriptome-sized gene sets computationally tractable. The flexibility of the hypothesis testing framework presented permits extension to a variety of related tests commonly found in genomics. The exact test is used to identify signatures of translation control and protein function in the human genome. Availability and implementation: The exact test presented here is implemented in R in the package kpmt available on CRAN. Contact: map2085@med.cornell.edu. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2018        PMID: 29088314     DOI: 10.1093/bioinformatics/btx693

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  4 in total

1.  Genome Instability-Associated Long Non-Coding RNAs Reveal Biomarkers for Glioma Immunotherapy and Prognosis.

Authors:  Xinzhuang Wang; Hong Zhang; Junyi Ye; Ming Gao; Qiuyi Jiang; Tingting Zhao; Shengtao Wang; Wenbin Mao; Kaili Wang; Qi Wang; Xin Chen; Xu Hou; Dayong Han
Journal:  Front Genet       Date:  2022-04-27       Impact factor: 4.772

2.  ebGSEA: an improved Gene Set Enrichment Analysis method for Epigenome-Wide-Association Studies.

Authors:  Danyue Dong; Yuan Tian; Shijie C Zheng; Andrew E Teschendorff
Journal:  Bioinformatics       Date:  2019-09-15       Impact factor: 6.937

Review 3.  Epigenome-wide association studies: current knowledge, strategies and recommendations.

Authors:  Maria Pia Campagna; Alexandre Xavier; Jeannette Lechner-Scott; Vicky Maltby; Rodney J Scott; Helmut Butzkueven; Vilija G Jokubaitis; Rodney A Lea
Journal:  Clin Epigenetics       Date:  2021-12-04       Impact factor: 6.551

4.  Endogenous rRNA Sequence Variation Can Regulate Stress Response Gene Expression and Phenotype.

Authors:  Chad M Kurylo; Matthew M Parks; Manuel F Juette; Boris Zinshteyn; Roger B Altman; Jordana K Thibado; C Theresa Vincent; Scott C Blanchard
Journal:  Cell Rep       Date:  2018-10-02       Impact factor: 9.423

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.