Literature DB >> 35707487

Interpoint distance tests for high-dimensional comparison studies.

Marco Marozzi1, Amitava Mukherjee2, Jan Kalina3.   

Abstract

Modern data collection techniques allow to analyze a very large number of endpoints. In biomedical research, for example, expressions of thousands of genes are commonly measured only on a small number of subjects. In these situations, traditional methods for comparison studies are not applicable. Moreover, the assumption of normal distribution is often questionable for high-dimensional data, and some variables may be at the same time highly correlated with others. Hypothesis tests based on interpoint distances are very appealing for studies involving the comparison of means, because they do not assume data to come from normally distributed populations and comprise tests that are distribution free, unbiased, consistent, and computationally feasible, even if the number of endpoints is much larger than the number of subjects. New tests based on interpoint distances are proposed for multivariate studies involving simultaneous comparison of means and variability, or the whole distribution shapes. The tests are shown to perform well in terms of power, when the endpoints have complex dependence relations, such as in genomic and metabolomic studies. A practical application to a genetic cardiovascular case-control study is discussed.
© 2019 Informa UK Limited, trading as Taylor & Francis Group.

Entities:  

Keywords:  Multivariate data; biomedicine; genomics; nonparametric combination; nonparametric tests

Year:  2019        PMID: 35707487      PMCID: PMC9042018          DOI: 10.1080/02664763.2019.1649374

Source DB:  PubMed          Journal:  J Appl Stat        ISSN: 0266-4763            Impact factor:   1.416


  5 in total

1.  Multivariate tests based on interpoint distances with application to magnetic resonance imaging.

Authors:  Marco Marozzi
Journal:  Stat Methods Med Res       Date:  2014-04-16       Impact factor: 3.021

2.  Distance-based analysis of variance for brain connectivity.

Authors:  Russell T Shinohara; Haochang Shou; Marco Carone; Robert Schultz; Birkan Tunc; Drew Parker; Melissa Lynne Martin; Ragini Verma
Journal:  Biometrics       Date:  2019-09-30       Impact factor: 2.571

3.  A Robust Supervised Variable Selection for Noisy High-Dimensional Data.

Authors:  Jan Kalina; Anna Schlenker
Journal:  Biomed Res Int       Date:  2015-06-02       Impact factor: 3.411

4.  A hybrid approach of gene sets and single genes for the prediction of survival risks with gene expression data.

Authors:  Junhee Seok; Ronald W Davis; Wenzhong Xiao
Journal:  PLoS One       Date:  2015-05-01       Impact factor: 3.240

5.  Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data.

Authors:  Franck Rapaport; Raya Khanin; Yupu Liang; Mono Pirun; Azra Krek; Paul Zumbo; Christopher E Mason; Nicholas D Socci; Doron Betel
Journal:  Genome Biol       Date:  2013       Impact factor: 13.583

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.