Literature DB >> 28605402

Sparse redundancy analysis of high-dimensional genetic and genomic data.

Attila Csala1, Frans P J M Voorbraak2, Aeilko H Zwinderman1, Michel H Hof1.   

Abstract

MOTIVATION: Recent technological developments have enabled the possibility of genetic and genomic integrated data analysis approaches, where multiple omics datasets from various biological levels are combined and used to describe (disease) phenotypic variations. The main goal is to explain and ultimately predict phenotypic variations by understanding their genetic basis and the interaction of the associated genetic factors. Therefore, understanding the underlying genetic mechanisms of phenotypic variations is an ever increasing research interest in biomedical sciences. In many situations, we have a set of variables that can be considered to be the outcome variables and a set that can be considered to be explanatory variables. Redundancy analysis (RDA) is an analytic method to deal with this type of directionality. Unfortunately, current implementations of RDA cannot deal optimally with the high dimensionality of omics data (p≫n). The existing theoretical framework, based on Ridge penalization, is suboptimal, since it includes all variables in the analysis. As a solution, we propose to use Elastic Net penalization in an iterative RDA framework to obtain a sparse solution.
RESULTS: We proposed sparse redundancy analysis (sRDA) for high dimensional omics data analysis. We conducted simulation studies with our software implementation of sRDA to assess the reliability of sRDA. Both the analysis of simulated data, and the analysis of 485 512 methylation markers and 18,424 gene-expression values measured in a set of 55 patients with Marfan syndrome show that sRDA is able to deal with the usual high dimensionality of omics data.
AVAILABILITY AND IMPLEMENTATION: http://uva.csala.me/rda. CONTACT: a.csala@amc.uva.nl. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

Entities:  

Mesh:

Year:  2017        PMID: 28605402     DOI: 10.1093/bioinformatics/btx374

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  4 in total

1.  The immuno-behavioural covariation associated with the treatment response to bumetanide in young children with autism spectrum disorder.

Authors:  Qingyang Li; Lingli Zhang; Haidi Shan; Juehua Yu; Yuan Dai; Hua He; Wei-Guang Li; Christelle Langley; Barbara J Sahakian; Yin Yao; Qiang Luo; Fei Li
Journal:  Transl Psychiatry       Date:  2022-06-03       Impact factor: 7.989

2.  Multiset sparse redundancy analysis for high-dimensional omics data.

Authors:  Attila Csala; Michel H Hof; Aeilko H Zwinderman
Journal:  Biom J       Date:  2018-12-03       Impact factor: 2.207

3.  Multiset sparse partial least squares path modeling for high dimensional omics data analysis.

Authors:  Attila Csala; Aeilko H Zwinderman; Michel H Hof
Journal:  BMC Bioinformatics       Date:  2020-01-09       Impact factor: 3.169

4.  Where Do We Stand in Regularization for Life Science Studies?

Authors:  Veronica Tozzo; Chloé-Agathe Azencott; Samuele Fiorini; Emanuele Fava; Andrea Trucco; Annalisa Barla
Journal:  J Comput Biol       Date:  2021-04-29       Impact factor: 1.479

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.