Yinan Zheng1, Zhe Fei1, Wei Zhang1, Justin B Starren1, Lei Liu1, Andrea A Baccarelli1, Yi Li1, Lifang Hou2. 1. Institute for Public Health and Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, Institute of Human Genetics, University of Illinois at Chicago, Chicago, IL 60612, Division of Health and Biomedical Informatics, Departments of Preventive Medicine and Medical Social Sciences, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, Department of Environmental Health, Harvard School of Public Health, Boston, MA 02115 and The Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA. 2. Institute for Public Health and Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, Institute of Human Genetics, University of Illinois at Chicago, Chicago, IL 60612, Division of Health and Biomedical Informatics, Departments of Preventive Medicine and Medical Social Sciences, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, Department of Environmental Health, Harvard School of Public Health, Boston, MA 02115 and The Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA Institute for Public Health and Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, Institute of Human Genetics, University of Illinois at Chicago, Chicago, IL 60612, Division of Health and Biomedical Informatics, Departments of Preventive Medicine and Medical Social Sciences, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, Department of Environmental Health, Harvard School of Public Health, Boston, MA 02115 and The Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA.
Abstract
MOTIVATION: MicroRNAs (miRNAs) are short single-stranded non-coding molecules that usually function as negative regulators to silence or suppress gene expression. Owning to the dynamic nature of miRNA and reduced microarray and sequencing costs, a growing number of researchers are now measuring high-dimensional miRNA expression data using repeated or multiple measures in which each individual has more than one sample collected and measured over time. However, the commonly used univariate association testing or the site-by-site (SBS) testing may underutilize the longitudinal feature of the data, leading to underpowered results and less biologically meaningful results. RESULTS: We propose a penalized regression model incorporating grid search method (PGS), for analyzing associations of high-dimensional miRNA expression data with repeated measures. The development of this analytical framework was motivated by a real-world miRNA dataset. Comparisons between PGS and the SBS testing revealed that PGS provided smaller phenotype prediction errors and higher enrichment of phenotype-related biological pathways than the SBS testing. Our extensive simulations showed that PGS provided more accurate estimates and higher sensitivity than the SBS testing with comparable specificities. AVAILABILITY AND IMPLEMENTATION: R source code for PGS algorithm, implementation example and simulation study are available for download at https://github.com/feizhe/PGS.
MOTIVATION: MicroRNAs (miRNAs) are short single-stranded non-coding molecules that usually function as negative regulators to silence or suppress gene expression. Owning to the dynamic nature of miRNA and reduced microarray and sequencing costs, a growing number of researchers are now measuring high-dimensional miRNA expression data using repeated or multiple measures in which each individual has more than one sample collected and measured over time. However, the commonly used univariate association testing or the site-by-site (SBS) testing may underutilize the longitudinal feature of the data, leading to underpowered results and less biologically meaningful results. RESULTS: We propose a penalized regression model incorporating grid search method (PGS), for analyzing associations of high-dimensional miRNA expression data with repeated measures. The development of this analytical framework was motivated by a real-world miRNA dataset. Comparisons between PGS and the SBS testing revealed that PGS provided smaller phenotype prediction errors and higher enrichment of phenotype-related biological pathways than the SBS testing. Our extensive simulations showed that PGS provided more accurate estimates and higher sensitivity than the SBS testing with comparable specificities. AVAILABILITY AND IMPLEMENTATION: R source code for PGS algorithm, implementation example and simulation study are available for download at https://github.com/feizhe/PGS.
Authors: Daehyun Baek; Judit Villén; Chanseok Shin; Fernando D Camargo; Steven P Gygi; David P Bartel Journal: Nature Date: 2008-07-30 Impact factor: 49.962
Authors: Christopher A Hamm; Jeff W Stevens; Hehuang Xie; Elio F Vanin; Jose A Morcuende; Hakeem Abdulkawy; Elisabeth A Seftor; Simone T Sredni; Jared M Bischof; Deli Wang; Sergey Malchenko; Maria de Fatima Bonaldo; Thomas L Casavant; Mary J C Hendrix; Marcelo B Soares Journal: BMC Cancer Date: 2010-09-01 Impact factor: 4.430
Authors: Bino John; Anton J Enright; Alexei Aravin; Thomas Tuschl; Chris Sander; Debora S Marks Journal: PLoS Biol Date: 2004-10-05 Impact factor: 8.029
Authors: Grace Yoon; Yinan Zheng; Zhou Zhang; Haixiang Zhang; Tao Gao; Brian Joyce; Wei Zhang; Weihua Guan; Andrea A Baccarelli; Wenxin Jiang; Joel Schwartz; Pantel S Vokonas; Lifang Hou; Lei Liu Journal: BMC Bioinformatics Date: 2017-03-07 Impact factor: 3.169