Cheng Peng1, Jun Wang1, Isaac Asante2, Stan Louie2, Ran Jin1, Lida Chatzi1, Graham Casey3, Duncan C Thomas1, David V Conti1. 1. Department of Preventive Medicine, Keck School of Medicine, Los Angeles, CA 90089, USA. 2. Department of Clinical Pharmacy, School of Pharmacy, University of Southern California, Los Angeles, CA 90089, USA. 3. Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia, Charlottesville, VA 22908, USA.
Abstract
MOTIVATION: Epidemiologic, clinical and translational studies are increasingly generating multiplatform omics data. Methods that can integrate across multiple high-dimensional data types while accounting for differential patterns are critical for uncovering novel associations and underlying relevant subgroups. RESULTS: We propose an integrative model to estimate latent unknown clusters (LUCID) aiming to both distinguish unique genomic, exposure and informative biomarkers/omic effects while jointly estimating subgroups relevant to the outcome of interest. Simulation studies indicate that we can obtain consistent estimates reflective of the true simulated values, accurately estimate subgroups and recapitulate subgroup-specific effects. We also demonstrate the use of the integrated model for future prediction of risk subgroups and phenotypes. We apply this approach to two real data applications to highlight the integration of genomic, exposure and metabolomic data. AVAILABILITY AND IMPLEMENTATION: The LUCID method is implemented through the LUCIDus R package available on CRAN (https://CRAN.R-project.org/package=LUCIDus). SUPPLEMENTARY INFORMATION: Supplementary materials are available at Bioinformatics online.
MOTIVATION: Epidemiologic, clinical and translational studies are increasingly generating multiplatform omics data. Methods that can integrate across multiple high-dimensional data types while accounting for differential patterns are critical for uncovering novel associations and underlying relevant subgroups. RESULTS: We propose an integrative model to estimate latent unknown clusters (LUCID) aiming to both distinguish unique genomic, exposure and informative biomarkers/omic effects while jointly estimating subgroups relevant to the outcome of interest. Simulation studies indicate that we can obtain consistent estimates reflective of the true simulated values, accurately estimate subgroups and recapitulate subgroup-specific effects. We also demonstrate the use of the integrated model for future prediction of risk subgroups and phenotypes. We apply this approach to two real data applications to highlight the integration of genomic, exposure and metabolomic data. AVAILABILITY AND IMPLEMENTATION: The LUCID method is implemented through the LUCIDus R package available on CRAN (https://CRAN.R-project.org/package=LUCIDus). SUPPLEMENTARY INFORMATION: Supplementary materials are available at Bioinformatics online.
Authors: Yen-Tsung Huang; Liming Liang; Miriam F Moffatt; William O C M Cookson; Xihong Lin Journal: Genet Epidemiol Date: 2015-05-22 Impact factor: 2.135
Authors: Michael C Reed; H Frederik Nijhout; Marian L Neuhouser; Jesse F Gregory; Barry Shane; S Jill James; Alanna Boynton; Cornelia M Ulrich Journal: J Nutr Date: 2006-10 Impact factor: 4.798
Authors: Barbara Janssens; Bhagyalaxmi Mohapatra; Matteo Vatta; Steven Goossens; Griet Vanpoucke; Patrick Kools; Tony Montoye; Jolanda van Hengel; Neil E Bowles; Frans van Roy; Jeffrey A Towbin Journal: Hum Genet Date: 2002-12-05 Impact factor: 4.132
Authors: Fredrick R Schumacher; Stephanie L Schmit; Shuo Jiao; Christopher K Edlund; Hansong Wang; Ben Zhang; Li Hsu; Shu-Chen Huang; Christopher P Fischer; John F Harju; Gregory E Idos; Flavio Lejbkowicz; Frank J Manion; Kevin McDonnell; Caroline E McNeil; Marilena Melas; Hedy S Rennert; Wei Shi; Duncan C Thomas; David J Van Den Berg; Carolyn M Hutter; Aaron K Aragaki; Katja Butterbach; Bette J Caan; Christopher S Carlson; Stephen J Chanock; Keith R Curtis; Charles S Fuchs; Manish Gala; Edward L Giovannucci; Stephanie M Gogarten; Richard B Hayes; Brian Henderson; David J Hunter; Rebecca D Jackson; Laurence N Kolonel; Charles Kooperberg; Sébastien Küry; Andrea LaCroix; Cathy C Laurie; Cecelia A Laurie; Mathieu Lemire; David Levine; Jing Ma; Karen W Makar; Conghui Qu; Darin Taverna; Cornelia M Ulrich; Kana Wu; Suminori Kono; Dee W West; Sonja I Berndt; Stéphane Bezieau; Hermann Brenner; Peter T Campbell; Andrew T Chan; Jenny Chang-Claude; Gerhard A Coetzee; David V Conti; David Duggan; Jane C Figueiredo; Barbara K Fortini; Steven J Gallinger; W James Gauderman; Graham Giles; Roger Green; Robert Haile; Tabitha A Harrison; Michael Hoffmeister; John L Hopper; Thomas J Hudson; Eric Jacobs; Motoki Iwasaki; Sun Ha Jee; Mark Jenkins; Wei-Hua Jia; Amit Joshi; Li Li; Noralene M Lindor; Keitaro Matsuo; Victor Moreno; Bhramar Mukherjee; Polly A Newcomb; John D Potter; Leon Raskin; Gad Rennert; Stephanie Rosse; Gianluca Severi; Robert E Schoen; Daniela Seminara; Xiao-Ou Shu; Martha L Slattery; Shoichiro Tsugane; Emily White; Yong-Bing Xiang; Brent W Zanke; Wei Zheng; Loic Le Marchand; Graham Casey; Stephen B Gruber; Ulrike Peters Journal: Nat Commun Date: 2015-07-07 Impact factor: 14.919
Authors: Rigoberto Pallares-Méndez; Carlos A Aguilar-Salinas; Ivette Cruz-Bautista; Laura Del Bosque-Plata Journal: Ann Med Date: 2016-01-30 Impact factor: 4.709
Authors: Ran Jin; Rob McConnell; Cioffi Catherine; Shujing Xu; Douglas I Walker; Nikos Stratakis; Dean P Jones; Gary W Miller; Cheng Peng; David V Conti; Miriam B Vos; Leda Chatzi Journal: Environ Int Date: 2019-11-16 Impact factor: 13.352
Authors: Nikos Stratakis; Lucy Golden-Mason; Katerina Margetaki; Yinqi Zhao; Damaskini Valvi; Erika Garcia; Léa Maitre; Sandra Andrusaityte; Xavier Basagana; Eva Borràs; Mariona Bustamante; Maribel Casas; Serena Fossati; Regina Grazuleviciene; Line Småstuen Haug; Barbara Heude; Rosemary R C McEachan; Helle Margrete Meltzer; Eleni Papadopoulou; Theano Roumeliotaki; Oliver Robinson; Eduard Sabidó; Jose Urquiza; Marina Vafeiadi; Nerea Varo; John Wright; Miriam B Vos; Howard Hu; Martine Vrijheid; Kiros T Berhane; David V Conti; Rob McConnell; Hugo R Rosen; Lida Chatzi Journal: Hepatology Date: 2021-08-30 Impact factor: 17.298
Authors: Anat Zimmer; Yael Korem; Noa Rappaport; Tomasz Wilmanski; Priyanka Baloni; Kathleen Jade; Max Robinson; Andrew T Magis; Jennifer Lovejoy; Sean M Gibbons; Leroy Hood; Nathan D Price Journal: Nat Commun Date: 2021-06-11 Impact factor: 14.919
Authors: Claudia Kasper; David Ribeiro; André M de Almeida; Catherine Larzul; Laurence Liaubet; Eduard Murani Journal: Genes (Basel) Date: 2020-08-11 Impact factor: 4.096
Authors: Nikos Stratakis; David V Conti; Eva Borras; Eduardo Sabido; Theano Roumeliotaki; Eleni Papadopoulou; Lydiane Agier; Xavier Basagana; Mariona Bustamante; Maribel Casas; Shohreh F Farzan; Serena Fossati; Juan R Gonzalez; Regina Grazuleviciene; Barbara Heude; Lea Maitre; Rosemary R C McEachan; Ioannis Theologidis; Jose Urquiza; Marina Vafeiadi; Jane West; John Wright; Rob McConnell; Anne-Lise Brantsaeter; Helle-Margrete Meltzer; Martine Vrijheid; Leda Chatzi Journal: JAMA Netw Open Date: 2020-03-02
Authors: Nikos Stratakis; David V Conti; Ran Jin; Katerina Margetaki; Damaskini Valvi; Alexandros P Siskos; Léa Maitre; Erika Garcia; Nerea Varo; Yinqi Zhao; Theano Roumeliotaki; Marina Vafeiadi; Jose Urquiza; Silvia Fernández-Barrés; Barbara Heude; Xavier Basagana; Maribel Casas; Serena Fossati; Regina Gražulevičienė; Sandra Andrušaitytė; Karan Uppal; Rosemary R C McEachan; Eleni Papadopoulou; Oliver Robinson; Line Småstuen Haug; John Wright; Miriam B Vos; Hector C Keun; Martine Vrijheid; Kiros T Berhane; Rob McConnell; Lida Chatzi Journal: Hepatology Date: 2020-10-19 Impact factor: 17.425