Literature DB >> 33063116

Robust Huber-LASSO for improved prediction of protein, metabolite and gene expression levels relying on individual genotype data.

Heike Deutelmoser1, Dominique Scherer1, Hermann Brenner2, Melanie Waldenberger3, Karsten Suhre4, Gabi Kastenmüller5, Justo Lorenzo Bermejo6.   

Abstract

Least absolute shrinkage and selection operator (LASSO) regression is often applied to select the most promising set of single nucleotide polymorphisms (SNPs) associated with a molecular phenotype of interest. While the penalization parameter λ restricts the number of selected SNPs and the potential model overfitting, the least-squares loss function of standard LASSO regression translates into a strong dependence of statistical results on a small number of individuals with phenotypes or genotypes divergent from the majority of the study population-typically comprised of outliers and high-leverage observations. Robust methods have been developed to constrain the influence of divergent observations and generate statistical results that apply to the bulk of study data, but they have rarely been applied to genetic association studies. In this article, we review, for newcomers to the field of robust statistics, a novel version of standard LASSO that utilizes the Huber loss function. We conduct comprehensive simulations and analyze real protein, metabolite, mRNA expression and genotype data to compare the stability of penalization, the cross-iteration concordance of the model, the false-positive and true-positive rates and the prediction accuracy of standard and robust Huber-LASSO. Although the two methods showed controlled false-positive rates ≤2.1% and similar true-positive rates, robust Huber-LASSO outperformed standard LASSO in the accuracy of predicted protein, metabolite and gene expression levels using individual SNP data. The conducted simulations and real-data analyses show that robust Huber-LASSO represents a valuable alternative to standard LASSO in genetic studies of molecular phenotypes.
© The Author(s) 2020. Published by Oxford University Press.

Entities:  

Keywords:  Huber loss function; LASSO; genetic prediction; molecular data; robust statistics

Year:  2021        PMID: 33063116      PMCID: PMC8293825          DOI: 10.1093/bib/bbaa230

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  12 in total

1.  Genome-wide association study of Hirschsprung disease detects a novel low-frequency variant at the RET locus.

Authors:  João Fadista; Marie Lund; Line Skotte; Frank Geller; Priyanka Nandakumar; Sumantra Chatterjee; Hans Matsson; Anna Löf Granström; Tomas Wester; Perttu Salo; Valtter Virtanen; Lisbeth Carstensen; Jonas Bybjerg-Grauholm; David Michael Hougaard; Mikko Pakarinen; Markus Perola; Agneta Nordenskjöld; Aravinda Chakravarti; Mads Melbye; Bjarke Feenstra
Journal:  Eur J Hum Genet       Date:  2018-01-29       Impact factor: 4.246

2.  Estimation of effect size distribution from genome-wide association studies and implications for future discoveries.

Authors:  Ju-Hyun Park; Sholom Wacholder; Mitchell H Gail; Ulrike Peters; Kevin B Jacobs; Stephen J Chanock; Nilanjan Chatterjee
Journal:  Nat Genet       Date:  2010-06-20       Impact factor: 38.330

3.  Efficiency and safety of varying the frequency of whole blood donation (INTERVAL): a randomised trial of 45 000 donors.

Authors:  Emanuele Di Angelantonio; Simon G Thompson; Stephen Kaptoge; Carmel Moore; Matthew Walker; Jane Armitage; Willem H Ouwehand; David J Roberts; John Danesh
Journal:  Lancet       Date:  2017-09-21       Impact factor: 79.321

4.  Connecting genetic risk to disease end points through the human blood plasma proteome.

Authors:  Karsten Suhre; Matthias Arnold; Aditya Mukund Bhagwat; Richard J Cotton; Rudolf Engelke; Johannes Raffler; Hina Sarwath; Gaurav Thareja; Annika Wahl; Robert Kirk DeLisle; Larry Gold; Marija Pezer; Gordan Lauc; Mohammed A El-Din Selim; Dennis O Mook-Kanamori; Eman K Al-Dous; Yasmin A Mohamoud; Joel Malek; Konstantin Strauch; Harald Grallert; Annette Peters; Gabi Kastenmüller; Christian Gieger; Johannes Graumann
Journal:  Nat Commun       Date:  2017-02-27       Impact factor: 14.919

5.  Identification of functionally connected multi-omic biomarkers for Alzheimer's disease using modularity-constrained Lasso.

Authors:  Linhui Xie; Pradeep Varathan; Kwangsik Nho; Andrew J Saykin; Paul Salama; Jingwen Yan
Journal:  PLoS One       Date:  2020-06-17       Impact factor: 3.240

6.  ASFMR1 splice variant: A predictor of fragile X-associated tremor/ataxia syndrome.

Authors:  Padmaja Vittal; Shrikant Pandya; Kevin Sharp; Elizabeth Berry-Kravis; Lili Zhou; Bichun Ouyang; Jonathan Jackson; Deborah A Hall
Journal:  Neurol Genet       Date:  2018-07-27

7.  Genetic variant predictors of gene expression provide new insight into risk of colorectal cancer.

Authors:  Stephanie A Bien; Yu-Ru Su; David V Conti; Tabitha A Harrison; Conghui Qu; Xingyi Guo; Yingchang Lu; Demetrius Albanes; Paul L Auer; Barbara L Banbury; Sonja I Berndt; Stéphane Bézieau; Hermann Brenner; Daniel D Buchanan; Bette J Caan; Peter T Campbell; Christopher S Carlson; Andrew T Chan; Jenny Chang-Claude; Sai Chen; Charles M Connolly; Douglas F Easton; Edith J M Feskens; Steven Gallinger; Graham G Giles; Marc J Gunter; Jochen Hampe; Jeroen R Huyghe; Michael Hoffmeister; Thomas J Hudson; Eric J Jacobs; Mark A Jenkins; Ellen Kampman; Hyun Min Kang; Tilman Kühn; Sébastien Küry; Flavio Lejbkowicz; Loic Le Marchand; Roger L Milne; Li Li; Christopher I Li; Annika Lindblom; Noralane M Lindor; Vicente Martín; Caroline E McNeil; Marilena Melas; Victor Moreno; Polly A Newcomb; Kenneth Offit; Paul D P Pharaoh; John D Potter; Chenxu Qu; Elio Riboli; Gad Rennert; Núria Sala; Clemens Schafmayer; Peter C Scacheri; Stephanie L Schmit; Gianluca Severi; Martha L Slattery; Joshua D Smith; Antonia Trichopoulou; Rosario Tumino; Cornelia M Ulrich; Fränzel J B van Duijnhoven; Bethany Van Guelpen; Stephanie J Weinstein; Emily White; Alicja Wolk; Michael O Woods; Anna H Wu; Goncalo R Abecasis; Graham Casey; Deborah A Nickerson; Stephen B Gruber; Li Hsu; Wei Zheng; Ulrike Peters
Journal:  Hum Genet       Date:  2019-02-28       Impact factor: 4.132

8.  Genomic atlas of the human plasma proteome.

Authors:  Benjamin B Sun; Joseph C Maranville; James E Peters; David Stacey; James R Staley; James Blackshaw; Stephen Burgess; Tao Jiang; Ellie Paige; Praveen Surendran; Clare Oliver-Williams; Mihir A Kamat; Bram P Prins; Sheri K Wilcox; Erik S Zimmerman; An Chi; Narinder Bansal; Sarah L Spain; Angela M Wood; Nicholas W Morrell; John R Bradley; Nebojsa Janjic; David J Roberts; Willem H Ouwehand; John A Todd; Nicole Soranzo; Karsten Suhre; Dirk S Paul; Caroline S Fox; Robert M Plenge; John Danesh; Heiko Runz; Adam S Butterworth
Journal:  Nature       Date:  2018-06-06       Impact factor: 49.962

9.  An atlas of genetic influences on human blood metabolites.

Authors:  So-Youn Shin; Eric B Fauman; Ann-Kristin Petersen; Jan Krumsiek; Rita Santos; Jie Huang; Matthias Arnold; Idil Erte; Vincenzo Forgetta; Tsun-Po Yang; Klaudia Walter; Cristina Menni; Lu Chen; Louella Vasquez; Ana M Valdes; Craig L Hyde; Vicky Wang; Daniel Ziemek; Phoebe Roberts; Li Xi; Elin Grundberg; Melanie Waldenberger; J Brent Richards; Robert P Mohney; Michael V Milburn; Sally L John; Jeff Trimmer; Fabian J Theis; John P Overington; Karsten Suhre; M Julia Brosnan; Christian Gieger; Gabi Kastenmüller; Tim D Spector; Nicole Soranzo
Journal:  Nat Genet       Date:  2014-05-11       Impact factor: 38.330

10.  A comparison of robust Mendelian randomization methods using summary data.

Authors:  Eric A W Slob; Stephen Burgess
Journal:  Genet Epidemiol       Date:  2020-04-06       Impact factor: 2.344

View more
  2 in total

1.  Classification of PR-positive and PR-negative subtypes in ER-positive and HER2-negative breast cancers based on pathway scores.

Authors:  Taobo Hu; Yan Chen; Yiqiang Liu; Danhua Zhang; Jiankang Pan; Mengping Long
Journal:  BMC Med Res Methodol       Date:  2021-05-22       Impact factor: 4.615

2.  Genotype-Based Gene Expression in Colon Tissue-Prediction Accuracy and Relationship with the Prognosis of Colorectal Cancer Patients.

Authors:  Heike Deutelmoser; Justo Lorenzo Bermejo; Axel Benner; Korbinian Weigl; Hanla A Park; Mariam Haffa; Esther Herpel; Martin Schneider; Cornelia M Ulrich; Michael Hoffmeister; Jenny Chang-Claude; Hermann Brenner; Dominique Scherer
Journal:  Int J Mol Sci       Date:  2020-10-31       Impact factor: 5.923

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.