Literature DB >> 27080919

An Object-Oriented Regression for Building Disease Predictive Models with Multiallelic HLA Genes.

Lue Ping Zhao1,2, Hamid Bolouri3, Michael Zhao4, Daniel E Geraghty5, Åke Lernmark6.   

Abstract

Recent genome-wide association studies confirm that human leukocyte antigen (HLA) genes have the strongest associations with several autoimmune diseases, including type 1 diabetes (T1D), providing an impetus to reduce this genetic association to practice through an HLA-based disease predictive model. However, conventional model-building methods tend to be suboptimal when predictors are highly polymorphic with many rare alleles combined with complex patterns of sequence homology within and between genes. To circumvent this challenge, we describe an alternative methodology; treating complex genotypes of HLA genes as "objects" or "exemplars," one focuses on systemic associations of disease phenotype with "objects" via similarity measurements. Conceptually, this approach assigns disease risks base on complex genotype profiles instead of specific disease-associated genotypes or alleles. Effectively, it transforms large, discrete, and sparse HLA genotypes into a matrix of similarity-based covariates. By the Kernel representative theorem and machine learning techniques, it uses a penalized likelihood method to select disease-associated exemplars in building predictive models. To illustrate this methodology, we apply it to a T1D study with eight HLA genes (HLA-DRB1, HLA-DRB3, HLA-DRB4, HLA-DRB5, HLA-DQA1, HLA-DQB1, HLA-DPA1, and HLA-DPB1) to build a predictive model. The resulted predictive model has an area under curve of 0.92 in the training set, and 0.89 in the validating set, indicating that this methodology is useful to build predictive models with complex HLA genotypes.
© 2016 WILEY PERIODICALS, INC.

Entities:  

Keywords:  generalized linear model; kernel machine; multiallelic genotypes; penalized regression; prediction; similarity regression; statistical learning

Mesh:

Substances:

Year:  2016        PMID: 27080919      PMCID: PMC4834870          DOI: 10.1002/gepi.21968

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  25 in total

1.  Powerful SNP-set analysis for case-control genome-wide association studies.

Authors:  Michael C Wu; Peter Kraft; Michael P Epstein; Deanne M Taylor; Stephen J Chanock; David J Hunter; Xihong Lin
Journal:  Am J Hum Genet       Date:  2010-06-11       Impact factor: 11.025

2.  Type 1 diabetes patients born to immigrants to Sweden increase their native diabetes risk and differ from Swedish patients in HLA types and islet autoantibodies.

Authors:  Ahmed J Delli; Bengt Lindblad; Annelie Carlsson; Gun Forsander; Sten-A Ivarsson; Johnny Ludvigsson; Claude Marcus; Ake Lernmark
Journal:  Pediatr Diabetes       Date:  2010-12       Impact factor: 4.866

Review 3.  Systems medicine: a new approach to clinical practice.

Authors:  Pablo Cardinal-Fernández; Nicolás Nin; Jesús Ruíz-Cabello; José A Lorente
Journal:  Arch Bronconeumol       Date:  2014-01-05       Impact factor: 4.872

4.  Characterization of recombination in the HLA class II region.

Authors:  M Cullen; J Noble; H Erlich; K Thorpe; S Beck; W Klitz; J Trowsdale; M Carrington
Journal:  Am J Hum Genet       Date:  1997-02       Impact factor: 11.025

Review 5.  Genetic epidemiology: a review of the statistical basis.

Authors:  E A Thompson
Journal:  Stat Med       Date:  1986 Jul-Aug       Impact factor: 2.373

6.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

Review 7.  The importance of phase information for human genomics.

Authors:  Ryan Tewhey; Vikas Bansal; Ali Torkamani; Eric J Topol; Nicholas J Schork
Journal:  Nat Rev Genet       Date:  2011-02-08       Impact factor: 53.242

8.  Strong rules for discarding predictors in lasso-type problems.

Authors:  Robert Tibshirani; Jacob Bien; Jerome Friedman; Trevor Hastie; Noah Simon; Jonathan Taylor; Ryan J Tibshirani
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2012-03       Impact factor: 4.488

9.  Risk Classification with an Adaptive Naive Bayes Kernel Machine Model.

Authors:  Jessica Minnier; Ming Yuan; Jun S Liu; Tianxi Cai
Journal:  J Am Stat Assoc       Date:  2015-04-22       Impact factor: 5.033

10.  Next-Generation Sequencing Reveals That HLA-DRB3, -DRB4, and -DRB5 May Be Associated With Islet Autoantibodies and Risk for Childhood Type 1 Diabetes.

Authors:  Lue Ping Zhao; Shehab Alshiekh; Michael Zhao; Annelie Carlsson; Helena Elding Larsson; Gun Forsander; Sten A Ivarsson; Johnny Ludvigsson; Ingrid Kockum; Claude Marcus; Martina Persson; Ulf Samuelsson; Eva Örtqvist; Chul-Woo Pyo; Wyatt C Nelson; Daniel E Geraghty; Åke Lernmark
Journal:  Diabetes       Date:  2016-01-06       Impact factor: 9.461

View more
  4 in total

Review 1.  A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases.

Authors:  I S Stafford; M Kellermann; E Mossotto; R M Beattie; B D MacArthur; S Ennis
Journal:  NPJ Digit Med       Date:  2020-03-09

Review 2.  Machine Learning and Data Mining Methods in Diabetes Research.

Authors:  Ioannis Kavakiotis; Olga Tsave; Athanasios Salifoglou; Nicos Maglaveras; Ioannis Vlahavas; Ioanna Chouvarda
Journal:  Comput Struct Biotechnol J       Date:  2017-01-08       Impact factor: 7.271

3.  Research of Epidemic Big Data Based on Improved Deep Convolutional Neural Network.

Authors:  Wendong Wang
Journal:  Comput Math Methods Med       Date:  2020-07-22       Impact factor: 2.238

Review 4.  A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases.

Authors:  I S Stafford; M Kellermann; E Mossotto; R M Beattie; B D MacArthur; S Ennis
Journal:  NPJ Digit Med       Date:  2020-03-09
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.