Literature DB >> 25029086

Imputing genotypes using regularized generalized linear regression models.

William W L Wong, Josh Griesman, Zeny Z Feng.   

Abstract

As genomic sequencing technologies continue to advance, researchers are furthering their understanding of the relationships between genetic variants and expressed traits. However, missing data can significantly limit the power of a genetic study. Here, the use of a regularized generalized linear model, denoted by GLMNET, is proposed to impute missing genotypes. The method aims to address certain limitations of earlier regression approaches in regards to genotype imputation, particularly the specification of the number of neighboring SNPs to be included for imputing the missing genotype. The performance of GLMNET-based method is compared to the conventional multinomial regression method and two phase-based methods: fastPHASE and BEAGLE. Two simulation scenarios are evaluated: a sparse-missing model, and a small-panel expansion model. The sparse-missing model simulates a scenario where SNPs were missing in a random fashion across the genome. In the small-panel expansion model, a set of individuals is only genotyped at a subset of the SNPs of the large panel. Each imputation method is tested in the context of two data-sets: Canadian Holstein cattle data and human HapMap CEU data. Results show that the proposed GLMNET method outperforms the other methods in the small panel expansion scenario and fastPHASE performs slightly better than the GLMNET method in the sparse-missing scenario.

Entities:  

Mesh:

Year:  2014        PMID: 25029086     DOI: 10.1515/sagmb-2012-0044

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  2 in total

1.  Marker-Based Estimates Reveal Significant Nonadditive Effects in Clonally Propagated Cassava (Manihot esculenta): Implications for the Prediction of Total Genetic Value and the Selection of Varieties.

Authors:  Marnin D Wolfe; Peter Kulakow; Ismail Y Rabbi; Jean-Luc Jannink
Journal:  G3 (Bethesda)       Date:  2016-11-08       Impact factor: 3.154

2.  Immune landscape and a promising immune prognostic model associated with TP53 in early-stage lung adenocarcinoma.

Authors:  Chengde Wu; Xiang Rao; Wei Lin
Journal:  Cancer Med       Date:  2020-12-12       Impact factor: 4.452

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.