Literature DB >> 24955378

Next Generation Statistical Genetics: Modeling, Penalization, and Optimization in High-Dimensional Data.

Kenneth Lange1, Jeanette C Papp2, Janet S Sinsheimer3, Eric M Sobel2.   

Abstract

Statistical genetics is undergoing the same transition to big data that all branches of applied statistics are experiencing. With the advent of inexpensive DNA sequencing, the transition is only accelerating. This brief review highlights some modern techniques with recent successes in statistical genetics. These include: (a) lasso penalized regression and association mapping, (b) ethnic admixture estimation, (c) matrix completion for genotype and sequence data, (d) the fused lasso and copy number variation, (e) haplotyping, (f) estimation of relatedness, (g) variance components models, and (h) rare variant testing. For more than a century, genetics has been both a driver and beneficiary of statistical theory and practice. This symbiotic relationship will persist for the foreseeable future.

Entities:  

Keywords:  DNA sequence analysis; computational statistics; data mining; gene mapping; pedigrees

Year:  2014        PMID: 24955378      PMCID: PMC4062304          DOI: 10.1146/annurev-statistics-022513-115638

Source DB:  PubMed          Journal:  Annu Rev Stat Appl        ISSN: 2326-8298            Impact factor:   5.810


  87 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  Data mining and genetic algorithm based gene/SNP selection.

Authors:  Shital C Shah; Andrew Kusiak
Journal:  Artif Intell Med       Date:  2004-07       Impact factor: 5.326

3.  Genetic ancestry in lung-function predictions.

Authors:  Rajesh Kumar; Max A Seibold; Melinda C Aldrich; L Keoki Williams; Alex P Reiner; Laura Colangelo; Joshua Galanter; Christopher Gignoux; Donglei Hu; Saunak Sen; Shweta Choudhry; Edward L Peterson; Jose Rodriguez-Santana; William Rodriguez-Cintron; Michael A Nalls; Tennille S Leak; Ellen O'Meara; Bernd Meibohm; Stephen B Kritchevsky; Rongling Li; Tamara B Harris; Deborah A Nickerson; Myriam Fornage; Paul Enright; Elad Ziv; Lewis J Smith; Kiang Liu; Esteban González Burchard
Journal:  N Engl J Med       Date:  2010-07-07       Impact factor: 91.245

4.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

5.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

6.  ARIEL and AMELIA: testing for an accumulation of rare variants using next-generation sequencing data.

Authors:  Jennifer L Asimit; Aaron G Day-Williams; Andrew P Morris; Eleftheria Zeggini
Journal:  Hum Hered       Date:  2012-03-22       Impact factor: 0.444

7.  Population genomic and genome-wide association studies of agroclimatic traits in sorghum.

Authors:  Geoffrey P Morris; Punna Ramu; Santosh P Deshpande; C Thomas Hash; Trushar Shah; Hari D Upadhyaya; Oscar Riera-Lizarazu; Patrick J Brown; Charlotte B Acharya; Sharon E Mitchell; James Harriman; Jeffrey C Glaubitz; Edward S Buckler; Stephen Kresovich
Journal:  Proc Natl Acad Sci U S A       Date:  2012-12-24       Impact factor: 11.205

8.  Phasing of many thousands of genotyped samples.

Authors:  Amy L Williams; Nick Patterson; Joseph Glessner; Hakon Hakonarson; David Reich
Journal:  Am J Hum Genet       Date:  2012-08-10       Impact factor: 11.025

9.  A general model for the genetic analysis of pedigree data.

Authors:  R C Elston; J Stewart
Journal:  Hum Hered       Date:  1971       Impact factor: 0.444

10.  PUMA: a unified framework for penalized multiple regression analysis of GWAS data.

Authors:  Gabriel E Hoffman; Benjamin A Logsdon; Jason G Mezey
Journal:  PLoS Comput Biol       Date:  2013-06-27       Impact factor: 4.475

View more
  13 in total

1.  The use of vector bootstrapping to improve variable selection precision in Lasso models.

Authors:  Charles Laurin; Dorret Boomsma; Gitta Lubke
Journal:  Stat Appl Genet Mol Biol       Date:  2016-08-01

2.  Iterative hard thresholding for model selection in genome-wide association studies.

Authors:  Kevin L Keys; Gary K Chen; Kenneth Lange
Journal:  Genet Epidemiol       Date:  2017-09-06       Impact factor: 2.135

3.  Finding genomic function for genetic associations in nicotine addiction research: the ENCODE project's role in future pharmacogenomic analysis.

Authors:  David J Vandenbergh; Gabriel L Schlomer
Journal:  Pharmacol Biochem Behav       Date:  2014-01-31       Impact factor: 3.533

4.  Fast Genome-Wide QTL Association Mapping on Pedigree and Population Data.

Authors:  Hua Zhou; John Blangero; Thomas D Dyer; Kei-Hang K Chan; Kenneth Lange; Eric M Sobel
Journal:  Genet Epidemiol       Date:  2016-12-12       Impact factor: 2.135

5.  Matrix Completion Discriminant Analysis.

Authors:  Tong Tong Wu; Kenneth Lange
Journal:  Comput Stat Data Anal       Date:  2015-12       Impact factor: 1.681

6.  Detecting the Genomic Signature of Divergent Selection in Presence of Gene Flow.

Authors:  Ping Zeng; Ting Wang
Journal:  Curr Genomics       Date:  2015-06       Impact factor: 2.236

7.  Genetics of Coronary Artery Disease in Taiwan: A Cardiometabochip Study by the Taichi Consortium.

Authors:  Themistocles L Assimes; I-T Lee; Jyh-Ming Juang; Xiuqing Guo; Tzung-Dau Wang; Eric T Kim; Wen-Jane Lee; Devin Absher; Yen-Feng Chiu; Chih-Cheng Hsu; Lee-Ming Chuang; Thomas Quertermous; Chao A Hsiung; Jerome I Rotter; Wayne H-H Sheu; Yii-Der Ida Chen; Kent D Taylor
Journal:  PLoS One       Date:  2016-03-16       Impact factor: 3.240

8.  Genome scans for detecting footprints of local adaptation using a Bayesian factor model.

Authors:  Nicolas Duforet-Frebourg; Eric Bazin; Michael G B Blum
Journal:  Mol Biol Evol       Date:  2014-06-03       Impact factor: 16.240

Review 9.  Transforming big data into computational models for personalized medicine and health care.

Authors:  S M Reza Soroushmehr; Kayvan Najarian
Journal:  Dialogues Clin Neurosci       Date:  2016-09       Impact factor: 5.986

10.  Genetic Architecture of Primary Open-Angle Glaucoma in Individuals of African Descent: The African Descent and Glaucoma Evaluation Study III.

Authors:  Kent D Taylor; Xiuqing Guo; Linda M Zangwill; Jeffrey M Liebmann; Christopher A Girkin; Robert M Feldman; Harvey Dubiner; Yang Hai; Brian C Samuels; Joseph F Panarelli; John P Mitchell; Lama A Al-Aswad; Sung Chul Park; Celso Tello; Jeremy Cotliar; Rajendra Bansal; Paul A Sidoti; George A Cioffi; Dana Blumberg; Robert Ritch; Nicholas P Bell; Lauren S Blieden; Garvin Davis; Felipe A Medeiros; Swapan K Das; Jasmin Divers; Carl D Langefeld; Nicholette D Palmer; Barry I Freedman; Donald W Bowden; Maggie C Y Ng; Yii-Der Ida Chen; Radha Ayyagari; Jerome I Rotter; Robert N Weinreb
Journal:  Ophthalmology       Date:  2018-10-21       Impact factor: 14.277

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.