Literature DB >> 30624692

PopCluster: an algorithm to identify genetic variants with ethnicity-dependent effects.

Anastasia Gurinovich1, Harold Bae2, John J Farrell3, Stacy L Andersen3, Stefano Monti3, Annibale Puca4,5, Gil Atzmon6, Nir Barzilai6, Thomas T Perls3, Paola Sebastiani7.   

Abstract

MOTIVATION: Over the last decade, more diverse populations have been included in genome-wide association studies. If a genetic variant has a varying effect on a phenotype in different populations, genome-wide association studies applied to a dataset as a whole may not pinpoint such differences. It is especially important to be able to identify population-specific effects of genetic variants in studies that would eventually lead to development of diagnostic tests or drug discovery.
RESULTS: In this paper, we propose PopCluster: an algorithm to automatically discover subsets of individuals in which the genetic effects of a variant are statistically different. PopCluster provides a simple framework to directly analyze genotype data without prior knowledge of subjects' ethnicities. PopCluster combines logistic regression modeling, principal component analysis, hierarchical clustering and a recursive bottom-up tree parsing procedure. The evaluation of PopCluster suggests that the algorithm has a stable low false positive rate (∼4%) and high true positive rate (>80%) in simulations with large differences in allele frequencies between cases and controls. Application of PopCluster to data from genetic studies of longevity discovers ethnicity-dependent heterogeneity in the association of rs3764814 (USP42) with the phenotype.
AVAILABILITY AND IMPLEMENTATION: PopCluster was implemented using the R programming language, PLINK and Eigensoft software, and can be found at the following GitHub repository: https://github.com/gurinovich/PopCluster with instructions on its installation and usage. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30624692      PMCID: PMC6735784          DOI: 10.1093/bioinformatics/btz017

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  39 in total

1.  Clinical phenotype of families with longevity.

Authors:  Gil Atzmon; Clyde Schechter; William Greiner; Deborah Davidson; Gad Rennert; Nir Barzilai
Journal:  J Am Geriatr Soc       Date:  2004-02       Impact factor: 5.562

2.  Genetic structure of human populations.

Authors:  Noah A Rosenberg; Jonathan K Pritchard; James L Weber; Howard M Cann; Kenneth K Kidd; Lev A Zhivotovsky; Marcus W Feldman
Journal:  Science       Date:  2002-12-20       Impact factor: 47.728

3.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

4.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

5.  Statistical power analyses using G*Power 3.1: tests for correlation and regression analyses.

Authors:  Franz Faul; Edgar Erdfelder; Axel Buchner; Albert-Georg Lang
Journal:  Behav Res Methods       Date:  2009-11

6.  Next generation disparities in human genomics: concerns and remedies.

Authors:  Anna C Need; David B Goldstein
Journal:  Trends Genet       Date:  2009-11       Impact factor: 11.639

7.  Association study on long-living individuals from Southern Italy identifies rs10491334 in the CAMKIV gene that regulates survival proteins.

Authors:  Alberto Malovini; Maddalena Illario; Guido Iaccarino; Francesco Villa; Anna Ferrario; Roberta Roncarati; Chiara Viviani Anselmi; Valeria Novelli; Ersilia Cipolletta; Eleonora Leggiero; Alessandro Orro; Maria Rosaria Rusciano; Luciano Milanesi; Angela Serena Maione; Gianluigi Condorelli; Riccardo Bellazzi; Annibale A Puca
Journal:  Rejuvenation Res       Date:  2011-05-25       Impact factor: 4.663

8.  Apolipoprotein E (APOE) allele distribution in the world. Is APOE*4 a 'thrifty' allele?

Authors:  R M Corbo; R Scacchi
Journal:  Ann Hum Genet       Date:  1999-07       Impact factor: 1.670

9.  Health and function of participants in the Long Life Family Study: A comparison with other cohorts.

Authors:  Anne B Newman; Nancy W Glynn; Christopher A Taylor; Paola Sebastiani; Thomas T Perls; Richard Mayeux; Kaare Christensen; Joseph M Zmuda; Sandra Barral; Joseph H Lee; Eleanor M Simonsick; Jeremy D Walston; Anatoli I Yashin; Evan Hadley
Journal:  Aging (Albany NY)       Date:  2011-01       Impact factor: 5.682

10.  Clustering by genetic ancestry using genome-wide SNP data.

Authors:  Nadia Solovieff; Stephen W Hartley; Clinton T Baldwin; Thomas T Perls; Martin H Steinberg; Paola Sebastiani
Journal:  BMC Genet       Date:  2010-12-09       Impact factor: 2.797

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.