Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 PopCluster: an algorithm to identify genetic variants with ethnicity-dependent effects.

Literature DB >> 30624692

PopCluster: an algorithm to identify genetic variants with ethnicity-dependent effects.

Anastasia Gurinovich¹, Harold Bae², John J Farrell³, Stacy L Andersen³, Stefano Monti³, Annibale Puca^4,5, Gil Atzmon⁶, Nir Barzilai⁶, Thomas T Perls³, Paola Sebastiani⁷.

Abstract

MOTIVATION: Over the last decade, more diverse populations have been included in genome-wide association studies. If a genetic variant has a varying effect on a phenotype in different populations, genome-wide association studies applied to a dataset as a whole may not pinpoint such differences. It is especially important to be able to identify population-specific effects of genetic variants in studies that would eventually lead to development of diagnostic tests or drug discovery.
RESULTS: In this paper, we propose PopCluster: an algorithm to automatically discover subsets of individuals in which the genetic effects of a variant are statistically different. PopCluster provides a simple framework to directly analyze genotype data without prior knowledge of subjects' ethnicities. PopCluster combines logistic regression modeling, principal component analysis, hierarchical clustering and a recursive bottom-up tree parsing procedure. The evaluation of PopCluster suggests that the algorithm has a stable low false positive rate (∼4%) and high true positive rate (>80%) in simulations with large differences in allele frequencies between cases and controls. Application of PopCluster to data from genetic studies of longevity discovers ethnicity-dependent heterogeneity in the association of rs3764814 (USP42) with the phenotype.
AVAILABILITY AND IMPLEMENTATION: PopCluster was implemented using the R programming language, PLINK and Eigensoft software, and can be found at the following GitHub repository: https://github.com/gurinovich/PopCluster with instructions on its installation and usage. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities: Gene Mutation

Mesh：

Substances：

Year: 2019 PMID： 30624692 PMCID： PMC6735784 DOI： 10.1093/bioinformatics/btz017

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
References

39 in total

1. Clinical phenotype of families with longevity.

Authors: Gil Atzmon; Clyde Schechter; William Greiner; Deborah Davidson; Gad Rennert; Nir Barzilai
Journal: J Am Geriatr Soc Date: 2004-02 Impact factor: 5.562

2. Genetic structure of human populations.

Authors: Noah A Rosenberg; Jonathan K Pritchard; James L Weber; Howard M Cann; Kenneth K Kidd; Lev A Zhivotovsky; Marcus W Feldman
Journal: Science Date: 2002-12-20 Impact factor: 47.728

3. Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors: Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal: Genome Res Date: 2003-11 Impact factor: 9.043

4. Principal components analysis corrects for stratification in genome-wide association studies.

Authors: Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal: Nat Genet Date: 2006-07-23 Impact factor: 38.330

5. Statistical power analyses using G*Power 3.1: tests for correlation and regression analyses.

Authors: Franz Faul; Edgar Erdfelder; Axel Buchner; Albert-Georg Lang
Journal: Behav Res Methods Date: 2009-11

6. Next generation disparities in human genomics: concerns and remedies.

Authors: Anna C Need; David B Goldstein
Journal: Trends Genet Date: 2009-11 Impact factor: 11.639

7. Association study on long-living individuals from Southern Italy identifies rs10491334 in the CAMKIV gene that regulates survival proteins.

Authors: Alberto Malovini; Maddalena Illario; Guido Iaccarino; Francesco Villa; Anna Ferrario; Roberta Roncarati; Chiara Viviani Anselmi; Valeria Novelli; Ersilia Cipolletta; Eleonora Leggiero; Alessandro Orro; Maria Rosaria Rusciano; Luciano Milanesi; Angela Serena Maione; Gianluigi Condorelli; Riccardo Bellazzi; Annibale A Puca
Journal: Rejuvenation Res Date: 2011-05-25 Impact factor: 4.663

8. Apolipoprotein E (APOE) allele distribution in the world. Is APOE*4 a 'thrifty' allele?

Authors: R M Corbo; R Scacchi
Journal: Ann Hum Genet Date: 1999-07 Impact factor: 1.670

9. Health and function of participants in the Long Life Family Study: A comparison with other cohorts.

Authors: Anne B Newman; Nancy W Glynn; Christopher A Taylor; Paola Sebastiani; Thomas T Perls; Richard Mayeux; Kaare Christensen; Joseph M Zmuda; Sandra Barral; Joseph H Lee; Eleanor M Simonsick; Jeremy D Walston; Anatoli I Yashin; Evan Hadley
Journal: Aging (Albany NY) Date: 2011-01 Impact factor: 5.682

10. Clustering by genetic ancestry using genome-wide SNP data.

Authors: Nadia Solovieff; Stephen W Hartley; Clinton T Baldwin; Thomas T Perls; Martin H Steinberg; Paola Sebastiani
Journal: BMC Genet Date: 2010-12-09 Impact factor: 2.797