Literature DB >> 23060615

A high-performance computing toolset for relatedness and principal component analysis of SNP data.

Xiuwen Zheng1, David Levine, Jess Shen, Stephanie M Gogarten, Cathy Laurie, Bruce S Weir.   

Abstract

Genome-wide association studies are widely used to investigate the genetic basis of diseases and traits, but they pose many computational challenges. We developed gdsfmt and SNPRelate (R packages for multi-core symmetric multiprocessing computer architectures) to accelerate two key computations on SNP data: principal component analysis (PCA) and relatedness analysis using identity-by-descent measures. The kernels of our algorithms are written in C/C++ and highly optimized. Benchmarks show the uniprocessor implementations of PCA and identity-by-descent are ∼8-50 times faster than the implementations provided in the popular EIGENSTRAT (v3.0) and PLINK (v1.07) programs, respectively, and can be sped up to 30-300-fold by using eight cores. SNPRelate can analyse tens of thousands of samples with millions of SNPs. For example, our package was used to perform PCA on 55 324 subjects from the 'Gene-Environment Association Studies' consortium studies.

Mesh:

Year:  2012        PMID: 23060615      PMCID: PMC3519454          DOI: 10.1093/bioinformatics/bts606

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  12 in total

1.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

2.  A unified association analysis approach for family and unrelated samples correcting for stratification.

Authors:  Xiaofeng Zhu; Shengchao Li; Richard S Cooper; Robert C Elston
Journal:  Am J Hum Genet       Date:  2008-02       Impact factor: 11.025

3.  PLINK: a tool set for whole-genome association and population-based linkage analyses.

Authors:  Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham
Journal:  Am J Hum Genet       Date:  2007-07-25       Impact factor: 11.025

4.  GWASTools: an R/Bioconductor package for quality control and analysis of genome-wide association studies.

Authors:  Stephanie M Gogarten; Tushar Bhangale; Matthew P Conomos; Cecelia A Laurie; Caitlin P McHugh; Ian Painter; Xiuwen Zheng; David R Crosslin; David Levine; Thomas Lumley; Sarah C Nelson; Kenneth Rice; Jess Shen; Rohit Swarnkar; Bruce S Weir; Cathy C Laurie
Journal:  Bioinformatics       Date:  2012-10-10       Impact factor: 6.937

Review 5.  New approaches to population stratification in genome-wide association studies.

Authors:  Alkes L Price; Noah A Zaitlen; David Reich; Nick Patterson
Journal:  Nat Rev Genet       Date:  2010-07       Impact factor: 53.242

6.  Quality control and quality assurance in genotypic data for genome-wide association studies.

Authors:  Cathy C Laurie; Kimberly F Doheny; Daniel B Mirel; Elizabeth W Pugh; Laura J Bierut; Tushar Bhangale; Frederick Boehm; Neil E Caporaso; Marilyn C Cornelis; Howard J Edenberg; Stacy B Gabriel; Emily L Harris; Frank B Hu; Kevin B Jacobs; Peter Kraft; Maria Teresa Landi; Thomas Lumley; Teri A Manolio; Caitlin McHugh; Ian Painter; Justin Paschall; John P Rice; Kenneth M Rice; Xiuwen Zheng; Bruce S Weir
Journal:  Genet Epidemiol       Date:  2010-09       Impact factor: 2.135

7.  A map of human genome variation from population-scale sequencing.

Authors:  Gonçalo R Abecasis; David Altshuler; Adam Auton; Lisa D Brooks; Richard M Durbin; Richard A Gibbs; Matt E Hurles; Gil A McVean
Journal:  Nature       Date:  2010-10-28       Impact factor: 49.962

8.  The Gene, Environment Association Studies consortium (GENEVA): maximizing the knowledge obtained from GWAS by collaboration across studies of multiple conditions.

Authors:  Marilyn C Cornelis; Arpana Agrawal; John W Cole; Nadia N Hansel; Kathleen C Barnes; Terri H Beaty; Siiri N Bennett; Laura J Bierut; Eric Boerwinkle; Kimberly F Doheny; Bjarke Feenstra; Eleanor Feingold; Myriam Fornage; Christopher A Haiman; Emily L Harris; M Geoffrey Hayes; John A Heit; Frank B Hu; Jae H Kang; Cathy C Laurie; Hua Ling; Teri A Manolio; Mary L Marazita; Rasika A Mathias; Daniel B Mirel; Justin Paschall; Louis R Pasquale; Elizabeth W Pugh; John P Rice; Jenna Udren; Rob M van Dam; Xiaojing Wang; Janey L Wiggs; Kayleen Williams; Kai Yu
Journal:  Genet Epidemiol       Date:  2010-05       Impact factor: 2.135

9.  Case-control association testing in the presence of unknown relationships.

Authors:  Yoonha Choi; Ellen M Wijsman; Bruce S Weir
Journal:  Genet Epidemiol       Date:  2009-12       Impact factor: 2.135

10.  The variant call format and VCFtools.

Authors:  Petr Danecek; Adam Auton; Goncalo Abecasis; Cornelis A Albers; Eric Banks; Mark A DePristo; Robert E Handsaker; Gerton Lunter; Gabor T Marth; Stephen T Sherry; Gilean McVean; Richard Durbin
Journal:  Bioinformatics       Date:  2011-06-07       Impact factor: 6.937

View more
  664 in total

1.  Genetic Basis of Maize Resistance to Multiple Insect Pests: Integrated Genome-Wide Comparative Mapping and Candidate Gene Prioritization.

Authors:  A Badji; D B Kwemoi; L Machida; D Okii; N Mwila; S Agbahoungba; F Kumi; A Ibanda; A Bararyenya; M Solemanegy; T Odong; P Wasswa; M Otim; G Asea; M Ochwo-Ssemakula; H Talwana; S Kyamanywa; P Rubaihayo
Journal:  Genes (Basel)       Date:  2020-06-24       Impact factor: 4.096

2.  Genetic Diversity and Association Studies in US Hispanic/Latino Populations: Applications in the Hispanic Community Health Study/Study of Latinos.

Authors:  Matthew P Conomos; Cecelia A Laurie; Adrienne M Stilp; Stephanie M Gogarten; Caitlin P McHugh; Sarah C Nelson; Tamar Sofer; Lindsay Fernández-Rhodes; Anne E Justice; Mariaelisa Graff; Kristin L Young; Amanda A Seyerle; Christy L Avery; Kent D Taylor; Jerome I Rotter; Gregory A Talavera; Martha L Daviglus; Sylvia Wassertheil-Smoller; Neil Schneiderman; Gerardo Heiss; Robert C Kaplan; Nora Franceschini; Alex P Reiner; John R Shaffer; R Graham Barr; Kathleen F Kerr; Sharon R Browning; Brian L Browning; Bruce S Weir; M Larissa Avilés-Santa; George J Papanicolaou; Thomas Lumley; Adam A Szpiro; Kari E North; Ken Rice; Timothy A Thornton; Cathy C Laurie
Journal:  Am J Hum Genet       Date:  2016-01-07       Impact factor: 11.025

3.  Relatedness predicts multiple measures of investment in cooperative nest construction in sociable weavers.

Authors:  Gavin M Leighton; Sebastian Echeverri; Dirk Heinrich; Holger Kolberg
Journal:  Behav Ecol Sociobiol       Date:  2015-08-29       Impact factor: 2.980

4.  The (in)famous GWAS P-value threshold revisited and updated for low-frequency variants.

Authors:  João Fadista; Alisa K Manning; Jose C Florez; Leif Groop
Journal:  Eur J Hum Genet       Date:  2016-01-06       Impact factor: 4.246

5.  Using an Alzheimer Disease Polygenic Risk Score to Predict Memory Decline in Black and White Americans Over 14 Years of Follow-up.

Authors:  Jessica R Marden; Elizabeth R Mayeda; Stefan Walter; Alexandre Vivot; Eric J Tchetgen Tchetgen; Ichiro Kawachi; M Maria Glymour
Journal:  Alzheimer Dis Assoc Disord       Date:  2016 Jul-Sep       Impact factor: 2.703

6.  Inferring the demographic history of Japanese cedar, Cryptomeria japonica, using amplicon sequencing.

Authors:  Natsuki Moriguchi; Kentaro Uchiyama; Ryutaro Miyagi; Etsuko Moritsuka; Aya Takahashi; Koichiro Tamura; Yoshihiko Tsumura; Kosuke M Teshima; Hidenori Tachida; Junko Kusumi
Journal:  Heredity (Edinb)       Date:  2019-02-26       Impact factor: 3.821

7.  Genomic analysis of MHC-based mate choice in the monogamous California mouse.

Authors:  Jesyka Meléndez-Rosa; Ke Bi; Eileen A Lacey
Journal:  Behav Ecol       Date:  2018-07-12       Impact factor: 2.671

8.  Next-generation sequencing of AV nodal reentrant tachycardia patients identifies broad spectrum of variants in ion channel genes.

Authors:  Laura Andreasen; Gustav Ahlberg; Chuyi Tang; Charlotte Andreasen; Jacob P Hartmann; Jacob Tfelt-Hansen; Elijah R Behr; Steen Pehrson; Stig Haunsø; Peter E Weeke; Thomas Jespersen; Morten S Olesen; Jesper H Svendsen
Journal:  Eur J Hum Genet       Date:  2018-02-02       Impact factor: 4.246

9.  Genome-Wide Analysis of SNPs Is Consistent with No Domestic Dog Ancestry in the Endangered Mexican Wolf (Canis lupus baileyi).

Authors:  Robert R Fitak; Sarah E Rinkevich; Melanie Culver
Journal:  J Hered       Date:  2018-05-11       Impact factor: 2.645

10.  Genomic analyses in African populations identify novel risk loci for cleft palate.

Authors:  Azeez Butali; Peter A Mossey; Wasiu L Adeyemo; Mekonen A Eshete; Lord J J Gowans; Tamara D Busch; Deepti Jain; Wenjie Yu; Liu Huan; Cecelia A Laurie; Cathy C Laurie; Sarah Nelson; Mary Li; Pedro A Sanchez-Lara; William P Magee; Kathleen S Magee; Allyn Auslander; Frederick Brindopke; Denise M Kay; Michele Caggana; Paul A Romitti; James L Mills; Rosemary Audu; Chika Onwuamah; Ganiyu O Oseni; Arwa Owais; Olutayo James; Peter B Olaitan; Babatunde S Aregbesola; Ramat O Braimah; Fadekemi O Oginni; Ayodeji O Oladele; Saidu A Bello; Jennifer Rhodes; Rita Shiang; Peter Donkor; Solomon Obiri-Yeboah; Fareed Kow Nanse Arthur; Peter Twumasi; Pius Agbenorku; Gyikua Plange-Rhule; Alexander Acheampong Oti; Olugbenga M Ogunlewe; Afisu A Oladega; Adegbayi A Adekunle; Akinwunmi O Erinoso; Olatunbosun O Adamson; Abosede A Elufowoju; Oluwanifemi I Ayelomi; Taiye Hailu; Abiye Hailu; Yohannes Demissie; Miliard Derebew; Steve Eliason; Miguel Romero-Bustillous; Cynthia Lo; James Park; Shaan Desai; Muiawa Mohammed; Firke Abate; Lukman O Abdur-Rahman; Deepti Anand; Irfaan Saadi; Abimibola V Oladugba; Salil A Lachke; Brad A Amendt; Charles N Rotimi; Mary L Marazita; Robert A Cornell; Jeffrey C Murray; Adebowale A Adeyemo
Journal:  Hum Mol Genet       Date:  2019-03-15       Impact factor: 6.150

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.