Literature DB >> 34157305

Summix: A method for detecting and adjusting for population structure in genetic summary data.

Ian S Arriaga-MacKenzie1, Gregory Matesi1, Samuel Chen1, Alexandria Ronco1, Katie M Marker2, Jordan R Hall1, Ryan Scherenberg3, Mobin Khajeh-Sharafabadi4, Yinfei Wu1, Christopher R Gignoux5, Megan Null6, Audrey E Hendricks7.   

Abstract

Publicly available genetic summary data have high utility in research and the clinic, including prioritizing putative causal variants, polygenic scoring, and leveraging common controls. However, summarizing individual-level data can mask population structure, resulting in confounding, reduced power, and incorrect prioritization of putative causal variants. This limits the utility of publicly available data, especially for understudied or admixed populations where additional research and resources are most needed. Although several methods exist to estimate ancestry in individual-level data, methods to estimate ancestry proportions in summary data are lacking. Here, we present Summix, a method to efficiently deconvolute ancestry and provide ancestry-adjusted allele frequencies (AFs) from summary data. Using continental reference ancestry, African (AFR), non-Finnish European (EUR), East Asian (EAS), Indigenous American (IAM), South Asian (SAS), we obtain accurate and precise estimates (within 0.1%) for all simulation scenarios. We apply Summix to gnomAD v.2.1 exome and genome groups and subgroups, finding heterogeneous continental ancestry for several groups, including African/African American (∼84% AFR, ∼14% EUR) and American/Latinx (∼4% AFR, ∼5% EAS, ∼43% EUR, ∼46% IAM). Compared to the unadjusted gnomAD AFs, Summix's ancestry-adjusted AFs more closely match respective African and Latinx reference samples. Even on modern, dense panels of summary statistics, Summix yields results in seconds, allowing for estimation of confidence intervals via block bootstrap. Given an accompanying R package, Summix increases the utility and equity of public genetic resources, empowering novel research opportunities.
Copyright © 2021 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  allele frequency; ancestry; common controls; external controls; gnomAD; population stratification; population structure; summary

Mesh:

Year:  2021        PMID: 34157305      PMCID: PMC8322937          DOI: 10.1016/j.ajhg.2021.05.016

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  48 in total

1.  A comprehensive genetic association study of Alzheimer disease in African Americans.

Authors:  Mark W Logue; Matthew Schu; Badri N Vardarajan; Jacki Buros; Robert C Green; Rodney C P Go; Patrick Griffith; Thomas O Obisesan; Rhonna Shatz; Amy Borenstein; L Adrienne Cupples; Kathryn L Lunetta; M Daniele Fallin; Clinton T Baldwin; Lindsay A Farrer
Journal:  Arch Neurol       Date:  2011-12

2.  Human Demographic History Impacts Genetic Risk Prediction across Diverse Populations.

Authors:  Alicia R Martin; Christopher R Gignoux; Raymond K Walters; Genevieve L Wojcik; Benjamin M Neale; Simon Gravel; Mark J Daly; Carlos D Bustamante; Eimear E Kenny
Journal:  Am J Hum Genet       Date:  2017-03-30       Impact factor: 11.025

3.  Variant PADI3 in Central Centrifugal Cicatricial Alopecia.

Authors:  Liron Malki; Ofer Sarig; Maria-Teresa Romano; Marie-Claire Méchin; Alon Peled; Mor Pavlovsky; Emily Warshauer; Liat Samuelov; Laura Uwakwe; Valeria Briskin; Janan Mohamad; Andrea Gat; Ofer Isakov; Tom Rabinowitz; Noam Shomron; Noam Adir; Michel Simon; Amy McMichael; Ncoza C Dlova; Regina C Betz; Eli Sprecher
Journal:  N Engl J Med       Date:  2019-02-13       Impact factor: 91.245

4.  Rapid assessment of genetic ancestry in populations of unknown origin by genome-wide genotyping of pooled samples.

Authors:  Charleston W K Chiang; Zofia K Z Gajdos; Joshua M Korn; Finny G Kuruvilla; Johannah L Butler; Rachel Hackett; Candace Guiducci; Thutrang T Nguyen; Rainford Wilks; Terrence Forrester; Christopher A Haiman; Katherine D Henderson; Loic Le Marchand; Brian E Henderson; Mark R Palmert; Colin A McKenzie; Helen N Lyon; Richard S Cooper; Xiaofeng Zhu; Joel N Hirschhorn
Journal:  PLoS Genet       Date:  2010-03-05       Impact factor: 5.917

5.  Novel score test to increase power in association test by integrating external controls.

Authors:  Yatong Li; Seunggeun Lee
Journal:  Genet Epidemiol       Date:  2020-11-08       Impact factor: 2.344

6.  Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology.

Authors:  Sue Richards; Nazneen Aziz; Sherri Bale; David Bick; Soma Das; Julie Gastier-Foster; Wayne W Grody; Madhuri Hegde; Elaine Lyon; Elaine Spector; Karl Voelkerding; Heidi L Rehm
Journal:  Genet Med       Date:  2015-03-05       Impact factor: 8.822

7.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

8.  Genomic analysis of 21 patients with corneal neuralgia after refractive surgery.

Authors:  Jun-Hui Yuan; Betsy R Schulman; Philip R Effraim; Dib-Hajj Sulayman; Deborah S Jacobs; Stephen G Waxman
Journal:  Pain Rep       Date:  2020-07-27

9.  Fast and robust deconvolution of tumor infiltrating lymphocyte from expression profiles using least trimmed squares.

Authors:  Yuning Hao; Ming Yan; Blake R Heath; Yu L Lei; Yuying Xie
Journal:  PLoS Comput Biol       Date:  2019-05-06       Impact factor: 4.475

10.  The mutational constraint spectrum quantified from variation in 141,456 humans.

Authors:  Konrad J Karczewski; Laurent C Francioli; Grace Tiao; Beryl B Cummings; Jessica Alföldi; Qingbo Wang; Ryan L Collins; Kristen M Laricchia; Andrea Ganna; Daniel P Birnbaum; Laura D Gauthier; Harrison Brand; Matthew Solomonson; Nicholas A Watts; Daniel Rhodes; Moriel Singer-Berk; Eleina M England; Eleanor G Seaby; Jack A Kosmicki; Raymond K Walters; Katherine Tashman; Yossi Farjoun; Eric Banks; Timothy Poterba; Arcturus Wang; Cotton Seed; Nicola Whiffin; Jessica X Chong; Kaitlin E Samocha; Emma Pierce-Hoffman; Zachary Zappala; Anne H O'Donnell-Luria; Eric Vallabh Minikel; Ben Weisburd; Monkol Lek; James S Ware; Christopher Vittal; Irina M Armean; Louis Bergelson; Kristian Cibulskis; Kristen M Connolly; Miguel Covarrubias; Stacey Donnelly; Steven Ferriera; Stacey Gabriel; Jeff Gentry; Namrata Gupta; Thibault Jeandet; Diane Kaplan; Christopher Llanwarne; Ruchi Munshi; Sam Novod; Nikelle Petrillo; David Roazen; Valentin Ruano-Rubio; Andrea Saltzman; Molly Schleicher; Jose Soto; Kathleen Tibbetts; Charlotte Tolonen; Gordon Wade; Michael E Talkowski; Benjamin M Neale; Mark J Daly; Daniel G MacArthur
Journal:  Nature       Date:  2020-05-27       Impact factor: 69.504

View more
  3 in total

Review 1.  Opportunities and challenges for the use of common controls in sequencing studies.

Authors:  Genevieve L Wojcik; Jessica Murphy; Jacob L Edelson; Christopher R Gignoux; Alexander G Ioannidis; Alisa Manning; Manuel A Rivas; Steven Buyske; Audrey E Hendricks
Journal:  Nat Rev Genet       Date:  2022-05-17       Impact factor: 59.581

2.  Using the UK Biobank as a global reference of worldwide populations: application to measuring ancestry diversity from GWAS summary statistics.

Authors:  Florian Privé
Journal:  Bioinformatics       Date:  2022-05-23       Impact factor: 6.931

3.  Fully exploiting SNP arrays: a systematic review on the tools to extract underlying genomic structure.

Authors:  Laura Balagué-Dobón; Alejandro Cáceres; Juan R González
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.