| Literature DB >> 33770507 |
Alicia R Martin1, Elizabeth G Atkinson2, Sinéad B Chapman3, Anne Stevenson4, Rocky E Stroud4, Tamrat Abebe5, Dickens Akena6, Melkam Alemayehu7, Fred K Ashaba8, Lukoye Atwoli9, Tera Bowers10, Lori B Chibnik11, Mark J Daly12, Timothy DeSmet10, Sheila Dodge10, Abebaw Fekadu13, Steven Ferriera10, Bizu Gelaye14, Stella Gichuru15, Wilfred E Injera16, Roxanne James17, Symon M Kariuki18, Gabriel Kigen19, Karestan C Koenen4, Edith Kwobah15, Joseph Kyebuzibwa6, Lerato Majara20, Henry Musinguzi8, Rehema M Mwema21, Benjamin M Neale2, Carter P Newman4, Charles R J C Newton18, Joseph K Pickrell22, Raj Ramesar23, Welelta Shiferaw5, Dan J Stein24, Solomon Teferra7, Celia van der Merwe25, Zukiswa Zingela26.
Abstract
Genetic studies in underrepresented populations identify disproportionate numbers of novel associations. However, most genetic studies use genotyping arrays and sequenced reference panels that best capture variation most common in European ancestry populations. To compare data generation strategies best suited for underrepresented populations, we sequenced the whole genomes of 91 individuals to high coverage as part of the Neuropsychiatric Genetics of African Population-Psychosis (NeuroGAP-Psychosis) study with participants from Ethiopia, Kenya, South Africa, and Uganda. We used a downsampling approach to evaluate the quality of two cost-effective data generation strategies, GWAS arrays versus low-coverage sequencing, by calculating the concordance of imputed variants from these technologies with those from deep whole-genome sequencing data. We show that low-coverage sequencing at a depth of ≥4× captures variants of all frequencies more accurately than all commonly used GWAS arrays investigated and at a comparable cost. Lower depths of sequencing (0.5-1×) performed comparably to commonly used low-density GWAS arrays. Low-coverage sequencing is also sensitive to novel variation; 4× sequencing detects 45% of singletons and 95% of common variants identified in high-coverage African whole genomes. Low-coverage sequencing approaches surmount the problems induced by the ascertainment of common genotyping arrays, effectively identify novel variation particularly in underrepresented populations, and present opportunities to enhance variant discovery at a cost similar to traditional approaches.Entities:
Keywords: Africa; GWAS; GWAS arrays; cost comparison; low-coverage sequencing; study design; whole-genome sequencing
Mesh:
Year: 2021 PMID: 33770507 PMCID: PMC8059370 DOI: 10.1016/j.ajhg.2021.03.012
Source DB: PubMed Journal: Am J Hum Genet ISSN: 0002-9297 Impact factor: 11.025