| Literature DB >> 23717212 |
Gavin Band1, Quang Si Le, Luke Jostins, Matti Pirinen, Katja Kivinen, Muminatou Jallow, Fatoumatta Sisay-Joof, Kalifa Bojang, Margaret Pinder, Giorgio Sirugo, David J Conway, Vysaul Nyirongo, David Kachala, Malcolm Molyneux, Terrie Taylor, Carolyne Ndila, Norbert Peshu, Kevin Marsh, Thomas N Williams, Daniel Alcock, Robert Andrews, Sarah Edkins, Emma Gray, Christina Hubbart, Anna Jeffreys, Kate Rowlands, Kathrin Schuldt, Taane G Clark, Kerrin S Small, Yik Ying Teo, Dominic P Kwiatkowski, Kirk A Rockett, Jeffrey C Barrett, Chris C A Spencer.
Abstract
Combining data from genome-wide association studies (GWAS) conducted at different locations, using genotype imputation and fixed-effects meta-analysis, has been a powerful approach for dissecting complex disease genetics in populations of European ancestry. Here we investigate the feasibility of applying the same approach in Africa, where genetic diversity, both within and between populations, is far more extensive. We analyse genome-wide data from approximately 5,000 individuals with severe malaria and 7,000 population controls from three different locations in Africa. Our results show that the standard approach is well powered to detect known malaria susceptibility loci when sample sizes are large, and that modern methods for association analysis can control the potential confounding effects of population structure. We show that pattern of association around the haemoglobin S allele differs substantially across populations due to differences in haplotype structure. Motivated by these observations we consider new approaches to association analysis that might prove valuable for multicentre GWAS in Africa: we relax the assumptions of SNP-based fixed effect analysis; we apply Bayesian approaches to allow for heterogeneity in the effect of an allele on risk across studies; and we introduce a region-based test to allow for heterogeneity in the location of causal alleles.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23717212 PMCID: PMC3662650 DOI: 10.1371/journal.pgen.1003509
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Per-sample imputation accuracy measured by r2 between true genotypes and genotypes predicted by imputation, averaged over imputation chunks.
Black vertical line shows typical imputation accuracy in a UK population, taken from [8]. Gambian samples (red) perform worst due to the poor coverage of African variation by the Illumina 550 K platform, followed by Kenyan samples (green) on the Illumina Omni2.5M, which while dense has limited overlap with our HapMap3 reference, with Malawian samples (yellow) performing best. Note that imputation accuracy in the Kenyan sample showed a bimodal distribution driven at least partially by ethnic ancestry (Figures S12 and S13).
Figure 2Principle components analysis.
Top left: principal components analysis (PCA) of the African populations from Hapmap 3 (LWK = Luhya in Webuye, Kenya, 90 individuals; MKK = Maasai in Kinyawa, Kenya, 143 individuals; YRI = Yoruba in Ibadan, Nigeria, 113 individuals) with 500 randomly selected control samples from each of the three study cohorts. Top right, bottom left, bottom right: PCA of all non-excluded samples in each study cohort, coloured by reported ethnic group. Ethnic group is shown as “OTHER” for groups constituting less than 5% of individuals in the cohort, or where the ethnic group was unreported.
Figure 3Patterns of association around the HBB and ABO loci.
In each figure the top three panels of the plot is the P value of the logistic regression analysis using 5 PCAs in each cohort with the fixed-effect meta-analysis P value shown in the bottom panel. Circles represent genotyped SNP and triangles imputed SNPs. SNPs in r2>0.2 with the functional SNPs in each region:rs334 in HBB which encodes the sickle locus [HbS], and rs8176719 in ABO are coloured with yellow indicating more correlation and red indicating less. Vertical lines indicate the positions of the functional SNPs.
Figure 4Comparison of fixed and structure effect Bayes factor at autosomal SNPs.
Red dots indicate SNPs mapping to the HBB region in and blue dots indicate those mapping to the ABO region (see Figure 3). Note that points below the diagonal line have more evidence for association under a structured effects model, whereas those above the line have more evidence for association under a fixed effect model.
Figure 5Evidence for association at approximately 1.3M autosomal SNPs.
The plot shows the log10 Bayes factor comparing the structured effects model (model 4; see main text) to a model of no association. Chromosomes are coloured alternatively light and dark blue. The horizontal line indicates regions with strong evidence of association (BF>1e4).