| Literature DB >> 34917125 |
Bart Ferwerda1, Abdel Abdellaoui2, Max Nieuwdorp3,4,5, Koos Zwinderman1.
Abstract
Genetic differences between individuals underlie susceptibility to many diseases. Genome-wide association studies (GWAS) have discovered many susceptibility genes but were often limited to cohorts of predominantly European ancestry. Genetic diversity between individuals due to different ancestries and evolutionary histories shows that this approach has limitations. In order to gain a better understanding of the associated genetic variation, we need a more global genomics approach including a greater diversity. Here, we introduce the Healthy Life in an Urban Setting (HELIUS) cohort. The HELIUS cohort consists of participants living in Amsterdam, with a level of diversity that reflects the Dutch colonial and recent migration past. The current study includes 10,283 participants with genetic data available from seven groups of inhabitants, namely, Dutch, African Surinamese, South-Asian Surinamese, Turkish, Moroccan, Ghanaian, and Javanese Surinamese. First, we describe the genetic variation and admixture within the HELIUS cohort. Second, we show the challenges during imputation when having a genetically diverse cohort. Third, we conduct a body mass index (BMI) and height GWAS where we investigate the effects of a joint analysis of the entire cohort and a meta-analysis approach for the different subgroups. Finally, we construct polygenic scores for BMI and height and compare their predictive power across the different ethnic groups. Overall, we give a comprehensive overview of a genetically diverse cohort from Amsterdam. Our study emphasizes the importance of a less biased and more realistic representation of urban populations for mapping genetic associations with complex traits and disease risk for all.Entities:
Keywords: GWAS; HELIUS; admixture; genetics; multiethnic cohort
Year: 2021 PMID: 34917125 PMCID: PMC8670378 DOI: 10.3389/fgene.2021.727269
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Genetic diversity between the HELIUS Cohort populations. Ancestry was explored by merging the HELIUS cohort together with the 1000 Genome project populations. Differences were first inferred by using principal components analysis (PCA). (A) First two PCAs of the non-admixed populations grouped at their continental origin. Population ancestry was estimated using ADMIXTURE. (B) Results of a genome-wide ancestry with K = 3 or 8 ancestral populations. All HELIUS populations are displayed, and 1000G populations are marked by their continental origin (AFR = African, EUR = European, SAS = South Asian, EAS = East Asian, and AMR = Admixed America). Numbers indicate the exact 1000G populations which can be found in Supplementary Figure S3. PCA of the Surinamese HELIUS participants in (C) together with the African American (ASW) and African Caribbean (ACB) admixed 1000G populations highlighted.
FIGURE 2Comparison of imputation reference panels on HELIUS. Imputation performance was determined by filtering out 25% of the measured markers per ethnicity. After filtering out, the markers imputation was performed with the Sanger server using the 1000G and HRC imputation reference panels, Michigan imputation server with the 1000G, and TopMed imputation reference panel. The Michigan imputation cohort CAAPA was also used for imputation on the African Surinamese. Plots show the density of the percentage mismatches between imputed and measured genotypes per individual for each imputation server and reference panel per ethnicity.
HELIUS study cohort overview of genotyped participants with number of markers genotyped and imputed.
| Population/ethnicity | N | Age (years) | Gender (% female) | Array markers | TopMed | Sanger 1000G |
|---|---|---|---|---|---|---|
| Ghanaians | 480 | 48.0 ± 9.2 | 57.3 | 352,956 | 14,170,590 | 11,684,750 |
| Moroccans | 3048 | 40.8 ± 12.8 | 60.6 | 436,920 | 10,314,717 | 9,216,391 |
| Turks | 2649 | 40.6 ± 12.1 | 53,6 | 458,667 | 8,226,331 | 6,674,703 |
| Dutch | 1287 | 51.8 ± 12.6 | 50.3 | 487,008 | 8,257,774 | 8,436,030 |
| African Surinamese | 1156 | 51.7 ± 10.5 | 60.6 | 381,499 | 14,011,528 | 12,324,000 |
| South-Asian Surinamese | 1502 | 46.9 ± 13.2 | 53.9 | 406,867 | 8,136,254 | 7,973,372 |
| Javanese Surinamese | 57 | 51.1 ± 10.9 | 52.6 | 373,195 | 7,575,865 | 7,213,944 |
| HELIUS (joint) | 10283 | 44.7 ± 13.1 | 56.2 | 327,690 | 10,427,937 | 7,001,772 |
A total of 104 samples come from other than these 7 backgrounds.
Illumina GSA array resulted in 700078 genotyped markers before QC. For each population, QC included the markers present in at least 95% of the individuals, MAF below 1%, and deviation of Hardy–Weinberg equilibrium (p ≥ 0.00001).
TopMed imputed markers were filtered on SNPs only, MAF ≥1%, and R 2 ≥ 0.3.
Sanger 1000G imputed markers were filtered on SNPs only, MAF ≥0.8, and INFO ≥0.8.
FIGURE 3Height and BMI association study on the HELIUS cohort. Miami plot of the height and BMI using the TopMed (A, C) or Sanger 1000G imputation reference cohort (B, D) GWAS. Results of the entire cohort are plotted at the top panels, and the MR-MEGA meta-analysis combines the ethnic-specific GWAS results at the bottom panel. Y-axis displays the -log10 p values and the x-axis the associated marker location on each chromosome. Gray dashed lines indicate the 1.0e-5 and genome-wide significant 5.0e-8 p-value thresholds. (E) Locus zoom of the height association peak on chromosome 20 showing the associated loci UQCC1/GDF5 within the area. (F) Heritability, and standard error, bar plot for the entire or ethnic-specific cohort, and each ethnic-specific GWAS for the BMI and height. The PRS for BMI and height is displayed in (G). The y-axis represents the “best fitted” R2 for the entire cohort and each ethnicity with vertical error bars representing the bootstrapping confidence interval. The average proportion of European ancestry, based on the ADMIXTURE K = 3 calculations, for the entire cohort and per ethnicity is displayed at the x-axis. Horizontal lines were drawn between the smallest and largest measured European ancestry values for that group.