Literature DB >> 32226016

Minority-centric meta-analyses of blood lipid levels identify novel loci in the Population Architecture using Genomics and Epidemiology (PAGE) study.

Yao Hu1, Mariaelisa Graff2, Jeffrey Haessler1, Steven Buyske3, Stephanie A Bien1, Ran Tao4,5, Heather M Highland2, Katherine K Nishimura1, Niha Zubair1, Yingchang Lu6, Marie Verbanck6, Austin T Hilliard7, Derek Klarin8,9,10, Scott M Damrauer11,12,13, Yuk-Lam Ho14, Peter W F Wilson11,15, Kyong-Mi Chang12,16, Philip S Tsao17,18, Kelly Cho14, Christopher J O'Donnell14,19, Themistocles L Assimes17,18, Lauren E Petty5,20, Jennifer E Below5,20, Ozan Dikilitas21, Daniel J Schaid22, Matthew L Kosel22, Iftikhar J Kullo21, Laura J Rasmussen-Torvik23, Gail P Jarvik24, Qiping Feng25, Wei-Qi Wei25, Eric B Larson26, Frank D Mentch27, Berta Almoguera27, Patrick M Sleiman27, Laura M Raffield28, Adolfo Correa29, Lisa W Martin30, Martha Daviglus31,32, Tara C Matise3, Jose Luis Ambite33, Christopher S Carlson1, Ron Do6, Ruth J F Loos6, Lynne R Wilkens34, Loic Le Marchand34, Chris Haiman35, Daniel O Stram35, Lucia A Hindorff36, Kari E North2, Charles Kooperberg1, Iona Cheng37, Ulrike Peters1.   

Abstract

Lipid levels are important markers for the development of cardio-metabolic diseases. Although hundreds of associated loci have been identified through genetic association studies, the contribution of genetic factors to variation in lipids is not fully understood, particularly in U.S. minority groups. We performed genome-wide association analyses for four lipid traits in over 45,000 ancestrally diverse participants from the Population Architecture using Genomics and Epidemiology (PAGE) Study, followed by a meta-analysis with several European ancestry studies. We identified nine novel lipid loci, five of which showed evidence of replication in independent studies. Furthermore, we discovered one novel gene in a PrediXcan analysis, minority-specific independent signals at eight previously reported loci, and potential functional variants at two known loci through fine-mapping. Systematic examination of known lipid loci revealed smaller effect estimates in African American and Hispanic ancestry populations than those in Europeans, and better performance of polygenic risk scores based on minority-specific effect estimates. Our findings provide new insight into the genetic architecture of lipid traits and highlight the importance of conducting genetic studies in diverse populations in the era of precision medicine.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32226016      PMCID: PMC7145272          DOI: 10.1371/journal.pgen.1008684

Source DB:  PubMed          Journal:  PLoS Genet        ISSN: 1553-7390            Impact factor:   5.917


Introduction

Circulating levels of lipids such as high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), total cholesterol (TC), and triglycerides (TG), are associated with atherosclerotic cardiovascular disease, type 2 diabetes, and fatty liver disease [1-3]. Plasma lipid levels are heritable polygenic traits, with twin studies estimating narrow-sense heritability from 0.48 to 0.76 [4]. Genetic association studies have identified over 400 loci associated with lipid traits [5–10, 13]. However, the majority of these findings were based on European ancestry populations, and African American (AA), Hispanic, and other minority populations are underrepresented in these studies. Previous studies have demonstrated distinct lipid profiles in minority populations compared to Europeans, with higher HDL-C and lower TG levels in African ancestry populations and lower levels of HDL-C and TC in Hispanics [11, 12]. In addition, ancestry-specific variants at established lipid loci have been identified in AA and Hispanic ancestry populations, which were monomorphic or with extremely low minor allele frequencies (MAFs) in populations of European descent [13]. The phenotypic variance explained by the established loci is considerably lower in American minority (8.8–12.3%) than in European ancestry populations (12.9–27.8%) [7, 13], demonstrating that discovery and fine-mapping in non-European populations has fallen behind and that focused efforts in these populations are needed. The Population Architecture using Genomics and Epidemiology (PAGE) Study funded by the National Human Genome Research Institute and the National Institute on Minority Health and Health Disparities was designed to characterize the genetic architecture of complex traits among underrepresented minority populations through large-scale genetic epidemiological research [14]. As part of this initiative, we developed the Multiethnic Genotyping Array (MEGA) to improve fine-mapping and discovery by increasing variant coverage across multiple ethnicities [15]. Using this array in PAGE enabled us to perform genomic analyses to identify lipid loci that may have been missed by previous Euro-centric GWAS, and to explore the generalizability and heterogeneity of the previous findings across major U.S. ethnic groups. We also performed a meta-analysis combining PAGE results with those from the European Network for Genetic and Genomic Epidemiology (ENGAGE) Consortium [16] and other available European GWAS to search for additional novel lipid loci, and sought replication of our new findings in the Million Veteran Program (MVP) [13], the Global Hispanic Lipids Consortium, and the Electronic Medical Records and Genomics (eMERGE) Network [17], the Kaiser Permanente Research Bank [18], the Jackson Heart Study (JHS) [19], and the UK BioBank (UKBB, https://www.ukbiobank.ac.uk/).

Results

Study overview

Data on 45,698 participants were included in the minority meta-analysis with 17,641 AAs, 22,830 Hispanics, 2,387 East Asians, 1,912 Native Hawaiians, 604 Native Americans and 333 others (primarily South Asian, mixed heritage, and other racial/ethnic groups) [20]. These participants were drawn from six large well-characterized epidemiological studies: the Atherosclerosis Risk in Communities Study (ARIC), the BioME Biobank (BioMe), the Coronary Artery Risk Development in Young Adults Study (CARDIA), the Hispanic Community Health Study/Study of Latinos (HCHS/SOL), the Multiethnic Cohort Study (MEC), and the Women’s Health Initiative (WHI). An additional 22,887 European ancestry participants from ARIC, BioMe, CARDIA, and WHI with individual level data in PAGE were included in the minority plus European meta-analyses along with publicly available summary statistics from ENGAGE in over 62,000 participants (http://diagram-consortium.org/2015_ENGAGE_1KG/) [16]. Across the five major ancestral groups in PAGE, the lowest HDL-C level was observed in Native Hawaiians, the highest LDL-C level was observed in Europeans, and the highest TC and TG levels were observed in Native Americans (Table 1 and S1 Table).
Table 1

Characteristics of the ancestrally diverse populations in PAGE .

African AmericanHispanicEast AsianNative HawaiianNative AmericanOther 3European
N17,64122,8302,3781,91260433322,887
Age (years)57.9±12.653.0±14.965.6±11.464.3±8.261.0±7.947.0±14.759.0±12.6
Female (%)80.864.858.350.298.047.176.1
HDL (mg/dL)54.9±16.248.6±14.450.1±17.541.1±15.453.0±13.250.4±18.151.9±15.0
LDL (mg/dL)142.3±43.7132.5±39.6140.9±38.8144.3±36.6139.9±38.7123.3±40.9144.9±39.9
TC (mg/dL)218.5±47.5209.9±46.4218.0±42.4210.3±39.3225.0±43.6199.0±45.5224.3±44.8
ln(TG (mg/dL)) 24.60±0.504.86±0.554.79±0.534.72±0.534.95±0.534.86±0.544.80±0.53

1 Values are shown as mean±SD.

2 Triglyceride levels were natural-log transformed.

3 Primarily South Asian, mixed heritage, and other racial/ethnic groups [20].

1 Values are shown as mean±SD. 2 Triglyceride levels were natural-log transformed. 3 Primarily South Asian, mixed heritage, and other racial/ethnic groups [20].

Identification of novel loci

In the first discovery stage, we performed a minority-centric analysis that included 45,698 non-European ancestry participants in PAGE and conducted a fixed-effect inverse-variance-weighted meta-analysis (S1 Fig). We identified four novel loci for HDL-C (5q31-rs17102282, DLC1-rs11782435, ZCCHC6-rs145312881 and DDHD1-rs75405126) and one novel locus for TG (MTHFD2-rs182013227) (Table 2 and S2 Fig). These five novel loci remained genome-wide significant after adjustment for previously established variants on the same chromosome (P<5.0E-8), and none of them exhibited significant evidence of heterogeneity across studies (S2 Table). Ancestry-stratified analysis revealed which ancestral population contributed most to these novel loci (S3 Table). MTHFD2-rs182013227 is only polymorphic in AAs (MAF = 0.004). DDHD1-rs75405126 is monomorphic in AA and Hispanic populations and was mainly driven by the signal in Native Hawaiians (MAF = 0.008, P = 2.7E-7, S3 Table). In addition to the five novel loci from the minority-combined meta-analysis, we discovered one novel TC locus, PCSK1-rs903381, in the Hispanic-specific meta-analysis (Table 2 and S2 Fig).
Table 2

Novel loci identified in the discovery stage.

SNPchr:posGeneCA/NCACAF (%)(AA/HA/EA)Minoritymeta-analysisAA-specific meta-analysisHA-specific meta-analysisMinority + EuropeanCombinedreplication 1Discovery + Replication
BETAPBETAPBETAPBETAPZPZP
HDL
rs171022825:1441034085q31A/G47/25/120.0403.3E-80.0444.9E-50.0453.2E-50.0231.1E-52.30.0223.81.7E-4
rs117824358:13536115DLC1T/C8.9/13/220.0636.8E-100.0140.470.0858.8E-100.0238.6E-62.69.6E-34.14.6E-5
rs1453128819:89053469ZCCHC6A/G0.37/0/00.501.6E-80.461.1E-6----0.930.353.71.8E-4
rs75405126 214:53797383DDHD1T/C0/0/0.100.762.1E-8----0.498.8E-5-0.160.883.21.2E-3
rs1294063617:53400110HLFC/T20/24/340.0251.3E-30.0359.2E-30.0250.0250.0259.1E-94.56.6E-66.32.3E-10
LDL
rs3588235012:623129B4GALNT3G/A19/16/250.0331.2E-30.0409.9E-30.0260.0540.0331.2E-106.34.1E-108.23.0E-16
rs374791020:5528518GPCPD1G/A25/17/21-0.0282.4E-3-0.0339.4E-3-0.0260.038-0.0282.1E-8-5.62.5E-8-7.25.3E-13
TC
rs9033815:95399878PCSK1A/G3.8/0.85/00.102.6E-40.0390.170.342.8E-8---0.780.430.440.66
rs3588235012:623129B4GALNT3G/A19/16/250.0283.8E-30.0270.0780.0250.0560.0282.0E-85.88.0E-97.49.8E-14
rs19998601820:5544985GPCPD1C/A24/17/21-0.0382.0E-4-0.0364.6E-3-0.0372.8E-3-0.0381.4E-8-4.99.6E-7-6.41.5E-10
TG
rs1820132272:74612959MTHFD2C/T0.39/0/00.545.7E-100.573.6E-9----0.730.462.76.1E-3

SNP, single nucleotide polymorphism; chr, chromosome; pos, position; CA, coding allele; NCA, non-coding allele; CAF, coding allele frequency; AA, African American; HA, Hispanic-ancestry; EA, European-ancestry.

1 The combined replication results were generated through meta-analyzing all replication datasets. Ancestry-specific replication results are shown in S5 Table.

2 This variant was mainly driven by signal in Hawaiians (MAF = 0.79%, BETA = 0.95, P = 2.7E-7) and details are presented in S3 Table.

SNP, single nucleotide polymorphism; chr, chromosome; pos, position; CA, coding allele; NCA, non-coding allele; CAF, coding allele frequency; AA, African American; HA, Hispanic-ancestry; EA, European-ancestry. 1 The combined replication results were generated through meta-analyzing all replication datasets. Ancestry-specific replication results are shown in S5 Table. 2 This variant was mainly driven by signal in Hawaiians (MAF = 0.79%, BETA = 0.95, P = 2.7E-7) and details are presented in S3 Table. In the second discovery stage, we performed a minority plus European ancestry meta-analysis in over 131,000 participants (S1 Fig). We identified three additional novel loci (HLF-rs12940636 for HDL-C, B4GALNT3-rs35882350 for LDL-C and TC, and GPCPD1-rs3747910/rs199986018 for LDL-C/TC), which remained genome-wide significant after adjusting for all established variants (P<5.0E-8, Table 2 and S4 Table). The lead variants at GPCPD1 locus are highly correlated (r2 = 0.97). All these loci exhibited evidence of association in both minority and European ancestry populations, showing no significant evidence of heterogeneity across studies (S4 Table) and were common across ancestral groups (Table 2). Of the five novel loci that we identified in the PAGE minority meta-analysis, the association signals were either attenuated (5q31, DLC1 and DDHD1) or the variants were not available in European ancestry populations due to low MAF (i.e. ZCCHC6) or monomorphism (i.e. PCSK1 and MTHFD2). Next, we sought replication for our nine newly discovered loci listed in Table 2 in MVP, the Global Hispanic Lipids Consortium, eMERGE, Kaiser, JHS, and UKBB. Details about these replication studies are provided in the S1 Text. For each novel locus, study-specific results were combined through sample-size-weighted meta-analyses. Three novel loci were successfully replicated (HLF, B4GALNT3, and GPCPD1, P <5.0E-8), and two additional novel loci exhibited suggestive evidence of replication (5q31 and DLC1, P<0.05), with smaller effect estimates in the replication results (S5 Table). The failure to replicate the DDHD1 locus, which was driven by Native Hawaiian signals, may resulted from the absence of ancestry-matched replication studies.

PrediXcan

We performed a PrediXcan analysis to identify associations between lipid traits and the heritable component of gene expression (GREx) in liver, adipose tissue, and whole blood. GREx of SCN11A, which does not map to any known regions, showed significant association with TG in visceral adipose tissue (P = 1.4E-6, S6 Table). SCN11A gene encodes a member of the voltage-gated sodium channel alpha subunit and is responsible for the generation and propagation of action potentials in neurons and muscles [21]. Replication is needed to confirm this novel finding. In addition, we identified 37 genes mapped to 19 previously reported loci that exhibited significant associations with at least one of the four lipid traits (P<2.0E-6, S6 Table). Among the 19 previously reported loci, 16 of them corresponded to expected biological candidate gene(s) while long intergenic noncoding RNAs (AC067959.1 near APOB, and AP000770.1 and AP006216.11 near APOA5) and pseudogene (HNRNPA1P10 near DOCK6/ANGPTL8) reached significance at the other three loci (S6 Table). Failure to detect associations with well-established candidate genes for some known loci likely reflected, at least in part, attenuation in statistical power as a result of small reference sample sizes or from applying weights derived from primarily European reference transcriptomes to minority populations [22]. Identification of associations with genes other than the candidate genes in these regions likely reflected co-regulation of variant predictors and correlation of gene expressions.

Evaluation of previously established loci in PAGE minority ancestry populations

We focused the evaluation of 433 previously reported loci in 33,063 minority ancestry participants uniformly genotyped by the MEGA array because of its higher coverage of variants of diverse ancestral groups and better imputation quality [15]. Associations of 276 and 839 unique variants in 30 and 240 previously established loci with at least one lipid trait were identified at P≤5.0E-8 and P≤0.05 level, respectively (S7 Table). Estimation of allelic heterogeneity of the established loci through inclusion of the SNP by principal component (SNP×PC) interaction term in the main model in MEGA minority populations revealed significant heterogeneity for 31 variants at six replicated loci (CETP, TOMM40, TRIB1, BUD13, GCKR and GFOD2) after Bonferroni correction (P≤1.8E-4, 0.05/276 SNPs, S7 Table). We then evaluated overall trends of the strength of the effect estimates at all known GWAS loci reported by the Global Lipids Genetic Consortium (GLGC) [7] across AA, Hispanic, and European ancestry populations. For all four lipid traits, the effect estimates from Hispanic ancestry populations showed stronger correlation with those from European ancestry populations than the effect estimates from AA ancestry populations (Fig 1). The phenotypic variance explained by these reported variants ranged from 13.1%-23.9%, 12.5%-20.7%, 11.6%-17.0%, and 8.89%-19.6% for HDL-C, LDL-C, TC and TG, respectively, across AA, Hispanic, and European ancestry populations, with the lowest variance explained seen in AA populations (S8 Table).
Fig 1

Comparison of effect estimates across AA, Hispanic and European ancestry populations.

A total of 444 independent variants in 250 loci previously reported by GLGC were included in the comparison. (A) HDL-C; (B) LDL-C; (C) TC; (D) TG.

Comparison of effect estimates across AA, Hispanic and European ancestry populations.

A total of 444 independent variants in 250 loci previously reported by GLGC were included in the comparison. (A) HDL-C; (B) LDL-C; (C) TC; (D) TG. We explored independent signals in our minority ancestry populations at previously known loci through step-wise conditional analysis adjusting for the most significant hit at each round. We identified independent signals at 12 known loci, and eight of them harbored variants that were monomorphic in European ancestry populations (ABCA1, APOA5, CETP, LCAT, PCSK9, LDLR, APOE and TM6SF2, S9 Table). Among these eight loci, six of them harbored either missense (ABCA1-rs9282541, APOA5-rs142953140/rs147210663, LCAT-rs35673026, PCSK9-rs28362263, TM6SF2-rs142056540 and APOE-rs769455) or loss of function (LoF) variants (PCSK9-rs28362286/rs67608943, S9 Table). Next, we performed fine-mapping of previously reported loci leveraging the relatively shorter LD ranges in AA populations and using FINEMAP [23]. In each locus, the number of variants in strong LD (r2≥0.6) with the lead variant in AA participants genotyped on MEGA were calculated using European- and AA-specific LD matrices from the 1000 Genome Phase 3 data. Among the 11 reported loci that showed genome-wide significance with at least one lipid trait in MEGA AA populations, all loci showed a reduction in the number of SNPs except for APOE and LDLR associated with HDL-C (S10 Table). The most substantial refinement was observed at APOA5 locus associated with TG, where the number of variants that were highly correlated with the lead variant rs3135506 reduced from 49 (based on European-specific LD) to zero (based on AA-specific LD, S10 Table). This observation is consistent with the findings in our published paper using Metabochip data [24]. We then calculated the 99% credible sets for the loci reported by GLGC [6] using FINEMAP. We calculated the credible sets based on GLGC-only data and GLGC plus PAGE minority combined data. After adding PAGE minority participants, 67% of the known loci showed a reduced number of SNPs in the 99% credible sets or reduced length of the 99% credible sets, and 59% of the known loci exhibited over 10% reduction. The most substantially fine-mapped region was the VLDLR region, for which the number of SNPs included in the credible set was reduced from over 2000 to one (rs3780181); recently published evidence supports it as the best candidate functional variant at this locus [25]. In addition, the only SNP in the 99% credible sets at CETP and SORT1 loci after including the PAGE minority data was the top hit in the AA- and Hispanic-specific analyses (CETP-rs183130 and SORT1-rs12740374) rather than the top hit in GLGC (S11 Table). In fine-map analyses of the nine novel loci, 99% credible sets were estimated based on minority meta-analysis results using FINEMAP; the variants included in each credible set are shown in S12 Table. We further built multiple constructs of weighted polygenic risk scores (PRSs) based on previously reported loci in European ancestry populations for each lipid trait and implemented 10-fold cross validation to evaluate their performance in minority populations. Reported loci from GLGC [6] were used and three PRS constructs were evaluated: (1) PRS1 using reported variants, weighted by the reported effect estimates from GLGC; (2) PRS2 using reported variants from GLGC, weighted by effect estimates observed in PAGE; (3) PRS3 using the most significantly associated variant from PAGE at each reported locus (see Methods section), weighted by PAGE effect estimates. For all four lipid traits, PRS2 explained more variance compared to PRS1 in terms of lower residual values (S13 Table). PRS3 explained more variance of TG compared to PRS2 while no improvement was observed for HDL-C, LDL-C or TC (S13 Table).

Functional annotation of the novel loci

Bioinformatic follow-up of the novel loci listed in Table 2 was performed using a comprehensive annotation database constructed from whole genome sequence annotator (WGSA) [26] and a custom UCSC analysis hub visualizing enhancer and repressor activities, DNase I hypersensitive sites (DHS) and transcribed regions in adult liver and adipose tissue, which facilitated prioritization of putative functional genes and variants. At the HLF locus, the index SNP is located at the 3’UTR region of HLF (encodes a member of the proline and acidic-rich protein family, a subset of the bZIP transcription factors) and shows a DANN rank score of 0.90 (≥0.9, possibly deleterious) [27]. In vitro experiments demonstrated reduced cellular lipid content after knockdown of HLF [28]. Of note, the TOM1L1 gene is located ~400kb away from the index SNP, which belongs to the same family of the previously reported TOM1 gene (S3 Fig). At the B4GALNT3 locus, which was associated with both LDL-C and TC levels, an LD proxy (rs34019521, r2 = 0.85) of the lead variant overlapped with enhancer activity in liver and showed an Eigen PC phred score of 22.11 (≥17, functional) [29]. This variant was part of the 99% credible set of this locus in the fine-map analysis (S12 Table). At each of the nine novel locus, the index SNP and its proxies (r2≥0.4) with a DANN rank score ≥0.9 (deleterious), Eigen PC phred score ≥17 (functional) [29], eQTL in GTEx database [30], or overlapped with enhancer, repressor, DHS and transcribed regions are summarized in S14 Table. Visualization of novel loci using our custom UCSC analysis hub are presented in S3 Fig.

Discussion

Employing a two stage meta-analysis, first in our ancestrally diverse minority populations, then combining with populations of European descent, we identified nine novel loci, five of which showed evidence of replication in independent studies. We further identified a novel gene SCN11A, and strong candidate target genes in previously known loci using the PrediXcan approach. Independent minority-specific signals at eight previously established loci were identified in the conditional analysis using individual level data. The systematic evaluation of previously reported loci in PAGE revealed shared genetic background across ethnicities and heterogeneity of allelic effects between minority and European ancestry populations. Our findings demonstrated three benefits of performing GWAS for lipid traits in ancestrally diverse populations focusing on non-European participants. The first benefit lies in the refinement of previously established loci in terms of more accurate estimation of the effect sizes in non-Europeans. In the examination of reported loci using PRS, we demonstrated that PRSs which were weighted on minority-specific effect sizes explained more variance of all four lipid traits in our non-European populations than the ones weighted on effect sizes reported by GLGC, which focused on participants of European descent. PRSs which used the top hit at each locus from the minority-specific results and weighted on their effect sizes explained even more variance of TG level, although no improvement was observed for the other three lipid traits, which may be the result of insufficient samples sizes in our current analyses. In the comparison of effect estimates for each single variant across ancestral groups, we discovered an attenuation in the magnitude of effects, especially in AA participants, which reflected less European admixture in AA compared to Hispanic populations. The second benefit is related to the potential of pinpointing the functional variants. Our minority population enabled us to narrow in on a potential functional variant rs3135506 at the APOA5 locus associated with TG through LD-based fine-mapping. This index SNP is a missense variant (p.Ser19Trp) with a DANN rank score of 0.995 (deleterious), and exhibits suggestive evidence of association with coronary artery disease (P = 7.8E-4) [31]. In a previous exome chip analysis of Norwegian participants with a smaller sample size, this missense variant was no longer significant after accounting for the GWAS index SNP rs964184 [32]. However, we discovered that rs3135506 was much more significant than rs964184 in association with TG (P = 1.00E-23 and 1.5E-4, respectively) and the association result of rs3135506 remained almost unchanged after conditioning on rs964184 (P = 1.0E-23 and 2.6E-20 before and after conditional analysis, respectively) in our AA populations. These results reflected different LD patterns across ancestral groups (r2 = 0.02 and 0.37 in African and European ancestry population from 1000 Genome Phase 3 data, respectively) and emphasized that extra caution would be needed when generalizing results across ancestral populations. Another example is the successful refinement of the VLDLR locus, with only one variant rs3780181 in the combined 99% credible set. This variant was recently reported for regulating enhancer activity and VLDLR gene expression, supporting it as the best candidate functional variant for this signal [25]. The third benefit was the identification of ancestry-specific variants both at novel and previously reported loci. Among the five novel loci identified in the minority meta-analysis, four of them were either monomorphic (PCSK1 and MTHFD2) or had extremely low MAFs (DDHD1 and ZCCHC6) in European ancestry populations. At the DDHD1 locus, the novel association was mainly driven by signal from Native Hawaiian ancestry populations that have not been examined before. The minor allele T of rs75405126 is more common in Native Hawaiians than in Europeans (MAF = 0.008 and 0.001, respectively). It also shows fairly high MAF in Asians (0.007), and it is monomorphic in AA and Hispanic ancestry populations. A previous study has shown that expression of DDHD1 in leukocyte was correlated with plasma HDL-C level [33]. Nevertheless, further investigation and successful replication of these ancestry-specific novel loci are needed. In addition, the availability of individual level data for all minority participants in PAGE enabled us to perform more accurate conditional analyses, leading to identification of minority-specific independent signals at eight previously reported loci. These signals were monomorphic in European ancestry populations, and six of them harbored either missense or LoF variants, which could be potential targets in future pharmaceutical studies (S9 Table). These findings, that may have been missed by the European-focused GWAS, contribute to a more complete picture of the genetic architecture of lipid metabolism. Despite the heterogeneity and disparity we observed in our analyses, the evidence of the shared genetic architecture of the four lipid traits among different ancestral groups was overwhelming. Among the independent and significant variants reported by GLGC [7], 21%, 18%, 20% and 25% of them reached genome-wide significance in the minority meta-analysis for HDL-C, LDL-C, TC and TG, respectively, with consistent association directions between minority and European ancestry populations. The total sample size of our minority population was only 15% of that in the GLGC discovery stage, and undoubtedly many more loci will surpass the genome-wide significant threshold with an increased sample size. In addition, the two novel loci (B4GALNT3 and GPCPD1) we identified in the minority plus European ancestry meta-analysis, which showed successful replication, were jointly driven by signals in minority and European ancestry populations. B4GALNT3 (beta-1,4-N-acetyl-galactosaminyltransferase 3) and the previously reported GALNT2 gene both belong to the N-acetylgalactosaminyltransferases family [34]. GPCPD1 encodes glycerophosphocholine phosphodiesterase 1, and knockdown of this gene resulted in altered lipid metabolites in cells [35]. Although we represented the most ancestrally diverse minority dataset for lipid traits to date, several limitations need to be mentioned. First, the sample sizes for the Native Hawaiian and Native American ancestry populations were limited, which may have obstructed the discovery of novel loci in these two ancestral groups. Second, we were not well-powered to systematically examine the potentially different effects of lipid traits-associated loci between male and female participants in our minority ancestry populations, with previous studies in European ancestry populations reporting a global difference in SNP effects between sexes [9]. Third, we may not be able to capture potential heterogeneity within ancestral groups. For example, the Hispanic participants in the current analysis came from various regions including Central and South America, Puerto Rico, and Mexico, and were grouped together when performing the association testing. The potential heterogeneity may also contribute to the failure of replication of some novel loci even in ancestrally matched samples. Fourth, the associations of the novel loci and independent signals with lipids and related diseases need to be further explored using functional studies to gain a better understanding of the underlying mechanisms. Finally, using the European reference transcriptome may introduced bias in our PrediXcan analysis, emphasizing the need of data collection and better model construction in minority populations. In summary, we identified nine novel loci, minority-specific independent signals at eight previously established loci, and one novel gene from the PrediXcan analysis, and observed different effect estimates of the associated variants across ancestral groups, which reinforce the need to conduct genetic association studies in participants of diverse ancestral background. The findings in these currently underrepresented populations will provide new insights into the genetics of lipids and associated diseases, thus paving the road to precision medicine.

Materials and Methods

Ethics statement

All studies were approved by local Institutional Review Boards and written informed consent was obtained from each participant. The Fred Hutch Institutional Review Board approved the study with protocol number of 8071.

Study population

The PAGE study incorporated 45,698 minority participants from ARIC, BioMe, CARDIA, MEC, HCHS/SOL, and WHI with available lipid measurements, primarily AAs, Hispanics, Asians, Native Hawaiians, and Native Americans (Table 1). In addition, a total of 22,887 European ancestry participants with lipid measurements from ARIC, BioMe, CARDIA and WHI were available in PAGE. Detailed characteristics of each study are provided in S1 Table, and detailed information for each study is presented in S1 Text.

Measurement of lipid levels

HDL-C, TC, and TG levels (mg/dL) in fasting blood were measured while LDL-C levels were calculated using the Friedewald Equation. LDL-C levels were not calculated if the corresponding TG levels were greater than 400mg/dL. Lipid levels were further adjusted for medication by adding a constant based on previous publications [24] (S15 Table). If multiple medications were used, only the largest constant was applied. Participants who were pregnant at blood draw or who had fasted less than 8 hours prior to blood draw were excluded from the analysis. TG levels after adjustment for medication were natural-log transformed. Summary statistics of lipid levels in all PAGE participants and the ancestry/study-stratified results are provided in S1 Table.

Genotyping, imputation and quality control

The PAGE minority populations were genotyped using two different strategies. A total of 33,063 participants with lipids measurements (10,085 AAs, 17,751 Hispanics and 2,378 Asians, 1,912 Native Hawaiians, 604 Native Americans and 333 others) were genotyped using the MEGA array, which was specifically designed to substantially increase variant coverage across multiple ethnic groups [15]. On the MEGA array, 1,705,969 genetic variants were genotyped. Quality control (QC) filters were applied at both the individual sample and the SNP level. At the individual sample level, samples with evidence of sex discrepancy, Mendelian inconsistency, unexpected duplication/non-duplication, poor performance, DNA mixture, identity issue or restricted consent were excluded. At the SNP level, SNPs meeting the following criteria were excluded: (1) failed the Center for Inherited Disease Research (CIDR) technical filters at John Hopkins University; (2) call rate <98%; (3) discordant calls in study duplicates; (4) >1 Mendelian errors in trio and duos; (5) Hardy-Weinberg P<1E-4; (6) sex difference in allele frequency ≥0.2 for autosomes/XY; (7) sex difference in heterozygosity >0.3 for autosomes/XY; (8) positional duplicates. SNPs that passed QC were further imputed to 1000 Genomes Phase 3 data using SHAPEIT2 and IMPUTE (version 2.3.2), resulting in 39,723,562 imputed SNPs with IMPUTE info score no less than 0.4. A total of 7,556 AA, 5,079 Hispanic and 22,887 European ancestry participants with lipid measurements from ARIC, BioMe, CARDIA, MEC and WHI were previously genotyped using either Affymetrix or Illumina arrays within each individual study (S1 Text). While the QC filters vary slightly by study, similar criteria as those listed above were used including exclusion of: (1) low call rate <90%; (2) discordant calls in study duplicates; (3) >1 Mendelian errors in trio and duos; (4) Hardy-Weinberg P<1E-6; (5) sex difference in allele frequency; (7) positional duplicates; (8) ancestry outliers. The genotype data from these studies was imputed to the 1000 Genome Phase 3 panel using IMPUTE (version 2.3.2) in each study separately, and SNPs with info score less than 0.4 were excluded.

Statistical analyses

The association analysis in the discovery stage was divided into two stages, the minority meta-analysis and the minority plus European meta-analysis. In the minority meta-analysis, we combined the 33,063 participants genotyped on the MEGA array and the additional 7,556 AAs and 5,079 Hispanics genotyped on different arrays through fixed-effect inverse-variance-weighted meta-analysis in METAL [36]. Before the minority meta-analysis, all participants genotyped on the MEGA array were pooled together for association testing, with adjustment for age, sex, study, self-identified ethnicity as a proxy for cultural background, center, household membership, and the first 10 PCs. Lipid levels were inverse-normally transformed by sex. For the additional 7,556 AAs and 5,079 Hispanics, association analyses were performed in each study separately, with adjustment for age, sex and the first 10 PCs. Lipid levels were inverse-normally transformed in each study/genotyping array by sex. All association analyses were performed using SUGEN, which implements a generalized estimating equation (GEE) method and accounts for relatedness [37]. SNPs with effective sample size (effN, effN = 2×MAF×(1-MAF)×N×info, where MAF was the minor allele frequency, N was the sample size and info was the IMPUTE2 info score) less than 30 in MEGA, or less than 5 in the individual studies were excluded before meta-analysis. In the minority plus European ancestry meta-analysis, we combined 45,698 PAGE minority ancestry participants, 22,887 PAGE European ancestry participants, and publicly available association summary statistics from the ENGAGE Consortium through fixed-effect inverse-variance-weighted meta-analysis in METAL, reaching a total sample size of over 131,000. Sample-size-weighted meta-analyses were also performed; the results are shown in S16 Table. SNPs that were only available in one study were excluded after meta-analysis. Conditional analyses adjusting for previously established loci were performed to determine the independency of the novel loci and to explore residual signals as well. The previously established lipids-associated loci list was hand-curated integrating SNPs indexed in the GWAS Catalog (Access date: September 23, 2018) or identified through non-GWAS arrays with covered part of the genome (metabochip or exomechip) [5–7, 9]. Reported SNPs with P<0.05 in our meta-analysis on each chromosome were adjusted in the model to achieve an efficient conditional analysis. All conditional analyses in PAGE were performed using individual level data by SUGEN while conditional analysis in ENGAGE was performed using summary statistics by GCTA-COJO [38]. Since all participants in ENGAGE were of European descent, the LD matrix was estimated from 9,345 European ancestry participants from the ARIC study that were available in PAGE. Novel loci were defined as those that fulfilled all of the three criteria: (1) the lead SNP reached genome-wide significance (P<5.0E-8) in both the marginal and conditional analysis; (2) the lead SNP was located more than 500kb of any previously established loci; (3) the lead SNP had at least one neighboring SNP (within ±500kb) showing suggestive genome-wide significance (P<1.0E-5). In the replication stage, summary statistics for the nine novel loci were extracted from MVP, the Hispanic GWAS meta-analysis, eMERGE, Kaiser, JHS and UKBB, respectively, in ancestry-stratified and ancestry-combined (if applicable) manners. A proxy variant (rs537734545, r2 = 1) of the MTHFD2-rs186278890 was used in MVP. All replication studies used the similar trait transformation strategy as in the discovery studies except for the Kaiser study (HDL was square-root-transformed, TG was log-transformed, and LDL and TC were not transformed) [18], and sample-size-weighted meta-analyses were performed to combine the summary statistics from each study. Details of each replication study are presented in S1 Text. When examining the potential heterogeneity of previous known loci in PAGE minority ancestry participants, interaction analyses were performed by including SNP×PC terms for all the first 10 PCs in the models. Models with and without the interaction terms were compared using the F-statistic, and an overall SNP×PC interaction P value (P) for each known variant was estimated, which indicated whether the additional variance explained by the interaction terms was statistically significant and represented effect modification driven by genetic ancestry [20]. The P values for all previously reported variants are shown in S7 Table. The PRSs were constructed by combining the lipid trait-increasing allele counts of the associated variants (reported by GLGC [6]) weighted by the corresponding effect sizes of each allele. In PRS3, the most significant variants in PAGE at each GLGC reported locus (±500kb) were used. Full list of used variants in the PRSs are shown in S17 Table. Ten-fold cross validation was implemented to estimate the trait variance explained by PRS1, PRS2 and PRS3 using four linear models: (1) model 0 which included all covariates in the association analysis (age, sex, study, self-identified ethnicity, center, household membership and the first 10 PCs); (2) model 1 which included PRS1 in additional to all covariates in model 0; (3) model 2 which included PRS2 in addition to all covariates in model 0; (4) model 3 which included PRS3 in addition to all covariates in model 0. The residual values of the four models were used to determine whether there was improvement in estimating lipid levels using different scores. Explained phenotypic variance for each genetic variant was calculated using the equation below [39]. Explained phenotypic variance = In the fine-map analyses for previously GLGC reported loci [6] using FINEMAP [23], we first generated the posterior probability (PP) of each variant within ±1Mb regions of the top hits in GLGC, PAGE AA and PAGE Hispanic participants. Only variants that were available in both GLGC and PAGE minority data were included in the analysis. The number of causal variants in each region was set to one in FINEMAP. With a single number of casual variants in each region the results of FINEMAP do not depend on the reference LD. We then constructed the 99% credible set for each reported locus in GLGC, PAGE AA and PAGE Hispanic participants, respectively. The combined PP of each variant were calculated by multiplying PP from each ancestral group and rescaling it based on the sum of the PPs for all variants [(PP_i, AA × PP_i, HA × PP_i, GLGC) / sum_i(PP_i, AA × PP_i, HA × PP_i, GLGC), where i refers to the locus, and the AA and HA results are from PAGE]. We used this approach instead of meta-analyzing these three groups because the sample size of European ancestry population from GLGC (N = 188,577) is overwhelming compared to the ones of AA (N = 17,641) and Hispanic (N = 22,830) ancestry populations in PAGE. The meta-analysis results would be driven mainly by GLGC results, which in turn introduces bias in the fine-map analysis. The approach we used took into account both the association P value and the sample size of each group. In the fine-map analyses of the nine novel loci, the LD matrix was estimated using the ethnicity that drove the significance of the locus. In particular, LD estimates for 5q31 and DDHD1 were generated using all minority ancestry participants genotyped on the MEGA array (N = 51,520) and Native Hawaiian participants genotyped on the MEGA array (N = 3,944), respectively. LD estimations of HLF, B4GALNT3 and GPCPD1 were generated by combining LD information from all minority ancestry participants genotyped by MEGA array (N = 51,520) and all available European participants from ARIC in PAGE (N = 9,345) through a sample-size-weighted approach. For DLC1 and PCSK1 loci, which were driven by Hispanic signals, and ZCCHC6 and MTHFD2 loci, which were driven by AA signals, LD estimations were generated using all Hispanic (N = 22,244) and AA (N = 17,324) participants genotyped on the MEGA array, respectively.

Bioinformatic functional follow-up and PrediXcan analysis

Bioinformatic functional follow-up was performed for each novel locus using our comprehensive functional annotation database and a custom UCSC analysis data hub. In the PrediXcan analysis, we focused on adipose tissue, liver and whole blood, which are closely linked to lipid metabolism. There were 8,271, 6,594, 3,355 and 6,298 genes included in the models in subcutaneous adipose tissue, visceral adipose tissue, liver and whole blood, respectively, and genes with P<2.04E-6 [0.05/(8271+6594+3355+6298)] were considered as significant. Details are presented in S1 Text.

QQ plots of the meta-analyses.

(A) HDL for the minority meta-analysis; (B) LDL for the minority meta-analysis; (C) TC for the minority meta-analysis; (D) TG for the minority meta-analysis; (E) TC for the Hispanic-specific meta-analysis; (F) HDL for the minority plus European meta-analysis; (G) LDL for the minority plus European meta-analysis; (H) TC for the minority plus European meta-analysis; (I) TG for the minority plus European meta-analysis. (DOCX) Click here for additional data file.

Locuszoom plots for the nine novel loci.

Genetic coordinates are displayed along the x-axis (hg19) and genome-wide association significance level is plotted against the y-axis as -log10(P value). LD is indicated by color scale in relationship to the most significant SNP (colored as purple diamond) in each association (red: r2≥0.8, orange: 0.6≤r2<0.8, green: 0.4≤r2<0.6, blue: 0.2≤r2<0.4, navy: r2<0.2). (A) 5q31 for HDL; (B) DLC1 for HDL; (C) ZCCHC6 for HDL; (D) DDHD1 for HDL; (E) HLF for HDL; (F) B4GALNT3 for LDL; (G) GPCPD1 for LDL; (H) PCSK1 for TC; (I) B4GALNT3 for TC; (J) GPCPD1 for TC; (K) MTHFD2 for TG. (DOCX) Click here for additional data file.

Functional annotation of the nine novel loci.

The top hit at each locus was colored purple, and the SNPs showing r2≥0.8, 0.8>r2≥0.6, 0.6>r2≥0.4 and 0.4>r2≥0.2 were colored red, orange, green and blue, respectively. (A) 5q31; (B) DLC1; (C) ZCCHC6; (D) DDHD1; (E) HLF; (F) B4GALNT3; (G) GPCPD1; (H) PCSK1; (I) MTHFD2. (DOCX) Click here for additional data file.

Characteristics of study samples in PAGE MEGA and non-MEGA studies.

(XLSX) Click here for additional data file.

Study-specific and conditional results of the novel loci identified in the minority meta-analysis.

(XLSX) Click here for additional data file.

Ethnic-specific results of the novel loci identified in the minority meta-analysis.

(XLSX) Click here for additional data file.

Study-specific and conditional results of the novel loci identified in the minority plus European meta-analysis.

(XLSX) Click here for additional data file.

Replication of the nine novel loci.

(XLSX) Click here for additional data file.

Significant gene-lipid associations identified in the PrediXcan analysis.

(XLSX) Click here for additional data file.

Results of previously reported variants in PAGE minorities genotyped on MEGA.

(XLSX) Click here for additional data file.

Phenotypic variance explained by the GLGC reported variants.

(XLSX) Click here for additional data file.

Independent signals at previously established loci.

(XLSX) Click here for additional data file.

Fine-map of reported loci using AA-specific LD.

(XLSX) Click here for additional data file.

Fine-map of the GLGC reported loci using FINEMAP.

(XLSX) Click here for additional data file.

Fine-map of the nine novel loci using FINEMAP.

(XLSX) Click here for additional data file.

Performance of the PRSs for each lipid trait.

(XLSX) Click here for additional data file.

Novel loci annotation summary.

(XLSX) Click here for additional data file.

Medication adjustment of lipid levels.

(XLSX) Click here for additional data file.

Sample-size-weighted meta-analysis results for the nine novel loci.

(XLSX) Click here for additional data file.

SNPs used in building the 3 constructs of PRSs.

(XLSX) Click here for additional data file.

Supplementary methods.

(DOCX) Click here for additional data file. 8 Jul 2019 Dear Dr Hu, Thank you very much for submitting your Research Article entitled 'Minority-centric meta-analyses of blood lipid levels identify novel loci in the Population Architecture using Genomics and Epidemiology (PAGE) Study' to PLOS Genetics. The manuscript was fully evaluated at the editorial level and by an independent peer reviewer. In general, PLOS Genetics seeks input from 3 or more peer reviewers, but we have been unable to secure additional reviewer comments despite many attempts to do so. The reviewer's comments have been assessed and considered by members of the editorial board; we agree with and fully support those comments, and have therefore decided to proceed with the single review in hand. As you will see, the reviewer (and the editors) appreciate the attention to an important problem, but note that there are some substantial concerns about the current manuscript. We will not be able to accept this version of the manuscript, but we would be willing to review again a much-revised version. Many of the concerns will be straightforward to address, but please note that the concern regarding replication (point 2) is especially important and will need to be fully addressed for a revised manuscript to move forward. Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org. If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist. To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see our guidelines. Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission. While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool.  PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process. To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder. [LINK] We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions. Yours sincerely, Gregory Barsh Editor-in-Chief PLOS Genetics Gregory Copenhaver Editor-in-Chief PLOS Genetics Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: Hu et al. present results of GWAS analyses of lipid profiles in ethnic minorities in the PAGE Consortium. They identified novel loci, some of which are driven by one ancestry, and/or are rare or monomorphic in Europeans. Some of the novel loci were replicated in additional published ethnic-specific and trans-ethnic studies. Allelic effects on lipids were not heterogeneous across ethnicities. Loci were interrogated for potential causal genes and fine-mapped to localise potential causal variants. The study is very timely, and the focus on the genetics of lipids in minority groups is extremely important. The messages about fine-mapping and use of multi-ethnic risk scores are very important to the human genetics research community. However, I have some concerns over the analytical approaches used. 1. Trait transformations. The authors state that lipid levels were inverse-normally transformed by sex. It was not clear to me whether this meant inverse-rank normalisation, or a Z-score transformation. In either case, this means that the effect estimates obtained for each study are NOT on the same scale, so it was not clear to me that a fixed-effects meta-analysis with inverse-variance weighting of effect sizes was appropriate, and that a Stouffer fixed-effects meta-analysis would be more appropriate. There are also no details (that I could find) on trait transformations in the replication sets, but again the same concern remains about effect size differences between studies. 2. Lack of replication. For several of the novel loci, there is very weak (if any) evidence of replication, even in ethnically matched components. I couldn’t find the sample sizes of the replication sets anywhere in the manuscript, but I’m assuming that they are substantially larger than PAGE? What is the explanation for this lack of replication – without this, I find it difficult to accept these associations as real. 3. Conditional analyses. These were performed using individual level data in PAGE, and approximate conditioning with GCTA in ENGAGE, but what was the reference used for LD. 4. The authors mention testing for heterogeneity in effects due to ethnicity by testing for interaction with PCs in PAGE, but I couldn’t find any information in the methods about this, nor any plots of PCs to demonstrate that this was an appropriate analysis/interpretation. Overall, I found the methods very sparse, and much more detail is needed to allow others to repeat experiments. 5. FINEMAP analyses. My understanding from reading the methods is that FINEMAP with first applied within each ethnic group, and then the results combined across ethnic groups. However, there are no details provided for the LD reference used for these analyses, and the parameter settings used, such as number of causal variants in a region. It is also not clear to me that combining the posterior probabilities across ethnicities as described is appropriate, so some justification of this approach is needed. 6. MTHFD2 locus. The authors highlight a SNP in strong LD with the index SNP, but it si not clear in which population group this LD is calculated? It would be more sensible to quote the relative posterior probability from the fine-mapping analysis, compared to the lead SNP – is it even in the 99% credible set? Similarly, I found listing SNPs that happened to have some interesting annotation that have r2>0.2 with an index SNP unimportant – again, it is not clear in which population the LD has been measured, or the relative posterior probabilities. 7. Table 2. It would be useful to have: MAF for each ethnic-specific analysis; the final combined meta-analysis p-value across discovery and replication; p-values to two significant figures throughout. ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No 5 Dec 2019 Submitted filename: PAGE_lipids_rebuttal.docx Click here for additional data file. 23 Jan 2020 * Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. * Dear Dr Hu, Thank you very much for submitting your Research Article entitled 'Minority-centric meta-analyses of blood lipid levels identify novel loci in the Population Architecture using Genomics and Epidemiology (PAGE) Study' to PLOS Genetics. Your manuscript was fully evaluated at the editorial level and by independent peer reviewers. The revised manuscript was seen by the original reviewer. As you will see, their comments are generally positive, but there are some remaining concerns that we ask you address in a hopefully final round of minor revision. We therefore ask you to modify the manuscript according to the review recommendations before we can consider your manuscript for acceptance. Your revisions should address the specific points made by each reviewer. In addition we ask that you: 1) Provide a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. 2) Upload a Striking Image with a corresponding caption to accompany your manuscript if one is available (either a new image or an existing one from within your manuscript). If this image is judged to be suitable, it may be featured on our website. Images should ideally be high resolution, eye-catching, single panel square images. For examples, please browse our archive. If your image is from someone other than yourself, please ensure that the artist has read and agreed to the terms and conditions of the Creative Commons Attribution License. Note: we cannot publish copyrighted images. We hope to receive your revised manuscript within the next 30 days. If you anticipate any delay in its return, we would ask you to let us know the expected resubmission date by email to plosgenetics@plos.org. If present, accompanying reviewer attachments should be included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist. While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission. PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process. To resubmit, you will need to go to the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder. [LINK] Please let us know if you have any questions while making these revisions. Yours sincerely, Gregory Barsh Editor-in-Chief PLOS Genetics Gregory Copenhaver Editor-in-Chief PLOS Genetics Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: The authors have addressed most of my comments, and I believe the manuscript has been improved. However, I have a couple of outstanding comments... 1. I had not realised that the FINEMAP analysis had been performed assuming a single causal variant at each locus. Given this assumption, it is not clear why the authors have run FINEMAP, since this has been designed to allow for multiple causal variants at the locus. With a single causal variants, I believe that the LD reference is irrelevant, since no joint models with multiple SNPs are fitted. In theory, the association summary statistics for each ethnicity could just be used to calculate a Bayes' factor and posterior probability for each SNP. I think the methods should also state that the posterior probability calculation across ethnicities assumes that there is a single causal variant at the locus, and that this variant is the same in each ethnic group. 2. Heterogeneity assessment using SNP x PC interaction effect. I like this way of assessing heterogeneity due to ancestry, but wasn't clear why 10 PCs were used. Please provide justification. Including 10 PCs (if they are not needed) will substantially reduce power to detect heterogeneity (since an extra degree of freedom is needed for each PC in the test). 3. Tables still do not have p-values presented to two significant figures - Table 2 has a mixture of two and three significant figures (e.g. 4.89E-5 is three significant figures, 0.47 is two significant figures). I think all tables (including supplementary could do with proof reading) - e.g. in ST16, the causal allele frequency of rs11782435 is 2022%? ********** Have all data underlying the figures and results presented in the manuscript been provided? Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information. Reviewer #1: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No 10 Feb 2020 Submitted filename: PAGE_lipids_rebuttal_R2.docx Click here for additional data file. 19 Feb 2020 Dear Dr Hu, We are pleased to inform you that your manuscript entitled "Minority-centric meta-analyses of blood lipid levels identify novel loci in the Population Architecture using Genomics and Epidemiology (PAGE) Study" has been editorially accepted for publication in PLOS Genetics. Congratulations! Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional accept, but your manuscript will not be scheduled for publication until the required changes have been made. Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org. In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field.  This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. If you have a press-related query, or would like to know about one way to make your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date. Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics! Yours sincerely, Gregory S. Barsh Editor-in-Chief PLOS Genetics Gregory Copenhaver Editor-in-Chief PLOS Genetics www.plosgenetics.org Twitter: @PLOSGenetics ---------------------------------------------------- Comments from the reviewers (if applicable): ---------------------------------------------------- Data Deposition If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website. The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-19-00861R2 More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support. Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present. ---------------------------------------------------- Press Queries If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org. 11 Mar 2020 PGENETICS-D-19-00861R2 Minority-centric meta-analyses of blood lipid levels identify novel loci in the Population Architecture using Genomics and Epidemiology (PAGE) Study Dear Dr Hu, We are pleased to inform you that your manuscript entitled "Minority-centric meta-analyses of blood lipid levels identify novel loci in the Population Architecture using Genomics and Epidemiology (PAGE) Study" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work! With kind regards, Matt Lyles PLOS Genetics On behalf of: The PLOS Genetics Team Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom plosgenetics@plos.org | +44 (0) 1223-442823 plosgenetics.org | Twitter: @PLOSGenetics
  38 in total

1.  Identifying leukocyte gene expression patterns associated with plasma lipid levels in human subjects.

Authors:  Jun Ma; Adam A Dempsey; Dimitri Stamatiou; K Wayne Marshall; Choong-Chin Liew
Journal:  Atherosclerosis       Date:  2006-06-30       Impact factor: 5.162

2.  Dietary and plasma lipid, lipoprotein, and apolipoprotein profiles among elderly Hispanics and non-Hispanics and their association with diabetes.

Authors:  Odilia I Bermudez; Wanda Velez-Carrasco; Ernst J Schaefer; Katherine L Tucker
Journal:  Am J Clin Nutr       Date:  2002-12       Impact factor: 7.045

3.  Major lipids, apolipoproteins, and risk of vascular disease.

Authors:  Emanuele Di Angelantonio; Nadeem Sarwar; Philip Perry; Stephen Kaptoge; Kausik K Ray; Alexander Thompson; Angela M Wood; Sarah Lewington; Naveed Sattar; Chris J Packard; Rory Collins; Simon G Thompson; John Danesh
Journal:  JAMA       Date:  2009-11-11       Impact factor: 56.272

4.  A de novo gain-of-function mutation in SCN11A causes loss of pain perception.

Authors:  Enrico Leipold; Lutz Liebmann; G Christoph Korenke; Theresa Heinrich; Sebastian Giesselmann; Jonathan Baets; Matthias Ebbinghaus; R Oliver Goral; Tommy Stödberg; J Christopher Hennings; Markus Bergmann; Janine Altmüller; Holger Thiele; Andrea Wetzel; Peter Nürnberg; Vincent Timmerman; Peter De Jonghe; Robert Blum; Hans-Georg Schaible; Joachim Weis; Stefan H Heinemann; Christian A Hübner; Ingo Kurth
Journal:  Nat Genet       Date:  2013-09-15       Impact factor: 38.330

5.  Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits.

Authors:  Jian Yang; Teresa Ferreira; Andrew P Morris; Sarah E Medland; Pamela A F Madden; Andrew C Heath; Nicholas G Martin; Grant W Montgomery; Michael N Weedon; Ruth J Loos; Timothy M Frayling; Mark I McCarthy; Joel N Hirschhorn; Michael E Goddard; Peter M Visscher
Journal:  Nat Genet       Date:  2012-03-18       Impact factor: 38.330

6.  Genetic predisposition to dyslipidemia and type 2 diabetes risk in two prospective cohorts.

Authors:  Qibin Qi; Liming Liang; Alessandro Doria; Frank B Hu; Lu Qi
Journal:  Diabetes       Date:  2012-02-07       Impact factor: 9.461

7.  A spectral approach integrating functional genomic annotations for coding and noncoding variants.

Authors:  Iuliana Ionita-Laza; Kenneth McCallum; Bin Xu; Joseph D Buxbaum
Journal:  Nat Genet       Date:  2016-01-04       Impact factor: 38.330

8.  Strategies for Enriching Variant Coverage in Candidate Disease Loci on a Multiethnic Genotyping Array.

Authors:  Stephanie A Bien; Genevieve L Wojcik; Niha Zubair; Christopher R Gignoux; Alicia R Martin; Jonathan M Kocarnik; Lisa W Martin; Steven Buyske; Jeffrey Haessler; Ryan W Walker; Iona Cheng; Mariaelisa Graff; Lucy Xia; Nora Franceschini; Tara Matise; Regina James; Lucia Hindorff; Loic Le Marchand; Kari E North; Christopher A Haiman; Ulrike Peters; Ruth J F Loos; Charles L Kooperberg; Carlos D Bustamante; Eimear E Kenny; Christopher S Carlson
Journal:  PLoS One       Date:  2016-12-14       Impact factor: 3.240

9.  Genetic differences between the determinants of lipid profile phenotypes in African and European Americans: the Jackson Heart Study.

Authors:  Rahul C Deo; David Reich; Arti Tandon; Ermeg Akylbekova; Nick Patterson; Alicja Waliszewska; Sekar Kathiresan; Daniel Sarpong; Herman A Taylor; James G Wilson
Journal:  PLoS Genet       Date:  2009-01-16       Impact factor: 5.917

10.  FINEMAP: efficient variable selection using summary data from genome-wide association studies.

Authors:  Christian Benner; Chris C A Spencer; Aki S Havulinna; Veikko Salomaa; Samuli Ripatti; Matti Pirinen
Journal:  Bioinformatics       Date:  2016-01-14       Impact factor: 6.937

View more
  7 in total

1.  MendelVar: gene prioritisation at GWAS loci using phenotypic enrichment of mendelian disease genes.

Authors:  Sobczyk M K; Gaunt T R; Paternoster L
Journal:  Bioinformatics       Date:  2021-01-16       Impact factor: 6.937

2.  Multi-Omic Approaches to Identify Genetic Factors in Metabolic Syndrome.

Authors:  Karen C Clark; Anne E Kwitek
Journal:  Compr Physiol       Date:  2021-12-29       Impact factor: 8.915

3.  MendelVar: gene prioritization at GWAS loci using phenotypic enrichment of Mendelian disease genes.

Authors:  M K Sobczyk; T R Gaunt; L Paternoster
Journal:  Bioinformatics       Date:  2021-04-09       Impact factor: 6.937

4.  Whole genome sequence analysis of blood lipid levels in >66,000 individuals.

Authors:  Margaret Sunitha Selvaraj; Xihao Li; Zilin Li; Akhil Pampana; David Y Zhang; Joseph Park; Stella Aslibekyan; Joshua C Bis; Jennifer A Brody; Brian E Cade; Lee-Ming Chuang; Ren-Hua Chung; Joanne E Curran; Lisa de Las Fuentes; Paul S de Vries; Ravindranath Duggirala; Barry I Freedman; Mariaelisa Graff; Xiuqing Guo; Nancy Heard-Costa; Bertha Hidalgo; Chii-Min Hwu; Marguerite R Irvin; Tanika N Kelly; Brian G Kral; Leslie Lange; Xiaohui Li; Martin Lisa; Steven A Lubitz; Ani W Manichaikul; Preuss Michael; May E Montasser; Alanna C Morrison; Take Naseri; Jeffrey R O'Connell; Nicholette D Palmer; Patricia A Peyser; Muagututia S Reupena; Jennifer A Smith; Xiao Sun; Kent D Taylor; Russell P Tracy; Michael Y Tsai; Zhe Wang; Yuxuan Wang; Wei Bao; John T Wilkins; Lisa R Yanek; Wei Zhao; Donna K Arnett; John Blangero; Eric Boerwinkle; Donald W Bowden; Yii-Der Ida Chen; Adolfo Correa; L Adrienne Cupples; Susan K Dutcher; Patrick T Ellinor; Myriam Fornage; Stacey Gabriel; Soren Germer; Richard Gibbs; Jiang He; Robert C Kaplan; Sharon L R Kardia; Ryan Kim; Charles Kooperberg; Ruth J F Loos; Karine A Viaud-Martinez; Rasika A Mathias; Stephen T McGarvey; Braxton D Mitchell; Deborah Nickerson; Kari E North; Bruce M Psaty; Susan Redline; Alexander P Reiner; Ramachandran S Vasan; Stephen S Rich; Cristen Willer; Jerome I Rotter; Daniel J Rader; Xihong Lin; Gina M Peloso; Pradeep Natarajan
Journal:  Nat Commun       Date:  2022-10-11       Impact factor: 17.694

Review 5.  The Genetic Basis of Hypertriglyceridemia.

Authors:  Germán D Carrasquilla; Malene Revsbech Christiansen; Tuomas O Kilpeläinen
Journal:  Curr Atheroscler Rep       Date:  2021-06-19       Impact factor: 5.113

6.  Genotype-Based Gene Expression in Colon Tissue-Prediction Accuracy and Relationship with the Prognosis of Colorectal Cancer Patients.

Authors:  Heike Deutelmoser; Justo Lorenzo Bermejo; Axel Benner; Korbinian Weigl; Hanla A Park; Mariam Haffa; Esther Herpel; Martin Schneider; Cornelia M Ulrich; Michael Hoffmeister; Jenny Chang-Claude; Hermann Brenner; Dominique Scherer
Journal:  Int J Mol Sci       Date:  2020-10-31       Impact factor: 5.923

7.  Genetic underpinnings of regional adiposity distribution in African Americans: Assessments from the Jackson Heart Study.

Authors:  Mohammad Y Anwar; Laura M Raffield; Leslie A Lange; Adolfo Correa; Kira C Taylor
Journal:  PLoS One       Date:  2021-08-04       Impact factor: 3.240

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.