Literature DB >> 36138484

Detecting associated genes for complex traits shared across East Asian and European populations under the framework of composite null hypothesis testing.

Jiahao Qiao1, Zhonghe Shao1, Yuxuan Wu1, Ping Zeng2,3,4,5,6, Ting Wang7.   

Abstract

BACKGROUND: Detecting trans-ethnic common associated genetic loci can offer important insights into shared genetic components underlying complex diseases/traits across diverse continental populations. However, effective statistical methods for such a goal are currently lacking.
METHODS: By leveraging summary statistics available from global-scale genome-wide association studies, we herein proposed a novel genetic overlap detection method called CONTO (COmposite Null hypothesis test for Trans-ethnic genetic Overlap) from the perspective of high-dimensional composite null hypothesis testing. Unlike previous studies which generally analyzed individual genetic variants, CONTO is a gene-centric method which focuses on a set of genetic variants located within a gene simultaneously and assesses their joint significance with the trait of interest. By borrowing the similar principle of joint significance test (JST), CONTO takes the maximum P value of multiple associations as the significance measurement.
RESULTS: Compared to JST which is often overly conservative, CONTO is improved in two aspects, including the construction of three-component mixture null distribution and the adjustment of trans-ethnic genetic correlation. Consequently, CONTO corrects the conservativeness of JST with well-calibrated P values and is much more powerful validated by extensive simulation studies. We applied CONTO to discover common associated genes for 31 complex diseases/traits between the East Asian and European populations, and identified many shared trait-associated genes that had otherwise been missed by JST. We further revealed that population-common genes were generally more evolutionarily conserved than population-specific or null ones.
CONCLUSION: Overall, CONTO represents a powerful method for detecting common associated genes across diverse ancestral groups; our results provide important implications on the transferability of GWAS discoveries in one population to others.
© 2022. The Author(s).

Entities:  

Keywords:  CONTO; Composite null hypothesis testing; Gene-centric association analysis; Genome-wide association study; Joint significance test; Summary statistics; Trans-ethnic genetic overlap

Mesh:

Year:  2022        PMID: 36138484      PMCID: PMC9503281          DOI: 10.1186/s12967-022-03637-8

Source DB:  PubMed          Journal:  J Transl Med        ISSN: 1479-5876            Impact factor:   8.440


Background

The past decade has witnessed great success of global-scale genome-wide association studies (GWASs) in discovering a large number of single nucleotide polymorphisms (SNPs) associated with many diseases and traits in diverse ancestries (e.g., blood lipids [1], prostate cancer [2], blood pressure [3, 4], type II diabetes [5, 6], glycemic traits [7], and schizophrenia [8]). These discoveries dramatically revolutionize our biological knowledge regarding genetic basis underlying complex phenotypes [9-11], and pave the way toward novel molecular therapeutic strategies for complex diseases and disorders [12-16]. One of the important findings of trans-ethnic GWASs is that a few of association signals identified in one population are found to be replicated in others in the sense that trait-associated genes exhibit consistently statistical association and effect direction in multiple populations [12, 14, 17–21], indicating that the same trait shares similar genetic components across diverse continental populations [22-28]. Indeed, current GWASs have sought to identify trait-associated genetic loci in the hope that discoveries in one population are likely transferred to others. However, given the population genetic differentiation among various ancestral groups worldwide [14, 29–31], the extent to which discovered associations can be generalized across populations is not completely known. We herein aim to formally investigate a central problem in population genetics using widespread summary statistics data publicly available from large-scale GWASs in different populations: are significant genes associated with a complex trait in one population also statistically related to the same trait in another population? Understanding of shared genetic foundation for traits across diverse populations has profound implications from both statistical and practical perspectives. First, it helps improve power for trans-ethnic analysis by integrating multiple studies from various ethnicities [11, 32–35], increase accuracy of trans-ethnic genetic risk prediction in racial and ethnic minorities [28, 36], and enhance resolution in fine-mapping causal genes in various populations [37, 38]. Second, it offers additional insight into biological mechanism underlying complex diseases and helps measure the extent of interaction between genetic and environmental influences on disease risk [14]. Third, it holds the key to examine the transferability of associations discovered from current medical genomics researches which focus primarily on European (EUR) descent to other populations; that is, it is greatly of interest to examine whether the identified associations also hold in other populations and even discover more trait-related loci with higher power by leveraging genetic similarity across populations [39-41]. Fourth, biologically, replicating trait-associated genes across various ancestral groups is crucial for identifying truly causal genes as genetic loci that are simultaneously related to the trait in various populations much more likely contain important causal variants. To assess the common genetic component underlying traits across distinct populations, novel trans-ethnic genetic correlation has been proposed using only summary statistics data [26, 28, 42]. Conceptually, trans-ethnic genetic correlation quantifies the extent to which a set of SNPs exert the same or similar effects on phenotypic variation in various ancestral groups. Although such correlation provides an overall insight into genetic foundation of the trait shared between populations, it cannot characterize detailed association pattern for individual genetic loci or genes. Moreover, a near-zero estimate of trans-ethnic genetic correlation does not necessarily indicate the absence of genetic overlap because mixed trans-ethnic genetic correlations in both positive and negative directions might dilute the overall estimate. The standard GWAS conventionally considers only significant genetic loci to examine whether they could be consistently discovered across ethnicities [19], which however ignores many significantly insignificant ones but with weak effects and thus might lead to biased conclusions. In addition, trans-ethnic meta-analysis is widely used [11, 32–35], whereas the determined associations might be present only in a single population. To our knowledge, only few statistical methods have been currently developed to identify trans-ethnic trait-associated genes shared across the entire genome. To fill this knowledge gap, in the present study we propose a novel gene-centric genetic overlap detection method called CONTO. Unlike previous studies which analyzed individual SNPs [19, 28, 43], we instead focus on a set of SNPs located within a gene simultaneously and assess their joint significance with the trait of interest. From a statistical perspective, we observe that the identification of population-common genes across the whole genome can be effectively handled under the high-dimensional framework of composite null hypothesis testing by borrowing the idea of joint significance test (JST). Methodologically, JST employs the maximum P value of multiple associations as the significance measurement [44] and can be equivalently expressed as a combination of three disjoint component null hypotheses [45]. However, JST is often overly conservative because it depends on the 0–1 uniform distribution as its null distribution, which fails to consider the nature of composite null hypothesis test [46]. We make two key improvements of CONTO relative to JST. First, it constructs three-component mixture null distribution by taking the nature of composite null hypothesis test into account [47]. Second, it generates decorrelated test statistics to explain the trans-ethnic genetic correlation, which ultimately leads to well-calibrated P value for significance evaluation. Consequently, CONTO corrects the conservativeness of JST and is expected to more powerful than JST, which is validated through a wide range of simulation scenarios. We finally applied CONTO to detect population-common genes for 31 complex traits between the East Asian (EAS) and EUR populations. We identified many shared trait-associated genes that had otherwise been missed by JST. We also revealed that population-common genes were generally more evolutionarily conserved than population-specific ones.

Methods

Gene-set association method

Let the marginal Z score and P value of a gene for the analyzed trait in the EAS and EUR populations to be Z1 and Z2, and P1 and P2, respectively. These gene-level summary statistics can be easily to obtain with SNP-level association results publicly released by GWASs [48-51]. Therefore, we first aggregate multiple association signals at the SNP level into a single association signal at the gene level [52]. To this aim, we employ a powerful gene-set based association method called MAGMA [53], which is efficiently conducted via user-friendly software. Afterwards, the P value for each gene is obtained in both populations, which is immediately converted into Z score. The direction of Z score is determined by the sign of the summation of the product of effect sizes and MAFs across all SNPs of that gene [54, 55]. These gene-level summary statistics would be taken as inputs to measure the evidence of gene association with the trait in the two populations.

Trans-ethnic genetic overlap test under the composite null hypothesis framework

Our primary objective is to examine whether a particular trait-associated gene identified in one population is also significant in another population throughout the entire genome. The trans-ethnic genetic overlap can be defined in terms of distinct types of summary statistics. For example, the alternative hypothesis implies that both |Z1| and |Z2| are larger than a pre-assigned threshold value or that both P1 and P2 are less than a given significance level [say α; in our analysis we sought to control false discovery rate (FDR)]. This alternative hypothesis corresponds to three null hypotheses: (i) H00: the gene is not associated with the trait in either population; (ii) H10: the gene is associated with the trait in the first population but not the second; (iii) H01: the gene is associated with the trait in the second population but not the first. Formally, if defining the hypothesis test according to P values, we have Under this framework we intend to identify shared trait-associated genes in both populations from the viewpoint of composite null hypothesis testing.

CONTO: composite null hypothesis test for trans-ethnic genetic overlap

Like JST, we take Pmax = max (P1, P2) as our test statistic for the detection of trans-ethnic genetic overlap. However, in contrast to JST which uses the 0–1 uniform distribution as its null distribution, we directly build the null distribution of Pmax to correct the conservativeness of JST by borrowing the idea given in [47], which was proposed under the context of high-dimensional epigenetic mediation analysis [45]. Specifically, we estimate the proportions of the three sub-null hypotheses and fit a mixture null distribution for Pmaxwhere p01 is the power of rejecting P2 ≤ α under H01 and p10 is the power of rejecting P1 ≤ α under H10, both of which are estimated via the Grenander method [56]; λ01, λ10, and λ00 are the proportions for the three sub-null null hypotheses, all of which are calculated with well-established methods for estimating FDR [57, 58]; see more details in Additional file 1. It needs to highlight that the proposed method above implicitly assumes that the two P values are uncorrelated with each other. Although this condition is guaranteed by the sequential negligibility assumption in the mediation analysis [45, 59, 60], such independence does not necessarily hold in trans-ethnic genetic overlap test because of pervasive cross-population genetic correlation [26, 28, 42], which could cause inflated false discoveries than expected if not properly handled. Therefore, when implementing our method, we first decorrelate test statistics for each gene across populations by multiplying Z scores by the inverse of a correlation matrix. The cross-population correlation coefficient is calculated with Z scores of null genes (e.g., those with P1 > 0.05 and P2 > 0.05) [61, 62]. The uncorrelated Z scores can be in turn transformed into two-sided P values based on the normal approximation. Theoretically, this decorrelation strategy maximizes the transformed test statistics and the original ones [63]; therefore, it has the minimal influence on identifying shared associations. We refer to the above method as CONTO. The code for implementing CONTO is freely available at https://github.com/biostatpzeng/CONTO.

Simulation studies and real data applications

Simulation settings

We here implemented simulation studies to evaluate the performance of CONTO. Because it is conducted with only summary-level data, we thus directly sampled two sets of Z scores from a given multivariate normal (MVN) distribution under various scenarios. Specifically, for a gene in the first population, we generated its Z scores randomly from MVN((0, 0), Λ) under H00 with a probability λ00, or from MVN((τ10, 0), Λ) under H10 with a probability λ10, or from MVN((τ01, 0), Λ) under H01 with a probability λ01, or from MVN((τ11, 0), Λ) under H11 with a probability λ11. For the same gene in the second population, we drew its Z scores at random from MVN((0, 0), Λ) under H00 with a probability π00, or from MVN((0, 0), Λ) under H10 with a probability π10, or from MVN((0, τ01), Λ) under H01 with a probability λ01, or from MVN((0, τ11), Λ) under H11 with a probability λ11. The magnitude of τ10 (or τ01 and τ11) measures the strength of association, with larger value indicating stronger association signal. For simplicity, we set τ10 = τ01 = τ11 = 2, 3 or 4, and the total number of genes to 10000, 15000, or 20000. After obtaining Z scores, we transformed them into P values based on the normal approximation. In our simulation, we set Λ to be a two-dimensional identify matrix. We considered three various probability settings with λ11 ≠ 0 to evaluate FDR control and power: (i) λ00 = 0.40, λ10 = 0.20, λ01 = 0.20, and λ11 = 0.20, constructing a highly polygenic but less overlapped genetic architecture, in which 40% genes were related to the trait in each population. and approximately 33.3% of associated genes were shared across populations; (ii) λ00 = 0.80, λ10 = 0.05, λ01 = 0.05, and λ11 = 0.10, building a less polygenic and moderately overlapped genetic architecture, in which 15% genes were related to the trait in each population and approximately 50% of associated genes were shared across populations; (iii) λ00 = 0.90, λ10 = 0.01, λ01 = 0.01, and λ11 = 0.08, generating a sparse but highly overlapped genetic architecture, in which 9% genes were related to the trait in each population, but approximately 80% of associated genes were shared by the trait across populations. Note that, to a great extent, these simulation parameters were selected based on our results of real data applications (see below). Besides CONTO, for comparison we also carried out three other composite null methods (Additional file 1), including JST [44], joint significance composite-null test (JT-comp) [64], and divide-aggregate composite-null test (DACT) [65]. We repeated 103 times for each simulation setting and displayed the average across these replicates for these methods.

Summary statistics of 31 complex diseases from the EAS and EUR populations

We applied these methods to 31 complex traits of EAS-only or EUR-only individuals available from distinct GWAS consortia (Table 1 and Additional file 1: Table S1). These traits were analyzed in our previous work and more detailed descriptions regarding them can be found therein and in respective original paper [28, 62]. We downloaded summary statistics of these traits and performed stringent quality control in both populations for each trait: (i) removed SNPs without rs label; (ii) filtered out non-biallelic SNPs and those with strand-ambiguous alleles; (iii) deleted SNPs whose alleles did not match with those in the 1000 Genomes Project [66]; (iv) excluded duplicated SNPs and those with inconsistent alleles between EAS and EUR populations; (v) kept only common SNPs (MAF  > 1%) which were shared in the two populations; (vi) removed SNPs located within the major histocompatibility complex region because of its complicated LD structure.
Table 1

Number of associated SNPs discovered by JST and CONTO for traits in the EAS and EUR populations

traitJST (f11)CONTOtraitJST (f11)CONTO
f10f01f11f10f01f11
SCZ21018657eGFR7122205312
RA2754887ANM287100127
T2D293115310824PLT26129429625
COA1914112111RBC1958568438
AOA22195753MCV35130388782
PCA2714485HCT403315281
BMI9511027291MCH25120316726
Height6982284551544MCHC13623125224
DBP330802130HGB347303182
SBP844643252MONO404250151
PP572315122NEUT440158135
HDL113520305EO402259173
LDL742218166BASO32529101
TC1016422204LYMPH29116378
TG424114109WBC6912149259
HbA1c56473496

f10 and f01 are the number of identified genes that were only associated with the trait in the EAS or EUR population, respectively, and f11 is the number of shared associated genes in both populations

SCZ schizophrenia, RA rheumatoid arthritis, T2D type 2 diabetes, COA childhood-onset asthma, AOA adult-onset asthma, PCA prostate cancer, BMI body mass index, DBP diastolic blood pressure, SBP systolic blood pressure, PP pulse pressure, HDL high density lipoprotein cholesterol, LDL low density lipoprotein cholesterol, TC total cholesterol, TG triglyceride, HbA1c hemoglobin A1c, eGFR estimated glomerular filtration rate, ANM age at natural (non-surgical) menopause, PLT platelet count, RBC red blood cell count, MVC mean corpuscular volume, HCT hematocrit, MCH mean corpuscular hemoglobin, MCHC mean corpuscular hemoglobin concentration, BASO basophil count, LYMPH lymphocyte count, WBC white blood cell count

Number of associated SNPs discovered by JST and CONTO for traits in the EAS and EUR populations f10 and f01 are the number of identified genes that were only associated with the trait in the EAS or EUR population, respectively, and f11 is the number of shared associated genes in both populations SCZ schizophrenia, RA rheumatoid arthritis, T2D type 2 diabetes, COA childhood-onset asthma, AOA adult-onset asthma, PCA prostate cancer, BMI body mass index, DBP diastolic blood pressure, SBP systolic blood pressure, PP pulse pressure, HDL high density lipoprotein cholesterol, LDL low density lipoprotein cholesterol, TC total cholesterol, TG triglyceride, HbA1c hemoglobin A1c, eGFR estimated glomerular filtration rate, ANM age at natural (non-surgical) menopause, PLT platelet count, RBC red blood cell count, MVC mean corpuscular volume, HCT hematocrit, MCH mean corpuscular hemoglobin, MCHC mean corpuscular hemoglobin concentration, BASO basophil count, LYMPH lymphocyte count, WBC white blood cell count After quality control, we implemented MAGMA with genotypes of 504 EAS or 503 EUR individuals from the 1000 Genomes Project as the reference panel. We defined the set of cis-SNPs for a specific gene in terms of the annotation file provided by VIGAS [67]. The P value and Z score for each gene of traits were thus available. To handle possible residual influence of population stratification, family structures and cryptic relatedness [68-71], we further conducted genomic control for the gene-based association results of MAGMA if an inflation in these gene-level test statistics was observed (indicated by the inflation factor  > 1.05). We took the resulting P values or Z scores as input to implement JST, JT-comp, DACT and CONTO for detecting trait-associated genes shared across the EAS and EUR populations. Afterwards, for each trait we could classify these genes into three groups: (i) null genes which were not associated with the trait in either population; (ii) population-specific genes that were related to the trait in the EAS or EUR population; (iii) population-common genes that were shared across the two populations. To understand the characteristics of these genes in distinct groups, we used several conservation scores to examine the extent to which a particular gene varied across populations, which included phyloP score [72], phastCons score [73], and dN/dS ratio [74]. Specifically, higher phyloP or phastCons score indicates more conservativeness, while smaller dN/dS ratio represents higher conservativeness. We obtained these scores from [75], and compared the average scores across all genes of these traits in the three groups described above using the Friedman F test method.

Results

Estimated false discovery rate and statistical power

We first assessed whether these methods could correctly control FDR at a given level. Here, we primarily focused on the results obtained under the setting that the number of genes was set to 15000 (Fig. 1). First, it is shown that JS-com could lead to inflated control of FDR under the polygenic and less overlapped case regardless of the magnitude of association signals (Fig. 1A). JS-com could maintain an efficient FDR control if the genetic architecture was less polygenic or sparse, but moderately or highly overlapped, especially when the association evidence was weak (e.g., τ10 = τ01 = τ11 = 2) (Fig. 1B–C). However, as the increase of association evidence the estimated FDR of JS-com became inflated, which was particularly evident when the trait had a less polygenic but highly overlapped genetic architecture (Fig. 1B–C). This finding was consistent with that observed in previous mediation analysis literature because the assumption of weak association signal was violated in JS-com as the increase in τ10, τ01 and τ11 [64, 65].
Fig. 1

Estimated false discovery rate under the simulation settings: A λ00 = 0.40, λ10 = 0.20, λ01 = 0.20, and λ11 = 0.2; B λ00 = 0.80, λ10 = 0.05, λ01 = 0.05, and λ11 = 0.10, and C λ00 = 0.90, λ10 = 0.01, λ01 = 0.01, and λ11 = 0.08. Here, the number of genes was set to 15000, and the false discovery rate was calculated as the proportion of non-overlapped associated genes among all identified ones

Estimated false discovery rate under the simulation settings: A λ00 = 0.40, λ10 = 0.20, λ01 = 0.20, and λ11 = 0.2; B λ00 = 0.80, λ10 = 0.05, λ01 = 0.05, and λ11 = 0.10, and C λ00 = 0.90, λ10 = 0.01, λ01 = 0.01, and λ11 = 0.08. Here, the number of genes was set to 15000, and the false discovery rate was calculated as the proportion of non-overlapped associated genes among all identified ones Second, DACT always resulted in overestimated FDR in our simulation scenarios; however, the inflation seemed to be less obvious as the genetic impact became strong if the genetic architecture of the trait was less polygenic or sparse but highly overlapped (Fig. 1B–C). Third, in contrast to DACT and JS-com, JST was overly conservative under all our simulation settings, which was in line with prior observations [44]. Fourth, CONTO effectively controlled FDR at the nominal level across our simulation cases; however, it was slightly conservative when the association evidence was relatively weak (e.g., τ10 = τ01 = τ11 = 2) under the genetic architecture which was highly polygenic but less overlapped (Fig. 1A), or less polygenic but moderately overlapped (Fig. 1B). As shown above, because only JST and CONTO could maintain FDR at or below the given nominal level, we thus mainly considered these two methods in our following analyses. When assessing power, it was obviously observed that CONTO was consistently more powerful compared to JST across our simulation scenarios (Fig. 2). For instance, under the polygenic but less overlapped case, CONTO had a 26.6% higher power compared to JST when the association evidence was strong (τ10 = τ01 = τ11 = 4) and the number of tested genes was 15000 (Fig. 2A). The advantage of CONTO over JST became more remarkable under other two simulation scenarios (Fig. 2B–C). Finally, it was found that the similar patterns of FDR control and power behaviors were consistently observed when the number of tested genes was 10000 or 20000 (Additional file 1: Figure S1–S4).
Fig. 2

Estimated statistical power under the simulation settings: A λ00 = 0.40, λ10 = 0.20, λ01 = 0.20, and λ11 = 0.2; B λ00 = 0.80, λ10 = 0.05, λ01 = 0.05, and λ11 = 0.10, and C λ00 = 0.90, λ10 = 0.01, λ01 = 0.01, and λ11 = 0.08. Here, the number of genes was set to 15000, and the power was calculated as the proportion of truly overlapped associated genes among all identified ones

Estimated statistical power under the simulation settings: A λ00 = 0.40, λ10 = 0.20, λ01 = 0.20, and λ11 = 0.2; B λ00 = 0.80, λ10 = 0.05, λ01 = 0.05, and λ11 = 0.10, and C λ00 = 0.90, λ10 = 0.01, λ01 = 0.01, and λ11 = 0.08. Here, the number of genes was set to 15000, and the power was calculated as the proportion of truly overlapped associated genes among all identified ones

Associated genes discovered by MAGMA and CONTO

Based on the association results of MAGMA, we identified a set of trait-associated genes (FDR  < 0.01) shared across the EAS and EUR populations using JST and CONTO (Table 1). Consistent with the results shown in the simulation studies above, we found that CONTO had higher power and thus discovered approximately two-fold more associated genes compared to JST across all the traits, with the average number of shared associated genes increased from 109 detected by JST to 291 discovered by CONTO. Moreover, for every trait all genes identified by JST were also simultaneously detected by CONTO. With regards to CONTO, the number of detected genes related to the trait in both populations ranged from 53 for AOA to 1,544 for Height. As expected, the number of common traits-associated genes was highly correlated with the trans-ethnic genetic overlap (Spearman correlation = 0.593, P = 8.75 × 10−4) (Fig. 3A). Conceptually, greater trans-ethnic genetic correlation implies higher degree of common genetic foundation underlying the trait between two diverse populations [26, 28, 42]. For example, the trans-ethnic genetic correlation was 0.93 (se = 0.04) (calculated with the popcorn method [26]) and the number of population-common associated genes was 824 for T2D; while the trans-ethnic genetic correlation was 0.53 (se = 0.11) and the number of population-common associated genes was 53 for adult-onset asthma.
Fig. 3

A Relationship between the number of population-common genes identified by CONTO and the trans-ethnic genetic correlation calculated with the popcorn method; B Relationship between the number of associated genes identified by MAGMA and the sample size of each trait in the EAS population; and C Relationship between the number of associated genes identified by MAGMA and the sample size of each trait in the EUR population

A Relationship between the number of population-common genes identified by CONTO and the trans-ethnic genetic correlation calculated with the popcorn method; B Relationship between the number of associated genes identified by MAGMA and the sample size of each trait in the EAS population; and C Relationship between the number of associated genes identified by MAGMA and the sample size of each trait in the EUR population Besides population-common genes, we could also identify many population-specific genes (FDR  < 0.01) (Table 1). It is evident that more population-specific genes were detected for traits in the EUR population because of higher power resulting from larger sample size (Fig. 3B–C). Consequently, the majority (an average of 92.4% across all traits) of trait-associated genes identified in the EAS population were also discovered in the EUR population, while only 56.1% of trait-associated genes identified in the EUR population were detected in the EAS population.

Conservation of associated genes

In terms of conservation score analysis, we observed that these population-common genes were often more evolutionarily conserved compared to unique associated genes identified in a single population, which were also more evolutionarily conserved than null genes. The increased conservation pattern in population-common genes was reflected by each of the three conservation scores including dN/dS ratio (Fig. 4A), phyloP score (Fig. 4B), and phastCons score (Fig. 4C). More specifically, the average phyloP scores were 0.209, 0.188, and 0.153 for common trait-associated genes, population-specific trait-associated genes, and null genes (P = 1.14 × 10−11), respectively; the corresponding average phastCons scores were 0.132, 0.127, and 0.116 (P = 9.36 × 10−14), and the corresponding average dN/dS ratio were 0.091, 0.096, and 0.097 (P = 6.86 × 10−10).
Fig. 4

Average dN/dS ratio score A, phyloP score B, and phastCons score C across genes for all traits in distinct gene groups

Average dN/dS ratio score A, phyloP score B, and phastCons score C across genes for all traits in distinct gene groups

Discussion

As large-scale medical genomics studies have become increasingly diverse, understanding similarity and diversity of associated genes for complex diseases/traits and consequently the transferability of genetic discoveries across various populations is essential [76]. In the present study, we proposed a novel gene-centric method called CONTO to identify common associated genes shared across distinct populations by leveraging GWAS summary statistics. As a result, we detected many population-common association signals for complex diseases and traits. From a biological perspective, the existence of population-common genes is to some extent widespread because shared genetic loci are often believed to be of ancient origin and thus shared among diverse ancestral groups [77]. We also discovered a great deal of population-specific associated genes, indicating that genetic association discoveries identified in one population were not always generalized to other ancestral groups although there indeed shows substantial trans-ethnic genetic overlap underlying diseases/traits between distinct populations. These population-specific discoveries also imply the urgent requirement of including sufficient individuals from underrepresented populations in trans-ethnic GWASs so that more ethnic groups can benefit from current medical genomics researches [18, 40, 78]. The genetic diversity can be explained by the difference in clinical definitions and phenotype measurements, gene-gene and gene-environment interaction [79, 80]. As another interpretation, the genetic inconsistency across various populations might result from various sample sizes and therefore different statistical power of studies. Furthermore, we revealed that population-common genes were in general more evolutionarily conserved compared to these population-specific or null ones. Our work distinguishes itself from previous studies in three aspects. First, unlike prior studies which often focused only on a single trait [22, 81, 82], we considered a lot of diseases/traits, which could offer more unbiased insights into the extent of trans-ethnic genetic foundation shared across different populations. Second, in contrast to prior studies which performed single-marker association analysis, we here implemented a gene-centric trans-ethnic genetic overlap identification which jointly analyzed the association of a set of SNPs with the trait. Gene-set analysis is a popular complementary strategy in association studies and often much powerful as it aggregates multiple weak, sparse association signals at the SNP level into a strong association signal at the gene level and effectively reduces the burden of multiple tests [53]. In addition, correlation among SNPs in a gene was also considered in our gene-set association analysis; consequently, the trans-ethnic difference of LD structures was naturally explained by CONTO. Third, methodologically, we considered the trans-ethnic genetic overlap identification from a perspective of composite null hypothesis testing, which effectively takes the nature of trans-ethnic genetic overlap test into account and is thus much powerful compared to competitive methods. There are two directions that need to further explore for CONTO in the future. First, when implementing CONTO, it requires us to first generate gene-level P values using gene-set association analysis methods. Certainly, the used gene-set based method would play a critical role in CONTO. Intuitively, the performance of a gene-set association analysis method is determined by how well its modeling assumption matches the true genetic architecture of a group of SNPs [83, 84], which however is generally unknown a priori and varies from one gene to another. Consequently, it is difficult to choose a consistently optimal gene-set based method for all genes across the whole genome [52, 85, 86]. In the present study, we applied MAGMA to simultaneously examine the association evidence of multiple SNPs, which was in nature a variance-component based score test for multilocus genetic association studies built based on random-effects models [53, 87]. Although MAGMA might be not the optimal method for every gene, it exhibited excellent performance in statistical power compared to many existing gene-set based methods and was widely used in gene-centric integrative analysis in post-GWAS era [52-55]. Nevertheless, the use of robust and powerful gene-set based methods in CONTO is of importance. Prior studies have demonstrated that aggregating association evidence available from diverse sources is an effective strategy for boosting power; such as integrating multiple gene expression prediction models in transcriptome-wide association studies using the harmonic mean P-value combining method [88], and combining a group of methods into a single powerful omnibus test using the minimum P-value method [86] or the aggregated Cauchy combination method [89, 90]. Therefore, leveraging these similar aggregation strategies to CONTO is an interesting direction that warrants further explorations. Second, to a great degree, CONTO should be viewed as a qualitative trans-ethnic genetic overlap identification method because it can only offer the determination whether a particular gene is associated with the trait in both populations, but is difficult to give accurate evidence regarding the consistence of trans-ethnic genetic effects across populations. Understanding the effect difference could provide more in-depth insight into the diversity and similarity of genetic architecture underlying the same trait across distinct ancestral groups. To examine the genetic influence of a gene on complex traits in GWAS, prior studies attempted to employ the polygenic risk score calculated based on SNP effect sizes and genotypes of individuals available from external reference panels [2, 30, 54, 55]. Thus, such score may be also useful for the evaluation of trans-ethnic genetic effect of a gene in CONTO.

Conclusion

In summary, CONTO stands for a powerful method for detecting trans-ethnic genetic overlap across diverse ancestral groups; our results provide important implications on the transferability of GWAS discoveries in one population to others. Additional file 1: Table S1. Complex traits available from the European and East Asian analyzed in the present study. Figure S1. Estimated false discovery rate under the simulation settings: (A) λ00=0.40, λ10=0.20, λ01=0.20, and λ11=0.2; (B) λ00=0.80, λ10=0.05, λ01=0.05, and λ11=0.10, and (C) λ00=0.90, λ10=0.01, λ01=0.01, and λ11=0.08. Here, the number of genes was set to 10000, and the false discovery rate was calculated as the proportion of non-overlapped associated genes among all identified ones. Figure S2. Estimated statistical power under the simulation settings: (A) λ00=0.40, λ10=0.20, λ01=0.20, and λ11=0.2; (B) λ00=0.80, λ10=0.05, λ01=0.05, and λ11=0.10, and (C) λ00=0.90, λ10=0.01, λ01=0.01, and λ11=0.08. Here, the number of genes was set to 10000, and the power was calculated as the proportion of truly overlapped associated genes among all identified ones. Figure S3. Estimated false discovery rate under the simulation settings: (A) λ00=0.40, λ10=0.20, λ01=0.20, and λ11=0.2; (B) λ00=0.80, λ10=0.05, λ01=0.05, and λ11=0.10, and (C) λ00=0.90, λ10=0.01, λ01=0.01, and λ11=0.08. Here, the number of genes was set to 20000, and the false discovery rate was calculated as the proportion of non-overlapped associated genes among all identified ones. Figure S4. Estimated statistical power under the simulation settings: (A) λ00=0.40, λ10=0.20, λ01=0.20, and λ11=0.2; (B) λ00=0.80, λ10=0.05, λ01=0.05, and λ11=0.10, and (C) λ00=0.90, λ10=0.01, λ01=0.01, and λ11=0.08. Here, the number of genes was set to 20000, and the power was calculated as the proportion of truly overlapped associated genes among all identified ones.
  84 in total

1.  Evaluating marginal genetic correlation of associated loci for complex diseases and traits between European and East Asian populations.

Authors:  Haojie Lu; Ting Wang; Jinhui Zhang; Shuo Zhang; Shuiping Huang; Ping Zeng
Journal:  Hum Genet       Date:  2021-06-06       Impact factor: 4.132

2.  Estimating the causal tissues for complex traits and diseases.

Authors:  Halit Ongen; Andrew A Brown; Olivier Delaneau; Nikolaos I Panousis; Alexandra C Nica; Emmanouil T Dermitzakis
Journal:  Nat Genet       Date:  2017-10-23       Impact factor: 38.330

3.  The power of genetic diversity in genome-wide association studies of lipids.

Authors:  Shoa L Clarke; Kuan-Han H Wu; Stavroula Kanoni; Greg J M Zajac; Shweta Ramdas; Sarah E Graham; Ida Surakka; Ioanna Ntalla; Sailaja Vedantam; Thomas W Winkler; Adam E Locke; Eirini Marouli; Mi Yeong Hwang; Sohee Han; Akira Narita; Ananyo Choudhury; Amy R Bentley; Kenneth Ekoru; Anurag Verma; Bhavi Trivedi; Hilary C Martin; Karen A Hunt; Qin Hui; Derek Klarin; Xiang Zhu; Gudmar Thorleifsson; Anna Helgadottir; Daniel F Gudbjartsson; Hilma Holm; Isleifur Olafsson; Masato Akiyama; Saori Sakaue; Chikashi Terao; Masahiro Kanai; Wei Zhou; Ben M Brumpton; Humaira Rasheed; Sanni E Ruotsalainen; Aki S Havulinna; Yogasudha Veturi; QiPing Feng; Elisabeth A Rosenthal; Todd Lingren; Jennifer Allen Pacheco; Sarah A Pendergrass; Jeffrey Haessler; Franco Giulianini; Yuki Bradford; Jason E Miller; Archie Campbell; Kuang Lin; Iona Y Millwood; George Hindy; Asif Rasheed; Jessica D Faul; Wei Zhao; David R Weir; Constance Turman; Hongyan Huang; Mariaelisa Graff; Anubha Mahajan; Michael R Brown; Weihua Zhang; Ketian Yu; Ellen M Schmidt; Anita Pandit; Stefan Gustafsson; Xianyong Yin; Jian'an Luan; Jing-Hua Zhao; Fumihiko Matsuda; Hye-Mi Jang; Kyungheon Yoon; Carolina Medina-Gomez; Achilleas Pitsillides; Jouke Jan Hottenga; Gonneke Willemsen; Andrew R Wood; Yingji Ji; Zishan Gao; Simon Haworth; Ruth E Mitchell; Jin Fang Chai; Mette Aadahl; Jie Yao; Ani Manichaikul; Helen R Warren; Julia Ramirez; Jette Bork-Jensen; Line L Kårhus; Anuj Goel; Maria Sabater-Lleal; Raymond Noordam; Carlo Sidore; Edoardo Fiorillo; Aaron F McDaid; Pedro Marques-Vidal; Matthias Wielscher; Stella Trompet; Naveed Sattar; Line T Møllehave; Betina H Thuesen; Matthias Munz; Lingyao Zeng; Jianfeng Huang; Bin Yang; Alaitz Poveda; Azra Kurbasic; Claudia Lamina; Lukas Forer; Markus Scholz; Tessel E Galesloot; Jonathan P Bradfield; E Warwick Daw; Joseph M Zmuda; Jonathan S Mitchell; Christian Fuchsberger; Henry Christensen; Jennifer A Brody; Mary F Feitosa; Mary K Wojczynski; Michael Preuss; Massimo Mangino; Paraskevi Christofidou; Niek Verweij; Jan W Benjamins; Jorgen Engmann; Rachel L Kember; Roderick C Slieker; Ken Sin Lo; Nuno R Zilhao; Phuong Le; Marcus E Kleber; Graciela E Delgado; Shaofeng Huo; Daisuke D Ikeda; Hiroyuki Iha; Jian Yang; Jun Liu; Hampton L Leonard; Jonathan Marten; Börge Schmidt; Marina Arendt; Laura J Smyth; Marisa Cañadas-Garre; Chaolong Wang; Masahiro Nakatochi; Andrew Wong; Nina Hutri-Kähönen; Xueling Sim; Rui Xia; Alicia Huerta-Chagoya; Juan Carlos Fernandez-Lopez; Valeriya Lyssenko; Meraj Ahmed; Anne U Jackson; Marguerite R Irvin; Christopher Oldmeadow; Han-Na Kim; Seungho Ryu; Paul R H J Timmers; Liubov Arbeeva; Rajkumar Dorajoo; Leslie A Lange; Xiaoran Chai; Gauri Prasad; Laura Lorés-Motta; Marc Pauper; Jirong Long; Xiaohui Li; Elizabeth Theusch; Fumihiko Takeuchi; Cassandra N Spracklen; Anu Loukola; Sailalitha Bollepalli; Sophie C Warner; Ya Xing Wang; Wen B Wei; Teresa Nutile; Daniela Ruggiero; Yun Ju Sung; Yi-Jen Hung; Shufeng Chen; Fangchao Liu; Jingyun Yang; Katherine A Kentistou; Mathias Gorski; Marco Brumat; Karina Meidtner; Lawrence F Bielak; Jennifer A Smith; Prashantha Hebbar; Aliki-Eleni Farmaki; Edith Hofer; Maoxuan Lin; Chao Xue; Jifeng Zhang; Maria Pina Concas; Simona Vaccargiu; Peter J van der Most; Niina Pitkänen; Brian E Cade; Jiwon Lee; Sander W van der Laan; Kumaraswamy Naidu Chitrala; Stefan Weiss; Martina E Zimmermann; Jong Young Lee; Hyeok Sun Choi; Maria Nethander; Sandra Freitag-Wolf; Lorraine Southam; Nigel W Rayner; Carol A Wang; Shih-Yi Lin; Jun-Sing Wang; Christian Couture; Leo-Pekka Lyytikäinen; Kjell Nikus; Gabriel Cuellar-Partida; Henrik Vestergaard; Bertha Hildalgo; Olga Giannakopoulou; Qiuyin Cai; Morgan O Obura; Jessica van Setten; Xiaoyin Li; Karen Schwander; Natalie Terzikhan; Jae Hun Shin; Rebecca D Jackson; Alexander P Reiner; Lisa Warsinger Martin; Zhengming Chen; Liming Li; Heather M Highland; Kristin L Young; Takahisa Kawaguchi; Joachim Thiery; Joshua C Bis; Girish N Nadkarni; Lenore J Launer; Huaixing Li; Mike A Nalls; Olli T Raitakari; Sahoko Ichihara; Sarah H Wild; Christopher P Nelson; Harry Campbell; Susanne Jäger; Toru Nabika; Fahd Al-Mulla; Harri Niinikoski; Peter S Braund; Ivana Kolcic; Peter Kovacs; Tota Giardoglou; Tomohiro Katsuya; Konain Fatima Bhatti; Dominique de Kleijn; Gert J de Borst; Eung Kweon Kim; Hieab H H Adams; M Arfan Ikram; Xiaofeng Zhu; Folkert W Asselbergs; Adriaan O Kraaijeveld; Joline W J Beulens; Xiao-Ou Shu; Loukianos S Rallidis; Oluf Pedersen; Torben Hansen; Paul Mitchell; Alex W Hewitt; Mika Kähönen; Louis Pérusse; Claude Bouchard; Anke Tönjes; Yii-Der Ida Chen; Craig E Pennell; Trevor A Mori; Wolfgang Lieb; Andre Franke; Claes Ohlsson; Dan Mellström; Yoon Shin Cho; Hyejin Lee; Jian-Min Yuan; Woon-Puay Koh; Sang Youl Rhee; Jeong-Taek Woo; Iris M Heid; Klaus J Stark; Henry Völzke; Georg Homuth; Michele K Evans; Alan B Zonderman; Ozren Polasek; Gerard Pasterkamp; Imo E Hoefer; Susan Redline; Katja Pahkala; Albertine J Oldehinkel; Harold Snieder; Ginevra Biino; Reinhold Schmidt; Helena Schmidt; Y Eugene Chen; Stefania Bandinelli; George Dedoussis; Thangavel Alphonse Thanaraj; Sharon L R Kardia; Norihiro Kato; Matthias B Schulze; Giorgia Girotto; Bettina Jung; Carsten A Böger; Peter K Joshi; David A Bennett; Philip L De Jager; Xiangfeng Lu; Vasiliki Mamakou; Morris Brown; Mark J Caulfield; Patricia B Munroe; Xiuqing Guo; Marina Ciullo; Jost B Jonas; Nilesh J Samani; Jaakko Kaprio; Päivi Pajukanta; Linda S Adair; Sonny Augustin Bechayda; H Janaka de Silva; Ananda R Wickremasinghe; Ronald M Krauss; Jer-Yuarn Wu; Wei Zheng; Anneke I den Hollander; Dwaipayan Bharadwaj; Adolfo Correa; James G Wilson; Lars Lind; Chew-Kiat Heng; Amanda E Nelson; Yvonne M Golightly; James F Wilson; Brenda Penninx; Hyung-Lae Kim; John Attia; Rodney J Scott; D C Rao; Donna K Arnett; Mark Walker; Heikki A Koistinen; Giriraj R Chandak; Chittaranjan S Yajnik; Josep M Mercader; Teresa Tusié-Luna; Carlos A Aguilar-Salinas; Clicerio Gonzalez Villalpando; Lorena Orozco; Myriam Fornage; E Shyong Tai; Rob M van Dam; Terho Lehtimäki; Nish Chaturvedi; Mitsuhiro Yokota; Jianjun Liu; Dermot F Reilly; Amy Jayne McKnight; Frank Kee; Karl-Heinz Jöckel; Mark I McCarthy; Colin N A Palmer; Veronique Vitart; Caroline Hayward; Eleanor Simonsick; Cornelia M van Duijn; Fan Lu; Jia Qu; Haretsugu Hishigaki; Xu Lin; Winfried März; Esteban J Parra; Miguel Cruz; Vilmundur Gudnason; Jean-Claude Tardif; Guillaume Lettre; Leen M 't Hart; Petra J M Elders; Scott M Damrauer; Meena Kumari; Mika Kivimaki; Pim van der Harst; Tim D Spector; Ruth J F Loos; Michael A Province; Bruce M Psaty; Ivan Brandslund; Peter P Pramstaller; Kaare Christensen; Samuli Ripatti; Elisabeth Widén; Hakon Hakonarson; Struan F A Grant; Lambertus A L M Kiemeney; Jacqueline de Graaf; Markus Loeffler; Florian Kronenberg; Dongfeng Gu; Jeanette Erdmann; Heribert Schunkert; Paul W Franks; Allan Linneberg; J Wouter Jukema; Amit V Khera; Minna Männikkö; Marjo-Riitta Jarvelin; Zoltan Kutalik; Francesco Cucca; Dennis O Mook-Kanamori; Ko Willems van Dijk; Hugh Watkins; David P Strachan; Niels Grarup; Peter Sever; Neil Poulter; Jerome I Rotter; Thomas M Dantoft; Fredrik Karpe; Matt J Neville; Nicholas J Timpson; Ching-Yu Cheng; Tien-Yin Wong; Chiea Chuen Khor; Charumathi Sabanayagam; Annette Peters; Christian Gieger; Andrew T Hattersley; Nancy L Pedersen; Patrik K E Magnusson; Dorret I Boomsma; Eco J C de Geus; L Adrienne Cupples; Joyce B J van Meurs; Mohsen Ghanbari; Penny Gordon-Larsen; Wei Huang; Young Jin Kim; Yasuharu Tabara; Nicholas J Wareham; Claudia Langenberg; Eleftheria Zeggini; Johanna Kuusisto; Markku Laakso; Erik Ingelsson; Goncalo Abecasis; John C Chambers; Jaspal S Kooner; Paul S de Vries; Alanna C Morrison; Kari E North; Martha Daviglus; Peter Kraft; Nicholas G Martin; John B Whitfield; Shahid Abbas; Danish Saleheen; Robin G Walters; Michael V Holmes; Corri Black; Blair H Smith; Anne E Justice; Aris Baras; Julie E Buring; Paul M Ridker; Daniel I Chasman; Charles Kooperberg; Wei-Qi Wei; Gail P Jarvik; Bahram Namjou; M Geoffrey Hayes; Marylyn D Ritchie; Pekka Jousilahti; Veikko Salomaa; Kristian Hveem; Bjørn Olav Åsvold; Michiaki Kubo; Yoichiro Kamatani; Yukinori Okada; Yoshinori Murakami; Unnur Thorsteinsdottir; Kari Stefansson; Yuk-Lam Ho; Julie A Lynch; Daniel J Rader; Philip S Tsao; Kyong-Mi Chang; Kelly Cho; Christopher J O'Donnell; John M Gaziano; Peter Wilson; Charles N Rotimi; Scott Hazelhurst; Michèle Ramsay; Richard C Trembath; David A van Heel; Gen Tamiya; Masayuki Yamamoto; Bong-Jo Kim; Karen L Mohlke; Timothy M Frayling; Joel N Hirschhorn; Sekar Kathiresan; Michael Boehnke; Pradeep Natarajan; Gina M Peloso; Christopher D Brown; Andrew P Morris; Themistocles L Assimes; Panos Deloukas; Yan V Sun; Cristen J Willer
Journal:  Nature       Date:  2021-12-09       Impact factor: 69.504

4.  MAGMA: generalized gene-set analysis of GWAS data.

Authors:  Christiaan A de Leeuw; Joris M Mooij; Tom Heskes; Danielle Posthuma
Journal:  PLoS Comput Biol       Date:  2015-04-17       Impact factor: 4.475

5.  Modeling Heterogeneity in the Genetic Architecture of Ethnically Diverse Groups Using Random Effect Interaction Models.

Authors:  Yogasudha Veturi; Gustavo de Los Campos; Nengjun Yi; Wen Huang; Ana I Vazquez; Brigitte Kühnel
Journal:  Genetics       Date:  2019-02-22       Impact factor: 4.562

6.  Population-specific causal disease effect sizes in functionally important regions impacted by selection.

Authors:  Huwenbo Shi; Steven Gazal; Masahiro Kanai; Evan M Koch; Armin P Schoech; Katherine M Siewert; Samuel S Kim; Yang Luo; Tiffany Amariuta; Hailiang Huang; Yukinori Okada; Soumya Raychaudhuri; Shamil R Sunyaev; Alkes L Price
Journal:  Nat Commun       Date:  2021-02-17       Impact factor: 14.919

7.  Population genetic differentiation of height and body mass index across Europe.

Authors:  Matthew R Robinson; Gibran Hemani; Carolina Medina-Gomez; Massimo Mezzavilla; Tonu Esko; Konstantin Shakhbazov; Joseph E Powell; Anna Vinkhuyzen; Sonja I Berndt; Stefan Gustafsson; Anne E Justice; Bratati Kahali; Adam E Locke; Tune H Pers; Sailaja Vedantam; Andrew R Wood; Wouter van Rheenen; Ole A Andreassen; Paolo Gasparini; Andres Metspalu; Leonard H van den Berg; Jan H Veldink; Fernando Rivadeneira; Thomas M Werge; Goncalo R Abecasis; Dorret I Boomsma; Daniel I Chasman; Eco J C de Geus; Timothy M Frayling; Joel N Hirschhorn; Jouke Jan Hottenga; Erik Ingelsson; Ruth J F Loos; Patrik K E Magnusson; Nicholas G Martin; Grant W Montgomery; Kari E North; Nancy L Pedersen; Timothy D Spector; Elizabeth K Speliotes; Michael E Goddard; Jian Yang; Peter M Visscher
Journal:  Nat Genet       Date:  2015-09-14       Impact factor: 38.330

Review 8.  Statistical methods for mediation analysis in the era of high-throughput genomics: Current successes and future challenges.

Authors:  Ping Zeng; Zhonghe Shao; Xiang Zhou
Journal:  Comput Struct Biotechnol J       Date:  2021-05-26       Impact factor: 7.271

9.  Generalization and dilution of association results from European GWAS in populations of non-European ancestry: the PAGE study.

Authors:  Christopher S Carlson; Tara C Matise; Kari E North; Christopher A Haiman; Megan D Fesinmeyer; Steven Buyske; Fredrick R Schumacher; Ulrike Peters; Nora Franceschini; Marylyn D Ritchie; David J Duggan; Kylee L Spencer; Logan Dumitrescu; Charles B Eaton; Fridtjof Thomas; Alicia Young; Cara Carty; Gerardo Heiss; Loic Le Marchand; Dana C Crawford; Lucia A Hindorff; Charles L Kooperberg
Journal:  PLoS Biol       Date:  2013-09-17       Impact factor: 8.029

10.  Trans-ancestry genome-wide association meta-analysis of prostate cancer identifies new susceptibility loci and informs genetic risk prediction.

Authors:  David V Conti; Burcu F Darst; Lilit C Moss; Edward J Saunders; Xin Sheng; Alisha Chou; Fredrick R Schumacher; Ali Amin Al Olama; Sara Benlloch; Tokhir Dadaev; Mark N Brook; Ali Sahimi; Thomas J Hoffmann; Atushi Takahashi; Koichi Matsuda; Yukihide Momozawa; Masashi Fujita; Kenneth Muir; Artitaya Lophatananon; Peggy Wan; Loic Le Marchand; Lynne R Wilkens; Victoria L Stevens; Susan M Gapstur; Brian D Carter; Johanna Schleutker; Teuvo L J Tammela; Csilla Sipeky; Anssi Auvinen; Graham G Giles; Melissa C Southey; Robert J MacInnis; Cezary Cybulski; Dominika Wokołorczyk; Jan Lubiński; David E Neal; Jenny L Donovan; Freddie C Hamdy; Richard M Martin; Børge G Nordestgaard; Sune F Nielsen; Maren Weischer; Stig E Bojesen; Martin Andreas Røder; Peter Iversen; Jyotsna Batra; Suzanne Chambers; Leire Moya; Lisa Horvath; Judith A Clements; Wayne Tilley; Gail P Risbridger; Henrik Gronberg; Markus Aly; Robert Szulkin; Martin Eklund; Tobias Nordström; Nora Pashayan; Alison M Dunning; Maya Ghoussaini; Ruth C Travis; Tim J Key; Elio Riboli; Jong Y Park; Thomas A Sellers; Hui-Yi Lin; Demetrius Albanes; Stephanie J Weinstein; Lorelei A Mucci; Edward Giovannucci; Sara Lindstrom; Peter Kraft; David J Hunter; Kathryn L Penney; Constance Turman; Catherine M Tangen; Phyllis J Goodman; Ian M Thompson; Robert J Hamilton; Neil E Fleshner; Antonio Finelli; Marie-Élise Parent; Janet L Stanford; Elaine A Ostrander; Milan S Geybels; Stella Koutros; Laura E Beane Freeman; Meir Stampfer; Alicja Wolk; Niclas Håkansson; Gerald L Andriole; Robert N Hoover; Mitchell J Machiela; Karina Dalsgaard Sørensen; Michael Borre; William J Blot; Wei Zheng; Edward D Yeboah; James E Mensah; Yong-Jie Lu; Hong-Wei Zhang; Ninghan Feng; Xueying Mao; Yudong Wu; Shan-Chao Zhao; Zan Sun; Stephen N Thibodeau; Shannon K McDonnell; Daniel J Schaid; Catharine M L West; Neil Burnet; Gill Barnett; Christiane Maier; Thomas Schnoeller; Manuel Luedeke; Adam S Kibel; Bettina F Drake; Olivier Cussenot; Géraldine Cancel-Tassin; Florence Menegaux; Thérèse Truong; Yves Akoli Koudou; Esther M John; Eli Marie Grindedal; Lovise Maehle; Kay-Tee Khaw; Sue A Ingles; Mariana C Stern; Ana Vega; Antonio Gómez-Caamaño; Laura Fachal; Barry S Rosenstein; Sarah L Kerns; Harry Ostrer; Manuel R Teixeira; Paula Paulo; Andreia Brandão; Stephen Watya; Alexander Lubwama; Jeannette T Bensen; Elizabeth T H Fontham; James Mohler; Jack A Taylor; Manolis Kogevinas; Javier Llorca; Gemma Castaño-Vinyals; Lisa Cannon-Albright; Craig C Teerlink; Chad D Huff; Sara S Strom; Luc Multigner; Pascal Blanchet; Laurent Brureau; Radka Kaneva; Chavdar Slavov; Vanio Mitev; Robin J Leach; Brandi Weaver; Hermann Brenner; Katarina Cuk; Bernd Holleczek; Kai-Uwe Saum; Eric A Klein; Ann W Hsing; Rick A Kittles; Adam B Murphy; Christopher J Logothetis; Jeri Kim; Susan L Neuhausen; Linda Steele; Yuan Chun Ding; William B Isaacs; Barbara Nemesure; Anselm J M Hennis; John Carpten; Hardev Pandha; Agnieszka Michael; Kim De Ruyck; Gert De Meerleer; Piet Ost; Jianfeng Xu; Azad Razack; Jasmine Lim; Soo-Hwang Teo; Lisa F Newcomb; Daniel W Lin; Jay H Fowke; Christine Neslund-Dudas; Benjamin A Rybicki; Marija Gamulin; Davor Lessel; Tomislav Kulis; Nawaid Usmani; Sandeep Singhal; Matthew Parliament; Frank Claessens; Steven Joniau; Thomas Van den Broeck; Manuela Gago-Dominguez; Jose Esteban Castelao; Maria Elena Martinez; Samantha Larkin; Paul A Townsend; Claire Aukim-Hastie; William S Bush; Melinda C Aldrich; Dana C Crawford; Shiv Srivastava; Jennifer C Cullen; Gyorgy Petrovics; Graham Casey; Monique J Roobol; Guido Jenster; Ron H N van Schaik; Jennifer J Hu; Maureen Sanderson; Rohit Varma; Roberta McKean-Cowdin; Mina Torres; Nicholas Mancuso; Sonja I Berndt; Stephen K Van Den Eeden; Douglas F Easton; Stephen J Chanock; Michael B Cook; Fredrik Wiklund; Hidewaki Nakagawa; John S Witte; Rosalind A Eeles; Zsofia Kote-Jarai; Christopher A Haiman
Journal:  Nat Genet       Date:  2021-01-04       Impact factor: 38.330

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.