Literature DB >> 25932435

Meta-analysis of genetic association studies.

Young Ho Lee1.   

Abstract

The object of this review is to help readers to understand meta-analysis of genetic association study. Genetic association studies are a powerful approach to identify susceptibility genes for common diseases. However, the results of these studies are not consistently reproducible. In order to overcome the limitations of individual studies, larger sample sizes or meta-analysis is required. Meta-analysis is a statistical tool for combining results of different studies on the same topic, thus increasing statistical strength and precision. Meta-analysis of genetic association studies combines the results from independent studies, explores the sources of heterogeneity, and identifies subgroups associated with the factor of interest. Meta-analysis of genetic association studies is an effective tool for garnering a greater understanding of complex diseases and potentially provides new insights into gene-disease associations.

Entities:  

Keywords:  Association study; Gene; Meta-analysis; Polymorphism

Mesh:

Substances:

Year:  2015        PMID: 25932435      PMCID: PMC4390695          DOI: 10.3343/alm.2015.35.3.283

Source DB:  PubMed          Journal:  Ann Lab Med        ISSN: 2234-3806            Impact factor:   3.464


INTRODUCTION

Genetic association studies evaluate the association between a disease and susceptible genetic variants in order to identify genetic variants that affect susceptibility to common diseases [1]. Genetic association studies are a powerful approach for identifying susceptibility genes underlying common diseases and provide linkage analyses when investigating complex diseases. The number of genetic association studies is increasing exponentially; however, the results are not consistently reproducible. Genetic association studies of candidate genes have shown that the majority of initial positive associations cannot be reproduced in subsequent studies [2, 3]. These findings suggest that a large number of original findings are false-positive reports (type I errors), or that small genetic effects were undetectable (false-negative, type II errors) in the majority of these studies. However, it is possible that the lack of reproducibility may be due to true variability in the association of different populations [2]. Candidate genes occasionally demonstrate only small effects in genetic association tests (mean odds ratio [OR] of 1.33 in 55 meta-analyses); therefore, genetic association studies require thousands of subjects to have a reasonable chance of discovering an effect [2]. Thus, to detect a small or moderate genetic effect of polymorphisms, large sample sizes or meta-analysis is required [4]. Meta-analysis is a statistical tool for combining results from different studies on the same topic and is becoming a popular method for resolving discrepancies in genetic association studies. Meta-analysis is an objective, quantitative synthesis of research findings, enabling the identification of genuine associations [5]. Meta-analysis increases statistical strength and precision in estimating effects by combining the results of previous studies, thus overcoming the problem of small sample size and the inadequate statistical strength of complex trait genetic studies [6]. Testing for and quantifying inter-study heterogeneity is an important step of meta-analysis [5]. Meta-analysis analyzes the discrepancies in the results of independent studies by addressing inter-study heterogeneity, thus potentially providing new insights into gene-disease associations. Meta-analysis combines the results from independent studies, explores the sources of heterogeneity, and identifies subgroups associated with the factor of interest. When properly performed, meta-analysis of genetic association studies is considered as a decisive evidence [7], and is an effective tool for gaining a greater understanding of complex diseases. The object of this review is to help readers to understand meta-analysis of genetic association study. We deal with meta-analysis process of genetic association studies.

META-ANALYSIS METHOD

The general process by which a meta-analysis of genetic association studies is performed is outlined in Table 1. Following these steps will facilitate performing and understanding meta-analyses.
Table 1

Steps in performing a meta-analysis for genetic association studies

1.Check study quality: Hardy-Weinberg test
2.Check inter-study heterogeneity: Cochran Q test, I2
3.Meta-analysis: Fixed or random effects model, Forrest plot
4.Perform subgroup analysis: Ethnicity-specific analysis
5.Check publication bias: Funnel plot, Egger's regression test
6.Present meta-analysis result

1. Genetic models

In a genetic association study on a polymorphism with two alleles (A and a), where one of the alleles may be associated with a disease (let A be the susceptibility allele), subjects are classified into three genotypes: AA, Aa, and aa. Association studies are the following genetic models [8]: (1) the allele contrast model, in which the numbers of allele A are compared with that of allele a (A vs. a); (2) the recessive model, in which the AA genotype is compared with the combined Aa+aa genotype (AA vs. Aa+aa); (3) the dominant model, in which the combined AA+Aa genotype is compared with the aa genotype (AA+Aa vs. aa); (4) the homozygote contrast (additive) model, in which the AA genotype is compared with the aa genotype (AA vs. aa); and (5) the co-dominant model, in which the combined AA+Aa genotype is compared with the aa genotype (AA+aa vs. Aa). Genetic association studies do not adopt a specific model, and thus multiple genetic models need to be examined [9].

2. Heterogeneity

Meta-analysis analyzes the variance in the results of independent studies and examines the existence of heterogeneity in primary studies. The heterogeneity test examines the following null hypothesis: no differences exist between the findings of primary studies. Cochran's Q test is used to determine whether differences between primary studies exist or whether variations are due to chance [10]. Cochran's Q value is calculated by summing the squared deviations of each study's estimate from the overall estimate, and then comparing it with the chi-squared distribution with κ-1 degrees of freedom (df) (where κ is the number of studies) [10]. A heterogeneity P value <0.10 (not 0.05) indicates the presence of heterogeneity because of the low statistical strength of Cochran's Q test [11]. However, the Q test can be unreliable when a small number of studies are included in the meta-analysis. Another commonly used method for testing heterogeneity is the I2 value, which quantifies the effect of heterogeneity and does not depend on the number of studies or the type of outcome data. I2 values range between 0% and 100% and represent the proportion of inter-study variability that can be attributed to heterogeneity rather than chance (I2=100%×[Q -df]/Q) [12]. I2 values of 25%, 50%, and 75% are assigned as low, moderate, and high estimates, respectively [9, 12]. We evaluate Cochran's Q and I2 values by using meta-analysis software such as the Review Manager (Cochrane Collaboration, London, UK) or Comprehensive Meta-Analysis program (Biosta, Englewood, NJ, USA) by inputting the number of subjects and genotype data in the cases and controls.

3. Fixed vs. random effects models

Meta-analysis combines the effect sizes of the included studies by weighting the data according to the amount of information in each study. The weights are calculated by using the sample size and variability within each study. The fixed effects model assumes that genetic factors have similar effects on disease susceptibility in all of the studies, and that the observed variations between studies are caused by chance alone [13]. The random effects model assumes that different studies exhibit substantial diversity and assesses both intra-study sampling errors and inter-study variances [14]. The choice of meta-analysis model depends on the presence or absence of heterogeneity. In the absence of heterogeneity, a fixed effects model is used for meta-analysis. When a significant Q value (P<0.10) is calculated, indicating the existence of heterogeneity in studies, a random effects model is used for meta-analysis [15]. When study groups are homogeneous, both models would offer similar results; however, in case of heterogeneity, the random effects model usually provides wider confidence intervals (CIs) than the fixed effects model [16]. We perform meta-analysis of fixed or random effects models by using the Review Manager or Comprehensive Meta-Analysis program after inputting the number of subjects and genotype data into the cases and controls.

4. Evaluating causes of heterogeneity

It is important to assess whether heterogeneity exists in the studies included in meta-analysis as well as to determine possible causes of heterogeneity, because heterogeneity can lead to bias in meta-analysis results-referred to as "mixing apples and oranges" [17]. Subgroup analysis may also be used to assess the impact of heterogeneity. In subgroup analysis, meta-analysis is performed on the basis of factors such as ethnicity, number of studies, the Hardy-Weinberg equilibrium (HWE), or clinical features, to assess the impact of a potential source of heterogeneity. The HWE refers to a situation, in which the frequencies of genotypes are predicted on the basis of the frequencies of two alleles according to the simple Mendelian inheritance model [18]. Departures from HWE can arise from genotyping errors, population stratification, and selection bias in the recruitment of controls [19]. Whether the genotype frequencies of the controls are in HWE must be determined, because genotyping errors are a significant cause of deviation from HWE. The distribution of genotypes in the control group is tested for deviation from HWE by performing an exact test. Subgroup analysis is recommended, except for studies, in which the controls are not in HWE [11]. We can check HWE of genotypes in control groups by analyzing the genotype data by using an HWE web tool (http://ihg.gsf.de/cgi-bin/hw/hwa1.pl).

5. Publication bias

Studies reporting positive effects tend to have a higher likelihood of being published than those that do not, and studies showing no significant result tend to remain unpublished [20]. As meta-analysis includes only published studies, the degree of the actual effect might be overestimated [20]. This outcome is termed "publication bias." Meta-analysis of genetic association studies may be subject to publication bias. The funnel plot is a commonly used graphic test to assess publication bias in meta-analyses [21]. This test is a scatter plot of the effect estimate from each study included in the meta-analysis against the measure of its precision (1/SE) [22]. The effect estimates of small-scale studies will scatter more widely at the bottom of the graph, while the spread among larger-scale studies will be narrower. In the absence of publication bias, the funnel plot resembles a symmetrical inverted funnel; asymmetry suggests the existence of publication bias [23]. However, funnel plots usually require a range of studies of varying scales and subjective judgments. Therefore, other methods such as the Egger's linear regression test [21], which measures funnel plot asymmetry on a natural logarithm scale of ORs, is used. If the existence of publication bias is suggested by the funnel plot or other statistical tests, the results of the meta-analysis should be interpreted cautiously, and the possible impact of publication bias should be noted. Funnel plots and Egger's regression tests can be performed by using the Review Manager or Comprehensive Meta-Analysis program.

EXAMPLES OF META-ANALYSES OF GENETIC ASSOCIATION TESTS

We selected studies for the meta-analysis of association between the Fc receptor like-3 (FCRL3)-169C/T polymorphism and rheumatoid arthritis (RA) based on inclusion and exclusion criteria [24]. We included studies if: (1) they were case control studies, (2) the data was original (independence among studies), (3) they provided enough data to calculate ORs, and (4) the distribution of the FCRL3-169C/T polymorphism in normal controls was in HWE. We excluded the followings: (1) studies that contained overlapping data, (2) studies, in which the number of null and wild genotypes could not be ascertained, and (3) those, in which family members were studied, as these analyses are based on linkage considerations. Meta-analyses were performed by using: 1) allelic contrast, 2) recessive, 3) dominant, 4) homozygote, and 5) heterozygote contrast models to determine the association between the FCRL3-169C/T polymorphism and RA [24]. Step 1: We determined if genotypes in the control group deviated from HWE by using the web tool found at http://ihg.gsf.de/cgi-bin/hw/hwa1.pl. No deviation from HWE was found in any of the studies included in the meta-analysis. Step 2: We calculated Cochran's Q and I2 values by using Comprehensive Meta-Analysis program. Cochran's Q P value and I2 values were calculated as 0.084 and 34.1%, respectively, indicating existence of inter-study heterogeneity [24]. Step 3: We selected one model between the fixed and random effects models on the basis of heterogeneity and performed statistical analyses of the selected model by using Comprehensive Meta-Analysis program. We selected the random effect model because of the significant inter-study heterogeneity (Cochran's Q P value=0.084, I2 value= 34.1%) [24]. By performing a random effects meta-analysis, no association was found between RA and the FCRL3-169C allele in the study subjects (OR=1.046, 95% CI=0.997-1.098, P=0.068) [24]. Step 4: We performed ethnicity-specific meta-analyses on the European and Asian populations of the 17 studies included in the original analysis (9 European, 7 Asian, and 1 Native North American populations). The meta-analysis stratified by ethnicity revealed a significant association between the FCRL3-169C allele and RA in the Asian populations under allelic contrast, recessive, dominant, and homozygote contrast models (C allele OR=1.101, 95% CI=1.035-1.174, P=0.002), but not in the European populations (OR=1.012, 95% CI= 0.962-1.065, P=0.643) [24]. Step 5: We generated a funnel plot and performed Egger's regression test by using Comprehensive Meta-Analysis program. The funnel plot showed no evidence of asymmetry and Egger's regression test did not yield a significant P value (Egger's regression test P value=0.863), indicating that publication bias did not affect the meta-analysis [24]. Step 6: We presented the meta-analysis results showing that FCRL3-169C/T polymorphism may confer susceptibility to seropositive RA in Asian populations [24].

DISCUSSION

Meta-analysis is a statistical method for combining the results of independent studies, providing a precise estimate of the effect size, and increasing statistical strength, which is especially important when the strength of primary study is limited because of a small sample size. Meta-analysis solves a problem associated with genetic association studies by analyzing variation in the results of different studies by identifying inter-study heterogeneity. Thus, meta-analysis is potentially a powerful tool for assessing the effects of candidate genes [25, 26, 27]. Meta-analysis is also applicable to a variety of genetics study designs from family-based linkage studies to genome-wide scans and genome-wide association studies, as well as to population-based association studies [5]. Meta-analysis has been used increasingly for combining and integrating data from a number of independent studies. However, meta-analysis is not a suitable replacement for robust genetic association studies. A major criticism of meta-analysis is that it combines different types of studies, mixing apples and oranges. However, meta-analysis can overcome this problem by assessing heterogeneity in studies, and performing subgroup analysis [28]. The "garbage in, garbage out" metaphor refers to the fact that if a meta-analysis includes low quality studies with bias, the results of the meta-analysis will be biased. The results of meta-analysis depend on the quality of primary research. Therefore, meta-analysis should include studies selected on the basis of inclusion criteria. Despite these limitations, a properly executed meta-analysis is an invaluable link between past and future studies that objectively and quantitatively synthesizes evidence while minimizing bias. In conclusion, meta-analysis of genetic association studies is an effective tool for garnering a greater understanding of complex diseases and potentially provides new insights into gene-disease associations.
  28 in total

1.  Funnel plots for detecting bias in meta-analysis: guidelines on choice of axis.

Authors:  J A Sterne; M Egger
Journal:  J Clin Epidemiol       Date:  2001-10       Impact factor: 6.437

2.  Why we need a broad perspective on meta-analysis. It may be crucially important for patients.

Authors:  P C Gotzsche
Journal:  BMJ       Date:  2000-09-09

3.  Quantifying heterogeneity in a meta-analysis.

Authors:  Julian P T Higgins; Simon G Thompson
Journal:  Stat Med       Date:  2002-06-15       Impact factor: 2.373

Review 4.  Genetic associations: false or true?

Authors:  John P A Ioannidis
Journal:  Trends Mol Med       Date:  2003-04       Impact factor: 11.951

5.  Meta-analysis of genetic association studies supports a contribution of common variants to susceptibility to common disease.

Authors:  Kirk E Lohmueller; Celeste L Pearce; Malcolm Pike; Eric S Lander; Joel N Hirschhorn
Journal:  Nat Genet       Date:  2003-01-13       Impact factor: 38.330

Review 6.  Problems of reporting genetic associations with complex outcomes.

Authors:  Helen M Colhoun; Paul M McKeigue; George Davey Smith
Journal:  Lancet       Date:  2003-03-08       Impact factor: 79.321

7.  Meta-analysis of genetic association studies.

Authors:  Marcus R Munafò; Jonathan Flint
Journal:  Trends Genet       Date:  2004-09       Impact factor: 11.639

8.  A general parametric approach to the meta-analysis of randomized clinical trials.

Authors:  A Whitehead; J Whitehead
Journal:  Stat Med       Date:  1991-11       Impact factor: 2.373

9.  Meta-analysis of TNF-alpha promoter -308 A/G polymorphism and SLE susceptibility.

Authors:  Young Ho Lee; John B Harley; Swapan K Nath
Journal:  Eur J Hum Genet       Date:  2006-03       Impact factor: 4.246

10.  Meta-analysis in clinical trials.

Authors:  R DerSimonian; N Laird
Journal:  Control Clin Trials       Date:  1986-09
View more
  40 in total

Review 1.  Association between the 8-oxoguanine DNA glycosylase gene Ser326Cys polymorphism and age-related cataract: a systematic review and meta-analysis.

Authors:  Xiao-Cui Liu; Xiao-Hui Guo; Bing Chen; Zhao-Hui Li; Xiao-Fei Liu
Journal:  Int Ophthalmol       Date:  2017-06-19       Impact factor: 2.031

2.  Parents' and Children's ADHD in a Family System.

Authors:  Kirby Deater-Deckard
Journal:  J Abnorm Child Psychol       Date:  2017-04

3.  Field synopsis and meta-analyses of genetic epidemiological evidence for Kashin-Beck disease, an endemic osteoarthropathy in China.

Authors:  Lei Yang; Guang-Hui Zhao; Huan Liu; Xi Wang; Xiong Guo; Mikko J Lammi
Journal:  Mol Genet Genomics       Date:  2016-06-02       Impact factor: 3.291

4.  Meta-analysis of associations between interleukin-10 polymorphisms and susceptibility to Behcet's disease.

Authors:  Young Ho Lee; Gwan Gyu Song
Journal:  Immunol Res       Date:  2019-10       Impact factor: 2.829

Review 5.  The roles of ANRIL polymorphisms in periodontitis: a systematic review and meta-analysis.

Authors:  Ayla Öztürk; Ahmet Oğuz Ada
Journal:  Clin Oral Investig       Date:  2021-11-25       Impact factor: 3.573

Review 6.  An association between IL-10 promoter polymorphisms and diabetic nephropathy: a meta-analysis of case-control studies.

Authors:  Norah Htet Htet; Arun Kumar Basavaraj; Cho Naing; Sivalingam Nalliah
Journal:  J Diabetes Metab Disord       Date:  2018-09-01

Review 7.  Association of ESRα Gene Pvu II T>C, XbaI A>G and BtgI G>A Polymorphisms with Knee Osteoarthritis Susceptibility: A Systematic Review and Meta-Analysis Based on 22 Case-Control Studies.

Authors:  Masoud Mahdinezhad Yazdi; Mohamad H Jamalaldini; Mohammad R Sobhan; Mohammadali Jafari; Mahta Mazaheri; Masoud Zare-Shehneh; Hossein Neamatzadeh
Journal:  Arch Bone Jt Surg       Date:  2017-11

8.  Janus kinase inhibitors for treating active ankylosing spondylitis: a meta-analysis of randomized controlled trials.

Authors:  Y H Lee; G G Song
Journal:  Z Rheumatol       Date:  2020-12-18       Impact factor: 1.372

Review 9.  Fruit and Vegetables Consumption and Risk of Hypertension: A Meta-Analysis.

Authors:  Bingrong Li; Fang Li; Longfei Wang; Dongfeng Zhang
Journal:  J Clin Hypertens (Greenwich)       Date:  2016-01-29       Impact factor: 3.738

10.  A comprehensive meta-analysis and a case-control study give insights into genetic susceptibility of lung cancer and subgroups.

Authors:  Debmalya Sengupta; Souradeep Banerjee; Pramiti Mukhopadhyay; Ritabrata Mitra; Tamohan Chaudhuri; Abhijit Sarkar; Gautam Bhattacharjee; Somsubhra Nath; Susanta Roychoudhury; Samsiddhi Bhattacharjee; Mainak Sengupta
Journal:  Sci Rep       Date:  2021-07-16       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.