| Literature DB >> 21261943 |
Urko M Marigorta1, Oscar Lao, Ferran Casals, Francesc Calafell, Carlos Morcillo-Suárez, Rui Faria, Elena Bosch, François Serra, Jaume Bertranpetit, Hernán Dopazo, Arcadi Navarro.
Abstract
BACKGROUND: Searching for associations between genetic variants and complex diseases has been a very active area of research for over two decades. More than 51,000 potential associations have been studied and published, a figure that keeps increasing, especially with the recent explosion of array-based Genome-Wide Association Studies. Even if the number of true associations described so far is high, many of the putative risk variants detected so far have failed to be consistently replicated and are widely considered false positives. Here, we focus on the world-wide patterns of replicability of published association studies.Entities:
Mesh:
Year: 2011 PMID: 21261943 PMCID: PMC3039608 DOI: 10.1186/1471-2164-12-55
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Summary of the steps and filters to ascertain the Global and Continental Sets. A further text summary is available in Additional File 1.
Figure 2Coefficients of correlation between replicability and . Values correspond to Spearman correlation coefficients (ρ). The replicability is measured in % of positive studies over total studies and the genetic differentiation values correspond to average FST values between Europeans and East Asians, under different pooling conditions. X-axis values indicate the cutoff of number of studies per associations, filtering out those associations with fewer studies than indicated. Solid grey lines correspond to the Full Set of associations. Dashed lines correspond to associations from the Conservative Set (replicability ≥50% in at least one continent). Circles, triangles and squares correspond to FST values computed from average of the SNPs in a gene, individual SNPs and tagSNPs, respectively.
Summary of Spearman's correlation coefficients between FST and ϕ as the discordance in replicabilities for the 37 associations from the Continental Set
| ASSOCIATIONS | CATHEGORY | N | ||||
|---|---|---|---|---|---|---|
| Full Set | All SNPs | By Gene | Average | 33 | 0.496 * | |
| Variance | 32 | 0.636 * | ||||
| Independently | Average | 3710 | 0.155 * | |||
| tagSNPs | By Gene | Average | 33 | 0.187 | ||
| Independently | Average | 538 | 0.16 * | |||
| Conservative Setc | All SNPs | By Gene | Average | 26 | 0.666 * | |
| Variance | 24 | 0.666 * | ||||
| Independently | Average | 2454 | 0.15 * | |||
| tagSNPs | By Gene | Average | 24 | 0.453 | ||
| Independently | Average | 486 | 0.063 * | |||
a Pooling by gene tests average genic FST, while in pooling independently each SNP has been assigned the replicability from the association it belongs to
b Variable indicates which parameter (either average or variance in FST) has been tested versus ϕ
c Conservative set contains those associations that have a replicability of ≥50% in at least one continent
Figure 3Correlation between discordance in replicability and . The discordance in replicability values correspond to values and the genetic differentiation to average genic FST values between Europeans and East Asians. Grey circles correspond to the associations from the Conservative Set (n = 26, replicability ≥50% in at least one continent). Solid line indicates the regression line for the full set of associations (n = 33). Dashed line indicates the regression line for the conservative set of associations.
Summary of multiple regression analysis for the Continental Set.
| a) FULL SET | P value | Beta | ||||
|---|---|---|---|---|---|---|
| a.1) Two predictors (n = 33)a | ||||||
| Gene Length (number of SNPs) | 24.641 | 0.596 | 0.613 | >0.93 | ||
| | 0.456 | >0.99 | ||||
| a.2) One predictor (n = 31;NRG1/PARK2 out) | ||||||
| | 16.097 | 0.335 | 0.597 | - | ||
| b) CONSERVATIVE SET b | F df:1,24 | P value | R2 | Beta | Tolc | |
| b.1) Two predictors (n = 26)a | ||||||
| | 26.709 | 0.673 | 0.64 | >0.99 | ||
| Gene Length (number of SNPs) | 0.502 | >0.99 | ||||
| b.2) One predictor (n = 24;NRG1/PARK2 out) | ||||||
| | 18.023 | 0.428 | 0.673 | - | ||
a 6 Excluded non-significant variables: a) percentage of intronic SNPs in the gene from each association, b) total number of studies of each association, c) total number of studies performed in Europe, d) total number of studies performed in East Asia, e) average study sample size of each association, and f) average year of study performance in each association
b Conservative set contains those associations that at least in one continent have a replicability index of 50%
c Tol = Tolerance
Population-specific test on the long-term evolutionary status for the SNPs from the 890 associations from the Global Set.
| Replicability | ||||||||
|---|---|---|---|---|---|---|---|---|
| ≤66.67% | 19017 | 4533 | 18529 | 4991 | 18623 | 4919 | 19816 | 3757 |
| (n = 441) | ||||||||
| ≥66.67% | 19783 | 5173 | 19271 | 5620 | 19334 | 5556 | 20789 | 4187 |
| (n = 441) | ||||||||
| p-value | p-value | p-value | p-value | |||||
a CEU = North Americans (Utah) of Northern European ancestry; CHB = Chinese from Beijing; JPT = Japanese from Tokyo; YRI = Yorubans from Ibadan (Nigeria)
b ANC = SNPs with an ancestral major allele (freq ≥ 0.5); DER = SNPs with a derived major allele (freq ≥ 0.5)