| Literature DB >> 26910538 |
Seung Hoan Choi1,2,3, Daniela Ruggiero4, Rossella Sorice4, Ci Song5,6,7, Teresa Nutile4, Albert Vernon Smith8,9, Maria Pina Concas10, Michela Traglia11, Caterina Barbieri11, Ndeye Coumba Ndiaye12, Maria G Stathopoulou12, Vasiliki Lagou13,14, Giovanni Battista Maestrale10, Cinzia Sala11, Stephanie Debette1,15,16, Peter Kovacs17, Lars Lind18, John Lamont19, Peter Fitzgerald19, Anke Tönjes20, Vilmundur Gudnason8,9, Daniela Toniolo11, Mario Pirastu10, Celine Bellenguez21,22,23, Ramachandran S Vasan3,24, Erik Ingelsson6, Anne-Louise Leutenegger25,26, Andrew D Johnson5, Anita L DeStefano1,2,3, Sophie Visvikis-Siest12, Sudha Seshadri1,3, Marina Ciullo4.
Abstract
Vascular endothelial growth factor (VEGF) is an angiogenic and neurotrophic factor, secreted by endothelial cells, known to impact various physiological and disease processes from cancer to cardiovascular disease and to be pharmacologically modifiable. We sought to identify novel loci associated with circulating VEGF levels through a genome-wide association meta-analysis combining data from European-ancestry individuals and using a dense variant map from 1000 genomes imputation panel. Six discovery cohorts including 13,312 samples were analyzed, followed by in-silico and de-novo replication studies including an additional 2,800 individuals. A total of 10 genome-wide significant variants were identified at 7 loci. Four were novel loci (5q14.3, 10q21.3, 16q24.2 and 18q22.3) and the leading variants at these loci were rs114694170 (MEF2C, P = 6.79 x 10(-13)), rs74506613 (JMJD1C, P = 1.17 x 10(-19)), rs4782371 (ZFPM1, P = 1.59 x 10(-9)) and rs2639990 (ZADH2, P = 1.72 x 10(-8)), respectively. We also identified two new independent variants (rs34528081, VEGFA, P = 1.52 x 10(-18); rs7043199, VLDLR-AS1, P = 5.12 x 10(-14)) at the 3 previously identified loci and strengthened the evidence for the four previously identified SNPs (rs6921438, LOC100132354, P = 7.39 x 10(-1467); rs1740073, C6orf223, P = 2.34 x 10(-17); rs6993770, ZFPM2, P = 2.44 x 10(-60); rs2375981, KCNV2, P = 1.48 x 10(-100)). These variants collectively explained up to 52% of the VEGF phenotypic variance. We explored biological links between genes in the associated loci using Ingenuity Pathway Analysis that emphasized their roles in embryonic development and function. Gene set enrichment analysis identified the ERK5 pathway as enriched in genes containing VEGF associated variants. eQTL analysis showed, in three of the identified regions, variants acting as both cis and trans eQTLs for multiple genes. Most of these genes, as well as some of those in the associated loci, were involved in platelet biogenesis and functionality, suggesting the importance of this process in regulation of VEGF levels. This work also provided new insights into the involvement of genes implicated in various angiogenesis related pathologies in determining circulating VEGF levels. The understanding of the molecular mechanisms by which the identified genes affect circulating VEGF levels could be important in the development of novel VEGF-related therapies for such diseases.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26910538 PMCID: PMC4766012 DOI: 10.1371/journal.pgen.1005874
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Descriptive statistics for participating cohorts at all stages.
| Study | Sample size | Age | Women %(n) | VEGF(median; 25%-75%) | VEGF(min; mean; max) | Undetectable % (n) | Stage |
|---|---|---|---|---|---|---|---|
| 1548 | 76.16 ± 5.61 | 51.17 (916) | 50.9; 35.3–75.3 | 0.2; 61.5; 715.6 | 0.06 (1) | 1 | |
| 1115 | 51.25 ± 19.29 | 54.98 (613) | 382.8; 232.3–591.4 | 16.6; 447.7; 2046.6 | 0 (0) | 1 | |
| 7048 | 51.50 ± 15.66 | 54.27(3825) | 284.6;161.7–452.1 | 2.0; 342.5; 2718.0 | 0 (0) | 1 | |
| 897 | 52.82 ± 18.17 | 36.57 (328) | 65.4; 27.9–109.2 | 20.8; 86.5; 2690.0 | 20.4 (187) | 1 | |
| 945 | 70.20 ± 0.17 | 50.05 (473) | 187.4; 106.2–316.1 | 10.3; 237.6; 1167.9 | 0 (0) | 1 | |
| 1759 | 55.08 ± 17.99 | 56.45 (993) | 66.3; 13.4–137.2 | 2.7; 106.6; 2429.1 | 23.5 (414) | 1 | |
| 470 | 55.50 ± 20.30 | 57.45 (270) | 393.6; 226.5–585.8 | 34.3; 434.7; 1268.7 | 0 (0) | 2 | |
| 659 | 48.21 ± 16.26 | 64.04 (422) | 49.3; 0–98.4 | 2.4; 75; 1269.4 | 27.8 (183) | 2 | |
| 995 | 55.51 ± 11.12 | 50.45 (502) | 33.3; 20.7–59.0 | 4.7; 47; 448.3 | 0 (0) | 3 | |
| 676 | 30.45 ± 14.03 | 49.85 (337) | 27.0; 16.7–43.2 | 9.7; 35.1; 255.1 | 0 (0) | 3 |
*Study names were abbreviate as Age Gene/Environment Susceptibility Reykjavik Study (AGES), Cilento study (Cilento), Framingham Heart Study (FHS), Ogliastra Genetic Park (OGP), Prospective Investigation of the Vasculature in Uppsala Seniors Study (PIVUS), Val Borbera (VB), a village included in the Cilento study (Gioi), Sorbs population (Sorbs), hypertensive adults (HT) and STANISLAS Family Study (SFS) from Biological Resources Center (BRC) Interactions Gène-Environnement en Physiopathologie CardioVasculaire” (IGE- PCV).
†Age at measured VEGF levels in years.
‡The median and interquartile range were calculated considering the individuals with undetectable levels of VEGF as having 0 pg/ml.
§The analysis stage in which each cohort was used.
Fig 1Flow chart for VEGF meta-analysis study design.
Fig 2Regional plots of top 10 replicated variants in the Stage 1.
Regional association plots show–log10 p-values for all variants ordered by their chromosomal position (build37) within regions of the 10 replicated variants. Panel A shows the 5q14.3 locus. Panels B-D show three independent signals on 6p21.1 locus. Panel E shows the 8q23.1 locus. Panels F-G show two independent signals on 9p24.2 locus. Panel H-J show 10q21.1, 16q24.2 and 18q22.3 loci respectively. The variants were color coded by R-square values with the top variant shaped in diamond. The estimated recombination rates (cM/Mb) were generated by lifting the HapMap Phase II genetic map from build35 to GRCH37 and showed in blue line. The green arrows represent the known genes in each locus. Red dotted line represents the genome-wide significant level (P = 5x10-8). Plots in the panels B, D and F are truncated at p-value = 10−20.
Meta-analysis result of the 10 replicated variants.
| Variant rsID | Chr | Position | Coded | Coded AF | St1.P | St2.P | St3.P | Direction | IV.Beta | IV.SE | IV.P | Het.P | N.Gene | Location |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| rs114694170 | 5 | 88180196 | T | 0.96 | 1.00E-10 | 3.50E-12 | 6.80E-13 | --------+- | -0.15 | 0.023 | 1.09E-11 | 0.034 | Intron | |
| rs34528081 | 6 | 43704417 | T | 0.60 | 1.30E-17 | 3.20E-19 | 1.50E-18 | ---------+ | -0.09 | 0.010 | 1.83E-17 | 0.0002 | Intergenic | |
| rs6921438 | 6 | 43925607 | A | 0.46 | 6.6E-1315 | 2.5E-1425 | 7.4E-1467 | ---------- | -0.64 | 0.008 | 1.66E-1449 | 1.82E-88 | Intergenic | |
| rs1740073 | 6 | 43947398 | T | 0.36 | 8.30E-16 | 8.50E-18 | 2.30E-17 | +++--+++++ | 0.09 | 0.010 | 4.40E-17 | 1.71E-05 | Intergenic | |
| rs6993770 | 8 | 106581528 | A | 0.70 | 1.80E-56 | 1.30E-60 | 2.40E-60 | ++++++++++ | 0.16 | 0.010 | 3.83E-55 | 2.01E-06 | Intron | |
| rs7043199 | 9 | 2621145 | A | 0.21 | 3.10E-09 | 8.20E-12 | 5.10E-14 | ---------- | -0.10 | 0.013 | 4.16E-14 | 0.49 | Intron | |
| rs2375981 | 9 | 2692583 | C | 0.54 | 2.50E-92 | 7.50E-100 | 1.50E-100 | ++++++++++ | 0.21 | 0.010 | 9.49E-99 | 2.00E-11 | Intergenic | |
| rs10761741 | 10 | 65066186 | T | 0.43 | 4.60E-15 | 1.10E-16 | 1.20E-19 | ++++++++++ | 0.08 | 0.009 | 2.99E-19 | 0.82 | Intron | |
| rs4782371 | 16 | 88568831 | T | 0.67 | 2.80E-07 | 1.20E-08 | 1.60E-09 | ---------- | -0.07 | 0.011 | 1.26E-09 | 0.57 | Intron | |
| rs2639990 | 18 | 72915551 | T | 0.91 | 2.50E-07 | 4.20E-08 | 1.70E-08 | ++++++++++ | 0.11 | 0.018 | 5.85E-10 | 0.81 | Intron |
*Coded: Coded allele.
†Coded AF: Coded allele Frequency.
‡St1.P- St2.P-St3.P: Stage1, 2, and 3 p-value.
§Direction: Effect direction of the coded allele on VEGF levels; the sequence of the direction follows the alphabetical order of cohorts at each stage (AGES, Cilento, FHS, OGP, PIVUS, VB, Gioi, Sorbs, HT, SFS).
#IV.Beta: Inverse-variance weighted meta-analysis Beta at Stage 3.
**IV.SE: Inverse-variance weighted meta-analysis standard error at Stage 3.
††IV.P: Inverse-variance weighted meta-analysis p-value at Stage 3.
‡‡Het.P: p-value for Cochran’s Q-statistic for heterogeneity at Stage 3.
||N.Gene: The nearest gene from the variant.
§§rs10761741 is a proxy variant (r2 = 0.97) of rs74506613.
||||rs2639990 is the next lowest p-value variant of rs111939830.
##The novel variants were identified in meta-analysis.
Fig 3Forest plots of the 10 replicated variants.
Panels A-J. The effect size (Beta) and the 95% Confidence Interval (CI) in all discovery and replication studies and the cumulative effect in the inverse-variance meta-analysis are represented for each replicated variant.
Fig 4Network generated by IPA using 26 genes in replicated loci.
Edges describe the characteristics of relationship between the two nodes. Lines between two genes denote verified interaction. The shapes of the nodes represent the functional level of the gene product.
100] was performed. The 1000-genome v3 genotypes were used as reference panel for LD calculation; the physical threshold for clumping was 1 Mb, and the r2 threshold for clumping was 0.1.