Literature DB >> 25281659

A novel common variant in DCST2 is associated with length in early life and height in adulthood.

Ralf J P van der Valk1, Eskil Kreiner-Møller2, Marjolein N Kooijman1, Mònica Guxens3, Evangelia Stergiakouli4, Annika Sääf5, Jonathan P Bradfield6, Frank Geller7, M Geoffrey Hayes8, Diana L Cousminer9, Antje Körner10, Elisabeth Thiering11, John A Curtin12, Ronny Myhre13, Ville Huikari14, Raimo Joro15, Marjan Kerkhof16, Nicole M Warrington17, Niina Pitkänen18, Ioanna Ntalla19, Momoko Horikoshi20, Riitta Veijola21, Rachel M Freathy22, Yik-Ying Teo23, Sheila J Barton24, David M Evans25, John P Kemp25, Beate St Pourcain26, Susan M Ring27, George Davey Smith4, Anna Bergström5, Inger Kull28, Hakon Hakonarson29, Frank D Mentch6, Hans Bisgaard2, Bo Chawes2, Jakob Stokholm2, Johannes Waage2, Patrick Eriksen2, Astrid Sevelsted2, Mads Melbye30, Cornelia M van Duijn31, Carolina Medina-Gomez32, Albert Hofman33, Johan C de Jongste34, H Rob Taal35, André G Uitterlinden32, Loren L Armstrong8, Johan Eriksson9, Aarno Palotie36, Mariona Bustamante37, Xavier Estivill38, Juan R Gonzalez3, Sabrina Llop39, Wieland Kiess10, Anubha Mahajan40, Claudia Flexeder41, Carla M T Tiesler11, Clare S Murray12, Angela Simpson12, Per Magnus42, Verena Sengpiel43, Anna-Liisa Hartikainen44, Sirkka Keinanen-Kiukaanniemi14, Alexandra Lewin45, Alexessander Da Silva Couto Alves45, Alexandra I Blakemore46, Jessica L Buxton46, Marika Kaakinen47, Alina Rodriguez48, Sylvain Sebert14, Marja Vaarasmaki49, Timo Lakka50, Virpi Lindi15, Ulrike Gehring51, Dirkje S Postma52, Wei Ang53, John P Newnham53, Leo-Pekka Lyytikäinen54, Katja Pahkala55, Olli T Raitakari56, Kalliope Panoutsopoulou57, Eleftheria Zeggini57, Dorret I Boomsma58, Maria Groen-Blokhuis58, Jorma Ilonen59, Lude Franke60, Joel N Hirschhorn61, Tune H Pers62, Liming Liang63, Jinyan Huang64, Berthold Hocher65, Mikael Knip66, Seang-Mei Saw67, John W Holloway68, Erik Melén69, Struan F A Grant29, Bjarke Feenstra7, William L Lowe8, Elisabeth Widén9, Elena Sergeyev10, Harald Grallert70, Adnan Custovic12, Bo Jacobsson71, Marjo-Riitta Jarvelin72, Mustafa Atalay15, Gerard H Koppelman73, Craig E Pennell53, Harri Niinikoski74, George V Dedoussis75, Mark I Mccarthy76, Timothy M Frayling22, Jordi Sunyer77, Nicholas J Timpson4, Fernando Rivadeneira32, Klaus Bønnelykke2, Vincent W V Jaddoe78.   

Abstract

Common genetic variants have been identified for adult height, but not much is known about the genetics of skeletal growth in early life. To identify common genetic variants that influence fetal skeletal growth, we meta-analyzed 22 genome-wide association studies (Stage 1; N = 28 459). We identified seven independent top single nucleotide polymorphisms (SNPs) (P < 1 × 10(-6)) for birth length, of which three were novel and four were in or near loci known to be associated with adult height (LCORL, PTCH1, GPR126 and HMGA2). The three novel SNPs were followed-up in nine replication studies (Stage 2; N = 11 995), with rs905938 in DC-STAMP domain containing 2 (DCST2) genome-wide significantly associated with birth length in a joint analysis (Stages 1 + 2; β = 0.046, SE = 0.008, P = 2.46 × 10(-8), explained variance = 0.05%). Rs905938 was also associated with infant length (N = 28 228; P = 5.54 × 10(-4)) and adult height (N = 127 513; P = 1.45 × 10(-5)). DCST2 is a DC-STAMP-like protein family member and DC-STAMP is an osteoclast cell-fusion regulator. Polygenic scores based on 180 SNPs previously associated with human adult stature explained 0.13% of variance in birth length. The same SNPs explained 2.95% of the variance of infant length. Of the 180 known adult height loci, 11 were genome-wide significantly associated with infant length (SF3B4, LCORL, SPAG17, C6orf173, PTCH1, GDF5, ZNFX1, HHIP, ACAN, HLA locus and HMGA2). This study highlights that common variation in DCST2 influences variation in early growth and adult height.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25281659      PMCID: PMC4447786          DOI: 10.1093/hmg/ddu510

Source DB:  PubMed          Journal:  Hum Mol Genet        ISSN: 0964-6906            Impact factor:   6.150


INTRODUCTION

Fetal and infancy length growth are important measures of development in early life. Early length growth seems to be associated with height in adulthood (1). It has been shown that fetal and infant growth are independently associated with higher risks of cardiovascular disease, type 2 diabetes and many other complex diseases. Previous findings suggested genetic links between fetal growth and metabolism (2,3). However, these studies mainly focused on birth weight as early growth measure. Skeletal growth is a different measure of development in early life. Skeletal growth during fetal life and infancy is a complex trait with heritability estimates of 26–72% (4). Although correlated with each other, fetal, infant and adult skeletal growth may be influenced by different genetic factors. Many common genetic variants have been identified for adult height (5), but not much is known about the genetics of skeletal growth in early life. Although, several rare genetic defects with large effects on length at birth and during infancy have been found (6,7), common genetic variants that influence normal variation in birth and infant length have not yet been identified. Therefore, we aimed to identify common genetic variants influencing early length growth, also in perspective of their effect on adult stature.

RESULTS

To identify common genetic variants associated with birth length, we examined 2 201 971 million directly genotyped and imputed SNPs with birth length in 22 independent discovery studies with genome-wide association (GWA) or Metabochip data (Stage 1; N = 28 459; Fig. 1). Birth length was measured using standardized procedures (Supplementary Material, Tables S1 and S2). Studies with self-reported measurements were excluded a priori. Birth length was standardized using growth analyzer (http://www.growthanalyser.org), transforming birth length into sex- and age-adjusted standard deviation scores (SDS). We used the North-European 1991 reference panel to compare results between studies. We applied linear regression between number of alleles or dosages obtained from imputations and standardized birth length (full details in Materials and Methods).
Figure 1

Study design.

Study design.

Gene identification

In the discovery phase (Stage 1), we found seven independent top SNPs with suggestive evidence of association (P < 1 × 10−6) with birth length (Supplementary Material, Figs. S1 and S2, QQ- and Manhattan plot). Four SNPs mapped to loci already known to be associated with adult height (Supplementary Material, Table S3, LCORL, PTCH1, GPR126 and HMGA2) (5). The 3 SNPs reflecting potentially novel associations were taken forward in nine independent replication studies (Stage 2; N = 11 995; Fig. 1). Only one of the three SNPs displayed significant evidence for replication in Stage 2 and reached genome-wide significance in the joint analysis (Stages 1 + 2; P < 5 × 10−8; Table 1). This novel association arose from SNP rs905938, mapping to chromosome 1q22 in DC-STAMP domain containing 2 (DCST2) (Fig. 2, regional association plot). Each C allele [minor allele frequency (MAF) = 0.24] of rs905938 was associated with an increase (standardized) of 0.046 SDS in birth length (standard error = 0.008, P = 2.46 × 10−8; explained variance = 0.05%). The genome-wide significantly associated SNP showed low degree of heterogeneity between the discovery studies (P = 0.93, I2 = 0%). Figure 3 shows the forest plot of the associations between rs905938[C] and birth length across all studies. Other suggestive loci in the discovery analysis are shown in Supplementary Material, Table S3 (P < 1 × 10−5). Summary statistics of all SNPs are available at http://egg-consortium.org.
Table 1

Summary statistics of the three novel SNPs at P < 1 × 10−6 in the discovery analysis and the replication follow-up results

MarkerMAFβSEPnI2HetP
Discovery (Stage 1)
 rs905938[C] at 1q22 (DCST2)0.240.0500.0102.59 × 10−728 3270.00.930
 rs12545524[G] at 8q22.1 (near GDF6)0.140.0780.0141.54 × 10−822 1706.60.376
 rs11037473[A] at 11p11.2 (nearest genes TTC17-HSD17B12)0.06−0.1090.0212.17 × 10−722 2590.00.735
Replication (Stage 2)
 rs905938[C] at 1q22 (DCST2)0.230.0350.0151.99 × 10−211 908
 rs12545524[G] at 8q22.1 (near GDF6)0.11−0.0120.0174.67 × 10−117 614
 rs11037473[A] at 11p11.2 (nearest genes TTC17-HSD17B12)0.08−0.0350.0208.06 × 10−217 606
Discovery + replication (Stages 1 + 2)
 rs905938[C] at 1q22 (DCST2)0.240.0460.0082.46 × 10−840 235
 rs12545524[G] at 8q22.1 (near GDF6)0.130.0420.0119.08 × 10−539 784
 rs11037473[A] at 11p11.2 (nearest genes TTC17-HSD17B12)0.07−0.0690.0141.49 × 10−639 865

SNPs markers are identified according to their standard rs numbers (NCBI build 36). Independent novel SNPs with a strong suggestive effect in the discovery analysis on birth length are shown (P < 1 × 10−6). SNPs in loci that are known to be associated with adult height were excluded for replication efforts (adult height loci: LCORL, PTCH1, GPR126 and HMGA2). MAF, minor allele frequency; SE, standard error. β reflects differences in standardized birth length per minor allele. P values are obtained from linear regression of each SNP against standardized birth length adjusted for sex and gestational age. We included both GWA and metabochip cohorts in our discovery analysis, rs905938 is on the metabochip, and rs12545524 and rs11037473 are not, this explains the differences in numbers (n). Derived inconsistency statistic I2 and HetP values reflect heterogeneity across discovery studies with the use of Cochran's Q tests.

Figure 2

Regional association plot of 1q22 in the 22 birth length discovery studies (N = 28 459). SNPs are plotted with their P values (as −log10 values; left y-axis) as a function of genomic position (x-axis). Estimated recombination rates (right y-axis) taken from HapMap are plotted to reflect the local LD-structure around the top associated SNP (‘white open diamond’) and the correlated proxies (‘circles’ according to a black-to-gray scale from r2 = 0 to 1). The joint analysis P value of discovery and replication studies is reported with the ‘white square’ (N = 40 235).

Figure 3

Forest plot of the associations between rs905938[C] and birth length. *Replication studies. The ‘black diamond’ indicates the overall effect size and the confidence interval of the 31 studies.

Summary statistics of the three novel SNPs at P < 1 × 10−6 in the discovery analysis and the replication follow-up results SNPs markers are identified according to their standard rs numbers (NCBI build 36). Independent novel SNPs with a strong suggestive effect in the discovery analysis on birth length are shown (P < 1 × 10−6). SNPs in loci that are known to be associated with adult height were excluded for replication efforts (adult height loci: LCORL, PTCH1, GPR126 and HMGA2). MAF, minor allele frequency; SE, standard error. β reflects differences in standardized birth length per minor allele. P values are obtained from linear regression of each SNP against standardized birth length adjusted for sex and gestational age. We included both GWA and metabochip cohorts in our discovery analysis, rs905938 is on the metabochip, and rs12545524 and rs11037473 are not, this explains the differences in numbers (n). Derived inconsistency statistic I2 and HetP values reflect heterogeneity across discovery studies with the use of Cochran's Q tests. Regional association plot of 1q22 in the 22 birth length discovery studies (N = 28 459). SNPs are plotted with their P values (as −log10 values; left y-axis) as a function of genomic position (x-axis). Estimated recombination rates (right y-axis) taken from HapMap are plotted to reflect the local LD-structure around the top associated SNP (‘white open diamond’) and the correlated proxies (‘circles’ according to a black-to-gray scale from r2 = 0 to 1). The joint analysis P value of discovery and replication studies is reported with the ‘white square’ (N = 40 235). Forest plot of the associations between rs905938[C] and birth length. *Replication studies. The ‘black diamond’ indicates the overall effect size and the confidence interval of the 31 studies.

Functional analyses

We assessed common variants with deleterious functional implications in linkage disequilibrium (LD, r2 > 0.80) with rs905938 using HaploReg (8). There were no non-synonymous variants in LD with rs905938. We found three putative functional intronic variants in high LD with rs905938. Details are depicted in Supplementary Material, Table S4. Subsequently, we assessed whether variants in the identified locus were involved in the regulation of messenger RNA expression (eQTLs) in genome-wide expression datasets of lymphoblastoid cell lines (LCLs, N = 1830) (9,10). We found cis eQTLs [false discovery rate (FDR) < 1% account for all SNP-probe pairs that were within 1 Mb of each other) for transcripts of PBXIP1, GBA and ADAM15. Yet, rs905938 and the cis eQTL SNPs were not in perfect LD (r2 < 0.80, Supplementary Material, Table S5). Therefore, we cannot exclude that multiple independent effects arise from the same region of association.

DCST2 and growth phenotypes

We tested the associations of rs905938[C] with ‘fetal growth’ measures in the 1st, 2nd and 3rd trimester of pregnancy in the Generation R Study (N = 5756) (11), infant length at 1 year of age (range 6–18 months; N = 28 228) in the Early Growth Genetics (EGG) consortium (12), and adult height in the Genetic Investigation of Anthropometric Traits (GIANT) consortium (N = 127 513) (5). Rs905938[C] was not associated with ‘fetal growth’ measures, but was associated with infant length and adult height (P < 0.05; Table 2).
Table 2

Associations of rs905938[C] in DCST2 related to birth length with ‘fetal growth’ measures, infant length and adult height

βSEP
Generation R: fetal growth (N = 5756)
First trimester
 Crown-rump length (n = 1126)0.0030.0450.952
Second trimester
 Femur length (n = 5361)−0.0350.0230.129
Third trimester
 Femur length (n = 5532)−0.0150.0220.490
EGG: infant length
 Infant length at 1 year of age (N = 28 228)0.0350.0105.54 × 10−4
GIANT: adult height
 Adult height (N = 127 513)0.0240.0061.45 × 10−5

rs905938 C-allele with a genome-wide significant effect on birth length is shown (P < 5 × 10−8) in relation to ‘fetal growth’ measures, infant length and adult height. SE, standard error. β reflects difference in standard deviation scores per minor allele.

Associations of rs905938[C] in DCST2 related to birth length with ‘fetal growth’ measures, infant length and adult height rs905938 C-allele with a genome-wide significant effect on birth length is shown (P < 5 × 10−8) in relation to ‘fetal growth’ measures, infant length and adult height. SE, standard error. β reflects difference in standard deviation scores per minor allele.

Known adult height loci in relation to birth and infant length

We also explored whether common genetic variants known to be associated with adult height (5) influenced birth length variation. We found that 17 out of 180 known adult height loci were associated with birth length (FDR < 5%, Supplementary Material, Table S6; Fig. 4, QQ-plot of 180 SNPs and birth length). We then calculated a height-increasing-alleles score of the 180 known height loci (5) to predict birth length in the Generation R Study (N = 2085; Fig. 5). The score composed of variants associated with adult height explained 0.13% of the variance in birth length (P = 0.1), in contrast to the ∼10% of the phenotypic variation in adult height reported in the original manuscript (5).
Figure 4

QQ-plots of the 180 known adult height SNPs with birth and infant length. QQ-plot of the 180 known adult height SNPs in association with birth length (upper panel) in 22 studies (N = 28 459) and with infant length (lower panel) in 19 studies (N = 28 238). The black dots represent observed P values and the diagonal lines represent the expected P values under the null distribution.

Figure 5

Height-increasing-alleles score of known adult height SNPs predicting birth and infant length. Genetic risk-allele scores (sum of height-increasing alleles weighted by known effect on adult height (5) transformed to standard deviation Z-scores) in the Generation R study plotted against length adjusted for sex and age. The distribution of the genetic risk-allele score is depicted as bars. (A) Mean birth length plotted against the genetic score (N = 2085). (B) Mean infant length plotted against the genetic score (N = 2385).

QQ-plots of the 180 known adult height SNPs with birth and infant length. QQ-plot of the 180 known adult height SNPs in association with birth length (upper panel) in 22 studies (N = 28 459) and with infant length (lower panel) in 19 studies (N = 28 238). The black dots represent observed P values and the diagonal lines represent the expected P values under the null distribution. Height-increasing-alleles score of known adult height SNPs predicting birth and infant length. Genetic risk-allele scores (sum of height-increasing alleles weighted by known effect on adult height (5) transformed to standard deviation Z-scores) in the Generation R study plotted against length adjusted for sex and age. The distribution of the genetic risk-allele score is depicted as bars. (A) Mean birth length plotted against the genetic score (N = 2085). (B) Mean infant length plotted against the genetic score (N = 2385). To evaluate whether different common genetic variants influenced both birth and infant length, we tested 2 193 675 million SNPs for association with infant length in almost the same set of samples used for the analysis of birth length (19 studies, N = 28 238; Supplementary Material, Table S7). We identified genome-wide significant associations at 11 genetic loci (Supplementary Material, Figs S3 and S4, QQ- and Manhattan plot), which all are known to be associated with adult height (Table 3, SNPs in or near SF3B4, LCORL, SPAG17, C6orf173, PTCH1, GDF5, ZNFX1, HHIP, ACAN, HLA locus and HMGA2) (5,13). In addition, we found that variants in 58 of the adult height loci were associated with infant length at an FDR of 5% (Supplementary Material, Table S8; Fig. 4, QQ-plot of 180 SNPs and infant length). Next, we tested in the Generation R Study (N = 2385) how much of the phenotypic variance in infant length was explained by the score composed of height-increasing-alleles. Variants from the 180 known adult height loci together explained 2.95% of the variance in infant length (P = 3.10 × 10−17, Fig. 5).
Table 3

Summary statistics of the eleven known adult height SNPs in association with infant length at P < 5 × 10−8

MarkerMAFβSEPnI2HetP
rs7536458[G] at 1p12 (SPAG17)0.25−0.0640.0109.61 × 10−11282340.00.403
rs11205303[C] at 1q21.2 (SF3B4)0.340.0870.0111.79 × 10−16265590.00.864
rs1380294[T] at 4p15.31 (LCORL)0.15−0.1080.0142.54 × 10−142307913.70.184
rs1812175[A] at 4q28-q32(HHIP)0.18−0.0680.0112.33 × 10−9282270.00.398
rs592229[G] at (HLA locus)0.430.0480.0092.22 × 10−8282230.60.326
rs9385399[T] at 6q22.32 (C6orf173)0.460.0550.0091.68 × 10−10282240.00.943
rs1984119[C] at 9q22.3 (PTCH1)0.26−0.0630.0101.77 × 10−10281970.00.490
rs7970350[T] at 12q15 (HMGA2)0.49−0.0470.0092.90 × 10−8282260.00.426
rs2280470[A] at 15q26.1 (ACAN)0.360.0530.0096.43 × 10−9274430.00.436
rs143384[G] at 20q11.2 (GDF5)0.440.0580.0092.87 × 10−10282320.00.996
rs1567865[T] at 20q13.13 (ZNFX1)0.210.0630.0101.10 × 10−92822922.50.104

SNPs markers are identified according to their standard rs numbers (NCBI build 36). The total sample includes data of 19 independent datasets (N = 28 238). MAF, minor allele frequency; SE, standard error. β reflects differences in standardized infant length per minor allele. P values are obtained from linear regression of each SNP against standardized infant length adjusted for sex and age. We included both GWA and metabochip cohorts in our discovery analysis, this explains the differences in numbers (n). Derived inconsistency statistic I2 and HetP values reflect heterogeneity across discovery studies with the use of Cochran's Q tests.

Summary statistics of the eleven known adult height SNPs in association with infant length at P < 5 × 10−8 SNPs markers are identified according to their standard rs numbers (NCBI build 36). The total sample includes data of 19 independent datasets (N = 28 238). MAF, minor allele frequency; SE, standard error. β reflects differences in standardized infant length per minor allele. P values are obtained from linear regression of each SNP against standardized infant length adjusted for sex and age. We included both GWA and metabochip cohorts in our discovery analysis, this explains the differences in numbers (n). Derived inconsistency statistic I2 and HetP values reflect heterogeneity across discovery studies with the use of Cochran's Q tests.

DEPICT analysis of birth and infant length

Finally, we used a pathway analysis tool called DEPICT (Pers et al., unpublished data) to prioritize genes at associated regions, search for reconstituted gene sets that were enriched in genes near associated variants, and identify tissue and cell types in which genes from loci associated with birth and infant length were highly expressed (full details in Materials and Methods). For both traits, we used independent SNPs (r2 < 0.05) associated at P < 1 × 10−5, from 21 birth length and 44 infant length loci. There were no pathways significantly overrepresented in the birth length results. In contrast, for infant length DEPICT significantly prioritized nine genes which were overrepresented (FDR < 5%, Supplementary Material, Table S9), including three known Mendelian human stature genes (ACAN, GDF5 and PTCH1) as well as several relevant reconstituted gene sets (e.g. abnormal sternum ossification, regulation of osteoblast proliferation and WNT signaling, Supplementary Material, Table S10). There was no significant enrichment for particular tissue or cell types for any of the two traits.

DISCUSSION

In the present study we identified one previously unknown locus (rs905938 in DCST2 at 1q22) to be associated with birth length at a genome-wide significant level. This common genetic variant was also associated with infant length and adult height. It was not possible to identify eQTLs for transcripts of DCST2 in the MRCA and MRCE databases, as there were no probes available (9). Also, there was no significant eQTL of DCST2 in immortalized LCLs (10). However, DCST2 is a DC-STAMP-like protein family member and DC-STAMP is an important regulator of osteoclast cell-fusion in bone homeostasis (14–16). The transcripts of PBXIP1, GBA and ADAM15 were in weak LD with our lead SNP rs905938. The PBXIP1 protein is known to regulate estrogen receptor functions (17). Mutations in the GBA gene cause Gaucher disease, and strong associations with Parkinson's disease and dementia with Lewy bodies have been described (18–21). ADAM15 is prominently expressed in osteoblasts and to a lesser extent in osteoclasts (22). A study in mice showed that ADAM15 is required for normal skeletal homeostasis and that its absence causes increased nuclear translocation of β-catenin in osteoblasts leading to increased osteoblast proliferation and function, which results in higher trabecular and cortical bone mass (23). The 1q22 locus is a complex region harboring multiple interesting genes that could affect birth length. We emphasize that we could not specifically pinpoint the causal gene(s) as our lead SNP (rs905938) was not in perfect LD with our cis eQTL SNPs. Although, there is some overlap between adult height loci and birth length, which is illustrated by 17 shared loci, the genetic architecture of adult height seems more similar to the genetic architecture of infant length than birth length [58 shared loci for infant length, based on conservative statistical method (FDR)]. One point of consideration for the interpretation of our findings is the potential of measurement error for birth length (24). This may lead to less power to detect novel genetic variants as standard errors of SNPs could be increased. The estimate of the risk-allele score slope of Figure 5 is not influenced by measurement error and the differences in the slopes suggest that birth and infant length are influenced by distinct genetic variants. We found that the SNP effects for birth length of 137 of the 180 established height loci were in the same direction as reported in the GIANT paper (5) (Supplementary Material, Table S6; probability of success = 0.761, P = 6.25 × 10−13). One hundred sixty-two of the 180 loci were in the same direction for infant length (Supplementary Material, Table S8; probability of success = 0.900, P = 2.20 × 10−16). Four SNPs associated with birth length (P < 1 × 10−5) are in or near loci known to be associated with birth weight (LCORL, HMGA2, ADCY5 and ADRB1). LCORL is associated with birth weight, birth length, infant length and adult height, but we could not find an obvious link between the gene and adult-onset diseases. HMGA2 is associated with aortic root size (25), type 2 diabetes (26), and many other traits like tooth development, head circumference and brain structure (12,27). ADCY5 is also associated with type 2 diabetes and ADRB1 with adult blood pressure (2,3). These findings highlight genetic links between fetal growth and metabolism (2,3,26). As we found overlap between genetic variants of birth weight and birth length, we looked-up the effect of rs905938 in DCST2 on birth weight in a previous EGG study (3). Rs905938 was associated with birth weight, but weaker as compared with birth length (β = 0.035 SDS, SE = 0.010, P = 2.35 × 10−4, N = 26 558). In conclusion, in the present study we identified one novel locus (rs905938 in DCST2 at 1q22) associated with birth length at a genome-wide significant level. This common genetic variant was also associated with infant length and adult height, with decreasing magnitude of the associations in later life (0.046 SDS for birth length, 0.035 SDS for infant length and 0.024 SDS for adult height). To our knowledge, no phenotype has been previously associated with the DCST2 gene and while the gene is expressed in osteoclasts, its function should be further studied.

MATERIALS AND METHODS

Stage 1: discovery genome-wide association analyses of birth length

We combined 21 population-based studies with GWA or Metabochip data and birth length available (total N = 28 459 individuals). One of our discovery cohorts had two independent sub-samples within their study leading to a total of 22 independent GWA/Metabochip sub-samples for our analysis: one sub-sample from the Avon Longitudinal Study of Parents and Children (ALSPAC, GWA, n = 4816); Children, Allergy, Milieu, Stockholm, Epidemiology [Swedish] (BAMSE, GWA, n = 423); Children's Hospital Of Philadelphia (CHOP, GWA, n = 432); Copenhagen Study on Asthma in Childhood 2000 (COPSAC-2000, GWA, n = 348); Copenhagen Study on Asthma in Childhood Registry (COPSAC-Registry, GWA, n = 1111); Danish National Birth Cohort (DNBC, GWA, n = 932); Generation R Study (Generation R, GWA, n = 2085); Hyperglycemia and Adverse Pregnancy Outcomes study (HAPO, GWA, n = 1325); Helsinki Birth Cohort Study (HBCS, GWA, n = 1572); Infancia y Medio Ambiente (INMA, GWA, n = 848); Leipzig Childhood Obesity cohort (LEIPZIG, Metbochip, n = 607); Lifestyle Immune System Allergy study (LISA, GWA, n = 552); Manchester Asthma and Allergy Study (MAAS, GWA, n = 402); Norwegian Mother and Child Cohort study (MOBA, GWA, n = 832); Northern Finland Birth Cohorts 1966 (NFBC66, GWA, n = 4642); Northern Finland Birth Cohorts 1986 (NFBC86, Metabochip, n = 4652); Physical Activity and Nutrition in Children study (PANIC, Metabochip, n = 319); two sub-samples from the Prevention and Incidence of Asthma and Mite Allergy birth cohort study (PIAMA1, GWA, n = 283; PIAMA2, GWA, n = 195); The Western Australian Pregnancy Cohort Study (RAINE, GWA, n = 1272); Special Turku Coronary Risk Factor Intervention Project (STRIP, Metabochip, n = 614); and TEENs of Attica: Genes and Environment (TEENAGE, GWA, n = 197). While no systematic phenotypic differences were observed between the sub-samples of the PIAMA birth cohort study, they were analyzed separately due to genotyping on different platforms and at different time periods. Genotypes within each study were obtained using high-density SNP arrays and then imputed for ∼2.5 M HapMap SNPs (Phase II, release 22; http://hapmap.ncbi.nlm.nih.gov/). The basic characteristics, exclusions applied (for example, individuals of non-European ancestry, family related individuals), genotyping, quality control and imputation methods for each discovery study are presented in Supplementary Material, Table S1.

Statistical analysis within discovery studies

In all studies, birth length was measured using standardized procedures. Studies with self-reported measurements were excluded a priori. Birth length was standardized using growth analyzer (http://www.growthanalyser.org), transforming birth length into sex- and age-adjusted SDS. We used the North-European 1991 reference panel to compare results between studies. Multiple births and twins were excluded from all analyses. We applied linear regression between number of alleles or dosages obtained from imputations and standardized birth length. The GWA analysis per study was performed using MaCH2qtl (28), SNPTEST (29), PLINK (30) or PropABEL (31). The secured data exchange and storage were facilitated by the Erasmus Medical Center, Department of Internal Medicine (32).

Meta-analysis of discovery studies

Prior to meta-analysis, SNPs with a MAF <0.01 and poorly imputed SNPs [r2hat <0.3 (MaCH); proper_info <0.4 (IMPUTE2); R2_BEALE <0.4 (BEAGLE)] were filtered. Genomic control (GC) (33) was applied to adjust the statistics generated within each cohort (see Supplementary Material, Table S1 for individual study λ values). Four out of the twenty-two sub-samples were genotyped on Metabochips. These SNP-arrays were enriched with ‘adult height SNPs'. Normal variation in early length growth seems to be associated with height in adulthood (1). Therefore, we assumed more true-positive hits in these studies and did not apply GC in these studies (GIANT et al., unpublished data). Details of any additional corrections for study specific population structure are given in the Supplementary Material, Table S1. Inverse variance fixed-effects meta-analyses were analyzed using METAL (released 2010-08-01) (34) by two meta-analysts in parallel and blinded to obtain identical results. After the METAL meta-analysis, we filtered SNPs with a MAF <0.05 and SNPs that were not available in at least 12 sub-samples to avoid false-positive findings. We used Cochran's Q test and the derived inconsistency statistic I2 to assess evidence of between-study heterogeneity of the effect sizes. The meta-analysis results were obtained for a total of 2 201 971 SNPs. SNPs that crossed the threshold of P ≤ 1 × 10−6 were considered to represent strong suggestive evidence of association with birth length. SNPs that were already known to be associated with adult height were excluded for the replication analysis (5). The explained variance of the top SNPs were calculated in one of the largest cohorts, the Generation R Study (n = 2085).

Stage 2: replication analysis of top birth length SNPs

In the discovery phase, we found seven independent SNPs with strong suggestive evidence of association (P < 1 × 10−6) with birth length. Four SNPs were already known to be associated with adult height (5). These SNPs were excluded for follow-up analyses. The three remaining novel SNPs were followed-up in replication studies. We included both GWA and Metabochip studies in our discovery analysis. Rs905938 was on our Metabochips, and rs12545524 and rs11037473 were not. This results in differences in numbers for our top SNPs in the discovery and replication analyses. rs905938 was taken forward in 9 independent replication studies (N = 11 995), rs12545524 and rs11037473 in 13 independent replication studies including the four discovery Metabochip studies (N = 17 679). Details of the replication studies are presented in Supplementary Material, Table S2. Within the replication studies, we analyzed the association between number of alleles and standardized birth length. Combined effect estimates and heterogeneity between cohorts was calculated using fixed effects meta-analyses in R Version 2.8.1 (The R foundation for Statistical Computing, library rmeta). Top SNPs that crossed the significant threshold of P-replication ≤0.05 and the widely accepted genome-wide significance threshold of P ≤ 5 × 10−8 for all studies combined were considered to represent robust evidence of association with birth length. The institutional review boards for human studies approved the protocols and written consent was obtained from the participating subjects or their caregivers if required by the institutional review board.

DEPICT analysis

We used the novel Data-driven Expression-Prioritized Integration for Complex Traits (DEPICT) method (Pers et al., unpublished data). DEPICT is designed to systematically identify the most likely causal gene at a given locus, gene sets that are enriched in genetic associations, and tissues and cell types in which genes from associated loci are highly expressed. First, DEPICT assigns genes to associated SNPs using LD r2 > 0.5 distance to define locus boundaries, merges overlapping loci and discards loci mapping within the extended major histocompatibility complex region (chromosome 6, base pairs 25 000–35 000). Next, the DEPICT method prioritizes genes within a given associated locus based on the genes' functional similarity to genes from other associated loci. Genes that are highly similar to genes from other loci obtain low prioritization P values, and simulated GWAS results are used to adjust for gene length bias as well as other potential confounders. There can be several prioritized genes in a given locus. Next, DEPICT conducts gene set enrichment analysis by testing whether genes in associated loci enrich for reconstituted versions of known pathways, gene sets as well as protein complexes. Leveraging the guilt by association hypothesis that genes co-expressing with genes from a given gene set are likely to be part of that gene set (see Cvejic et al. (35), for details), the gene set reconstitution is accomplished by identifying genes that were co-expressed with genes in a given gene set based on a panel of 77 840 gene expression microarrays. Gene sets from the following repositories were reconstituted: 5984 protein complexes that were derived from 169 810 high-confidence experimentally derived protein–protein interactions (36); 2473 phenotypic gene sets derived from 211 882 gene–phenotype pairs from the Mouse Genetics Initiative (37); 737 Reactome database pathways (38); 184 KEGG database pathways (39); and 5083 Gene Ontology database terms (40). Finally, DEPICT conducts tissue and cell type enrichment analysis, by testing whether genes in associated loci are highly expressed in any of 209 Medical Subject Heading annotations of 37 427 microarrays from the Affymetrix U133 Plus 2.0 Array platform (see Wood et al. (41) and Geller et al. (42) for previous applications of DEPICT). In this work, 21 autosomal SNPs for birth length and 44 autosomal SNPs for infant length were used as input to DEPICT resulting in 21 and 41 non-overlapping loci, respectively, that covered a total of 34 genes and 83 genes, respectively. The gene prioritization, gene set enrichment and tissue/cell type enrichment analyses were run using the default settings in DEPICT.

SUPPLEMENTARY MATERIAL

Supplementary Material is available at

FUNDING

R.M.F. is supported by a Sir Henry Wellcome Postdoctoral Fellowship (Wellcome Trust grant 085541/Z/08/Z). T.H.P. is supported by The Danish Council for Independent Research Medical Sciences (FSS) The Alfred Benzon Foundation. B.F. is supported by an Oak Foundation fellowship. M.M. is a Wellcome Trust Senior Investigator (Wellcome Trust grant 090532) and a NIHR Senior Investigator. T.M.F. is supported by the European Research Council grant: SZ-245 50371-GLUCOSEGENES-FP7-IDEAS-ERC. F.R. (VIDI 016.136.367) and V.W.V.J. (VIDI 016.136.361) received grants from the Netherlands Organization for Health Research and Development. The other authors did not receive funding for this manuscript.
  44 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

2.  Genomic control for association studies.

Authors:  B Devlin; K Roeder
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

3.  Reliability of length measurements in full-term neonates.

Authors:  T S Johnson; J L Engstrom; J A Warda; M Kabat; B Peters
Journal:  J Obstet Gynecol Neonatal Nurs       Date:  1998 May-Jun

4.  Cloning and initial characterization of mouse meltrin beta and analysis of the expression of four metalloprotease-disintegrins in bone cells.

Authors:  D Inoue; M Reid; L Lum; J Krätzschmar; G Weskamp; Y M Myung; R Baron; C P Blobel
Journal:  J Biol Chem       Date:  1998-02-13       Impact factor: 5.157

5.  MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes.

Authors:  Yun Li; Cristen J Willer; Jun Ding; Paul Scheet; Gonçalo R Abecasis
Journal:  Genet Epidemiol       Date:  2010-12       Impact factor: 2.135

6.  IGF-I receptor mutations resulting in intrauterine and postnatal growth retardation.

Authors:  M Jennifer Abuzzahab; Anke Schneider; Audrey Goddard; Florin Grigorescu; Corinne Lautier; Eberhard Keller; Wieland Kiess; Jürgen Klammt; Jürgen Kratzsch; Doreen Osgood; Roland Pfäffle; Klemens Raile; Berthold Seidel; Robert J Smith; Steven D Chernausek
Journal:  N Engl J Med       Date:  2003-12-04       Impact factor: 91.245

7.  Variants in ADCY5 and near CCNL1 are associated with fetal growth and birth weight.

Authors:  Rachel M Freathy; Dennis O Mook-Kanamori; Ulla Sovio; Inga Prokopenko; Nicholas J Timpson; Diane J Berry; Nicole M Warrington; Elisabeth Widen; Jouke Jan Hottenga; Marika Kaakinen; Leslie A Lange; Jonathan P Bradfield; Marjan Kerkhof; Julie A Marsh; Reedik Mägi; Chih-Mei Chen; Helen N Lyon; Mirna Kirin; Linda S Adair; Yurii S Aulchenko; Amanda J Bennett; Judith B Borja; Nabila Bouatia-Naji; Pimphen Charoen; Lachlan J M Coin; Diana L Cousminer; Eco J C de Geus; Panos Deloukas; Paul Elliott; David M Evans; Philippe Froguel; Beate Glaser; Christopher J Groves; Anna-Liisa Hartikainen; Neelam Hassanali; Joel N Hirschhorn; Albert Hofman; Jeff M P Holly; Elina Hyppönen; Stavroula Kanoni; Bridget A Knight; Jaana Laitinen; Cecilia M Lindgren; Wendy L McArdle; Paul F O'Reilly; Craig E Pennell; Dirkje S Postma; Anneli Pouta; Adaikalavan Ramasamy; Nigel W Rayner; Susan M Ring; Fernando Rivadeneira; Beverley M Shields; David P Strachan; Ida Surakka; Anja Taanila; Carla Tiesler; Andre G Uitterlinden; Cornelia M van Duijn; Alet H Wijga; Gonneke Willemsen; Haitao Zhang; Jianhua Zhao; James F Wilson; Eric A P Steegers; Andrew T Hattersley; Johan G Eriksson; Leena Peltonen; Karen L Mohlke; Struan F A Grant; Hakon Hakonarson; Gerard H Koppelman; George V Dedoussis; Joachim Heinrich; Matthew W Gillman; Lyle J Palmer; Timothy M Frayling; Dorret I Boomsma; George Davey Smith; Chris Power; Vincent W V Jaddoe; Marjo-Riitta Jarvelin; Mark I McCarthy
Journal:  Nat Genet       Date:  2010-04-06       Impact factor: 38.330

8.  METAL: fast and efficient meta-analysis of genomewide association scans.

Authors:  Cristen J Willer; Yun Li; Gonçalo R Abecasis
Journal:  Bioinformatics       Date:  2010-07-08       Impact factor: 6.937

9.  Examination of the relationship between variation at 17q21 and childhood wheeze phenotypes.

Authors:  Raquel Granell; A John Henderson; Nicholas Timpson; Beate St Pourcain; John P Kemp; Susan M Ring; Karen Ho; Stephen B Montgomery; Emmanouil T Dermitzakis; David M Evans; Jonathan A C Sterne
Journal:  J Allergy Clin Immunol       Date:  2012-11-13       Impact factor: 10.793

10.  New loci associated with birth weight identify genetic links between intrauterine growth and adult height and metabolism.

Authors:  Momoko Horikoshi; Hanieh Yaghootkar; Dennis O Mook-Kanamori; Ulla Sovio; H Rob Taal; Branwen J Hennig; Jonathan P Bradfield; Beate St Pourcain; David M Evans; Pimphen Charoen; Marika Kaakinen; Diana L Cousminer; Terho Lehtimäki; Eskil Kreiner-Møller; Nicole M Warrington; Mariona Bustamante; Bjarke Feenstra; Diane J Berry; Elisabeth Thiering; Thiemo Pfab; Sheila J Barton; Beverley M Shields; Marjan Kerkhof; Elisabeth M van Leeuwen; Anthony J Fulford; Zoltán Kutalik; Jing Hua Zhao; Marcel den Hoed; Anubha Mahajan; Virpi Lindi; Liang-Kee Goh; Jouke-Jan Hottenga; Ying Wu; Olli T Raitakari; Marie N Harder; Aline Meirhaeghe; Ioanna Ntalla; Rany M Salem; Karen A Jameson; Kaixin Zhou; Dorota M Monies; Vasiliki Lagou; Mirna Kirin; Jani Heikkinen; Linda S Adair; Fowzan S Alkuraya; Ali Al-Odaib; Philippe Amouyel; Ehm Astrid Andersson; Amanda J Bennett; Alexandra I F Blakemore; Jessica L Buxton; Jean Dallongeville; Shikta Das; Eco J C de Geus; Xavier Estivill; Claudia Flexeder; Philippe Froguel; Frank Geller; Keith M Godfrey; Frédéric Gottrand; Christopher J Groves; Torben Hansen; Joel N Hirschhorn; Albert Hofman; Mads V Hollegaard; David M Hougaard; Elina Hyppönen; Hazel M Inskip; Aaron Isaacs; Torben Jørgensen; Christina Kanaka-Gantenbein; John P Kemp; Wieland Kiess; Tuomas O Kilpeläinen; Norman Klopp; Bridget A Knight; Christopher W Kuzawa; George McMahon; John P Newnham; Harri Niinikoski; Ben A Oostra; Louise Pedersen; Dirkje S Postma; Susan M Ring; Fernando Rivadeneira; Neil R Robertson; Sylvain Sebert; Olli Simell; Torsten Slowinski; Carla M T Tiesler; Anke Tönjes; Allan Vaag; Jorma S Viikari; Jacqueline M Vink; Nadja Hawwa Vissing; Nicholas J Wareham; Gonneke Willemsen; Daniel R Witte; Haitao Zhang; Jianhua Zhao; James F Wilson; Michael Stumvoll; Andrew M Prentice; Brian F Meyer; Ewan R Pearson; Colin A G Boreham; Cyrus Cooper; Matthew W Gillman; George V Dedoussis; Luis A Moreno; Oluf Pedersen; Maiju Saarinen; Karen L Mohlke; Dorret I Boomsma; Seang-Mei Saw; Timo A Lakka; Antje Körner; Ruth J F Loos; Ken K Ong; Peter Vollenweider; Cornelia M van Duijn; Gerard H Koppelman; Andrew T Hattersley; John W Holloway; Berthold Hocher; Joachim Heinrich; Chris Power; Mads Melbye; Mònica Guxens; Craig E Pennell; Klaus Bønnelykke; Hans Bisgaard; Johan G Eriksson; Elisabeth Widén; Hakon Hakonarson; André G Uitterlinden; Anneli Pouta; Debbie A Lawlor; George Davey Smith; Timothy M Frayling; Mark I McCarthy; Struan F A Grant; Vincent W V Jaddoe; Marjo-Riitta Jarvelin; Nicholas J Timpson; Inga Prokopenko; Rachel M Freathy
Journal:  Nat Genet       Date:  2012-12-02       Impact factor: 38.330

View more
  44 in total

1.  The Generation R Study: Biobank update 2015.

Authors:  Claudia J Kruithof; Marjolein N Kooijman; Cornelia M van Duijn; Oscar H Franco; Johan C de Jongste; Caroline C W Klaver; Johan P Mackenbach; Henriëtte A Moll; Hein Raat; Edmond H H M Rings; Fernando Rivadeneira; Eric A P Steegers; Henning Tiemeier; Andre G Uitterlinden; Frank C Verhulst; Eppo B Wolvius; Albert Hofman; Vincent W V Jaddoe
Journal:  Eur J Epidemiol       Date:  2014-12-21       Impact factor: 8.082

Review 2.  Complex Phenotypes: Mechanisms Underlying Variation in Human Stature.

Authors:  Pushpanathan Muthuirulan; Terence D Capellini
Journal:  Curr Osteoporos Rep       Date:  2019-10       Impact factor: 5.096

3.  Distinction between the effects of parental and fetal genomes on fetal growth.

Authors:  Thorhildur Juliusdottir; Valgerdur Steinthorsdottir; Lilja Stefansdottir; Gardar Sveinbjornsson; Erna V Ivarsdottir; Rosa B Thorolfsdottir; Jon K Sigurdsson; Vinicius Tragante; Kristjan E Hjorleifsson; Anna Helgadottir; Michael L Frigge; Gudmundur Thorgeirsson; Rafn Benediktsson; Emil L Sigurdsson; David O Arnar; Thora Steingrimsdottir; Ingileif Jonsdottir; Hilma Holm; Daniel F Gudbjartsson; Gudmar Thorleifsson; Unnur Thorsteinsdottir; Kari Stefansson
Journal:  Nat Genet       Date:  2021-07-19       Impact factor: 38.330

4.  Genetics of early growth traits.

Authors:  Diana L Cousminer; Rachel M Freathy
Journal:  Hum Mol Genet       Date:  2020-09-30       Impact factor: 6.150

Review 5.  Mammalian axoneme central pair complex proteins: Broader roles revealed by gene knockout phenotypes.

Authors:  Maria E Teves; David R Nagarkatti-Gude; Zhibing Zhang; Jerome F Strauss
Journal:  Cytoskeleton (Hoboken)       Date:  2016-01

6.  Genetic loci associated with renal function measures and chronic kidney disease in children: the Pediatric Investigation for Genetic Factors Linked with Renal Progression Consortium.

Authors:  Matthias Wuttke; Craig S Wong; Elke Wühl; Daniel Epting; Li Luo; Anselm Hoppmann; Anke Doyon; Yong Li; Betül Sözeri; Daniela Thurn; Martin Helmstädter; Tobias B Huber; Tom D Blydt-Hansen; Albrecht Kramer-Zucker; Otto Mehls; Anette Melk; Uwe Querfeld; Susan L Furth; Bradley A Warady; Franz Schaefer; Anna Köttgen
Journal:  Nephrol Dial Transplant       Date:  2015-09-28       Impact factor: 5.992

7.  Admixture mapping and fine-mapping of birth weight loci in the Black Women's Health Study.

Authors:  Heather M Ochs-Balcom; Holly Shaw; Leah Preus; Julie R Palmer; Stephen A Haddad; Lynn Rosenberg; Edward A Ruiz-Narváez
Journal:  Hum Genet       Date:  2018-07-13       Impact factor: 4.132

Review 8.  Genomic insights into growth and its disorders: an update.

Authors:  Christiaan de Bruin; Andrew Dauber
Journal:  Curr Opin Endocrinol Diabetes Obes       Date:  2016-02       Impact factor: 3.243

9.  Comprehensive analysis of schizophrenia-associated loci highlights ion channel pathways and biologically plausible candidate causal genes.

Authors:  Tune H Pers; Pascal Timshel; Stephan Ripke; Samantha Lent; Patrick F Sullivan; Michael C O'Donovan; Lude Franke; Joel N Hirschhorn
Journal:  Hum Mol Genet       Date:  2016-01-10       Impact factor: 6.150

10.  Genome-wide Analysis of Body Proportion Classifies Height-Associated Variants by Mechanism of Action and Implicates Genes Important for Skeletal Development.

Authors:  Yingleong Chan; Rany M Salem; Yu-Han H Hsu; George McMahon; Tune H Pers; Sailaja Vedantam; Tonu Esko; Michael H Guo; Elaine T Lim; Lude Franke; George Davey Smith; David P Strachan; Joel N Hirschhorn
Journal:  Am J Hum Genet       Date:  2015-04-09       Impact factor: 11.025

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.