| Literature DB >> 22498030 |
Nahid Turan1, Mohamed F Ghalwash, Sunita Katari, Christos Coutifaris, Zoran Obradovic, Carmen Sapienza.
Abstract
BACKGROUND: Infant birth weight is a complex quantitative trait associated with both neonatal and long-term health outcomes. Numerous studies have been published in which candidate genes (IGF1, IGF2, IGF2R, IGF binding proteins, PHLDA2 and PLAGL1) have been associated with birth weight, but these studies are difficult to reproduce in man and large cohort studies are needed due to the large inter individual variance in transcription levels. Also, very little of the trait variance is explained. We decided to identify additional candidates without regard for what is known about the genes. We hypothesize that DNA methylation differences between individuals can serve as markers of gene "expression potential" at growth related genes throughout development and that these differences may correlate with birth weight better than single time point measures of gene expression.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22498030 PMCID: PMC3359247 DOI: 10.1186/1755-8794-5-10
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Correlation of mechanism-based candidate gene expression levels with birth weight
| Gene Symbol | Transcript ID | HumanHT-12 v3 Expression | Real Time RT-PCR Expression | ||
|---|---|---|---|---|---|
| Cord Blood | Placenta | Cord Blood | Placenta | ||
| ILMN_2056087 | 2.0E-04 | 0.017 | nd | nd | |
| ILMN_1709613 | 0.003 | 0.002 | |||
| ILMN_1675048 | 0.009 | nd | nd | ||
| ILMN_1699867 | 1.3E-05 | 0.004 | 1.2E-04 (n = 190) | 4.5E-04 (n = 254) | |
| ILMN_2298035 | 0.008 | 0.001 | |||
| ILMN_2413956 | 0.003 | 0.003 | |||
| ILMN_1733807 | 0.007 | 1.0E-04 | nd | nd | |
| ILMN_1702447 | 0.003 | 0.016 | 1.0E-07 (n = 114) | ||
| ILMN_1807423 | 0.056 | 0.007 | nd | nd | |
| ILMN_1807662 | 0.006 | 3.0E-04 | 0.005 (n = 194) | 6.1E-04 (n = 241) | |
| ILMN_2387385 | 0.014 | 0.001 | ne | ||
| ILMN_1728445 | 0.001 | 0.001 | |||
| ILMN_1725193 | 0.006 | 0.031 | ne | 0.003 (n = 110) | |
| ILMN_1746085 | 0.007 | 0.002 | ne | 0.001 (n = 135) | |
| ILMN_2396875 | 0.009 | 0.002 | |||
| ILMN_1665865 | 0.006 | 0.003 | nd | nd | |
| ILMN_2132982 | 0.014 | 0.002 | nd | nd | |
| ILMN_1750324 | 0.001 | 0.003 | |||
| ILMN_1669362 | 0.001 | 0.009 | nd | nd | |
| ILMN_2062468 | 2.5E-05 | 0.005 | nd | nd | |
| ILMN_1666966 | 0.022 | 0.034 | nd | nd | |
| ILMN_1670918 | 0.031 | nd | nd | ||
| ILMN_1715374 | 0.007 | 3.9E-05 | nd | nd | |
| ILMN_1671557 | 0.036 | 0.001 | nd | nd | |
| ILMN_1815121 | 0.001 | 0.009 | 0.006 (n = 105) | 0.013 (n = 136) | |
| ILMN_2356955 | 0.014 | 0.004 | |||
| n/a | n/a | 0.002 (n = 186) | 0.002 (n = 241) | ||
Multiple entries represent data for multiple transcripts on the array. The best correlation obtained in each group is shown in bold
* Ratio IGF2/IGF2R expression
n/a = not applicable
nd = not done
ne = not expressed
Figure 1Expression levels of mechanism-based candidate genes versus birth weight. Illumina's HumanHT-12 v3 Expression of (A) INSR (transcript ID: ILMN_1670918) in cord blood (n = 48), and (B) IGF1R (transcript ID: ILMN_1675048) in placenta (n = 48). Real time RT-PCR expression of (C) IGF2BP2 in cord blood (n = 119), and (D) IGFBP1 in placenta (n = 150).
Correlation of mechanism-based candidate gene methylation levels with birth weight
| Gene Symbol | GoldenGate CpG ID | GoldenGate Methylation | Infinium | Infinium Methylation | ||
|---|---|---|---|---|---|---|
| Cord Blood | Placenta | Cord Blood | Placenta | |||
| cg17084217 | 0.004 | 0.004 | cg01305421 | 0.005 | 0.007 | |
| cg25163611 | 1.0E-04 | 0.031 | cg14568338 | |||
| cg19714640 | 0.038 | cg22375192 | 0.011 | 0.021 | ||
| cg20742855 | 0.005 | 0.018 | cg02166532 | 0.006 | 0.001 | |
| cg10649864 | 0.007 | 0.077 | cg02807948 | 0.049 | 4.0E-04 | |
| cg17626526 | 0.040 | 0.026 | cg13756879 | 4.0E-04 | 0.001 | |
| cg17084217 | 0.011 | 3.0E-04 | cg20339650 | 0.014 | 4.0E-04 | |
| cg22956483 | 3.0E-04 | 0.001 | ||||
| cg01305421 | 0.032 | 0.003 | ||||
| n/a | n/a | cg06638433 | 0.005 | 0.044 | ||
| cg13877465 | 0.019 | 8.3E-05 | ||||
| n/a | n/a | cg18234011 | 0.005 | 0.024 | ||
| cg24450631 | 0.005 | 0.006 | ||||
| cg00508334 | 3.5E-05 | 0.028 | cg02860543 | 0.049 | 1.2E-05 | |
| cg21413760 | 0.062 | 3.1E-07 | cg19042950 | 1.2E-05 | 0.002 | |
| cg07148501 | 0.009 | 0.076 | cg00230368 | 0.007 | 0.014 | |
| cg12721534 | 0.014 | 0.063 | cg14556618 | 8.4E-05 | 1.0E-04 | |
| cg20666158 | 0.015 | 0.059 | cg05660795 | 0.033 | 0.014 | |
| cg23864854 | 0.048 | 0.028 | cg27447599 | 0.021 | 0.018 | |
| cg07828219 | 0.032 | 0.018 | cg25854162 | 0.004 | 0.011 | |
| cg17207942 | 0.035 | 0.001 | cg26187237 | 6.7E-05 | 0.015 | |
| cg12826145 | 0.023 | 0.012 | cg04796162 | 0.014 | 0.036 | |
| cg14625938 | 0.001 | 0.010 | cg06713098 | 0.027 | 0.002 | |
| cg08831744 | 0.001 | 0.003 | ||||
| cg15898840 | 0.026 | 0.003 | ||||
| cg22083798 | 0.029 | 0.042 | ||||
| cg03940014 | 0.054 | 0.008 | cg00512374 | 0.008 | 0.022 | |
| cg22392383 | 0.018 | 0.042 | ||||
| cg20419545 | 0.066 | 0.001 | cg19008649 | 0.021 | 0.005 | |
| cg24617085 | 0.067 | 0.017 | cg22467567 | 0.001 | 0.006 | |
| cg00122038 | 0.009 | 0.011 | cg01773854 | 0.051 | 1.0E-04 | |
| cg22732012 | 0.072 | 2.0E-04 | cg08629913 | 0.024 | 0.003 | |
| cg00431950 | 0.023 | 0.037 | cg00884221 | 0.002 | 0.001 | |
| cg16546204 | 0.026 | 0.014 | cg03876618 | 3.3E-05 | 0.001 | |
| cg13349859 | 0.001 | 0.020 | cg00613255 | 0.001 | 0.005 | |
| cg14426263 | 0.008 | 0.005 | cg03366382 | 1.0E-04 | 0.044 | |
| cg13993218 | 0.003 | 0.012 | ||||
| cg25336198 | 0.005 | 0.008 | ||||
| cg05427477 | 0.002 | 0.084 | cg01263716 | n/a | n/a | |
| cg19110381 | 0.072 | 0.001 | cg01505590 | |||
| cg03637064 | 0.019 | cg04720330 | 4.0E-05 | 0.062 | ||
| cg18242686 | 0.024 | 0.006 | cg11961618 | 0.039 | 0.014 | |
| cg14415214 | 0.001 | |||||
| cg21259253 | 4.0E-04 | 0.031 | ||||
| cg26799802 | 3.0E-04 | 0.035 | ||||
| cg00702231 | 0.019 | 0.031 | ||||
| cg07077459 | 0.006 | |||||
| cg10923987 | 0.002 | 0.052 | cg08263357 | 3.8E-06 | 0.006 | |
| cg12757684 | 0.067 | 0.062 | cg12757684 | 0.001 | 0.013 | |
| cg14161241 | 0.002 | 0.030 | ||||
| cg17895149 | 0.001 | 0.009 | ||||
| cg22378065 | 0.017 | 0.034 | ||||
| cg25350411 | 0.002 | 0.001 | ||||
| cg00613255 | 0.007 | 0.003 | ||||
| cg03366382 | 0.010 | 0.001 | ||||
| cg25871270 | 0.001 | 0.065 | n/a | n/a | ||
| cg19731870 | 0.002 | 0.008 | ||||
Multiple entries represent data from multiple CpG sites. The best correlation obtained in each group is shown in bold
* IGF2/H19 differentially methylated region (DMR)
n/a = not applicable i.e. no probes on array
Figure 2Methylation levels of candidate genes versus birth weight. "Beta value" is fraction of methyl C observed at the PGRMC1 (CpG ID: cg19606309) site (A) and RGS14 (CpG ID: cg09010421) site (B) on Illumina GoldenGate array in placenta (n = 22, in duplicate). Methylation levels assayed at same site by bisulfite pyrosequencing also correlated with birth weight for PGRMC1 (C) and RGS14 (D) in placenta (n = 13).
Candidate genes whose methylation is correlated with birth weight
| Data Set | L1-regularized regression | Tissue | Genes in model | Gene ID | |
|---|---|---|---|---|---|
| 0.78 | 21/22 | Blood | Apolipoprotein E | ||
| Msh homeobox 1 | |||||
| Placenta | Growth factor receptor-bound protein 10 | ||||
| Progesterone receptor membrane component 1 | |||||
| Regulator of G-protein signaling 14 | |||||
| Serine hydroxymethyl transferase 2 (mitochondrial) | |||||
| 0.50 | 44/48 | Blood and placenta, as above | Six genes, above | ||
| 0.70 | 45/48 | Blood | Atpase, H + transporting, lysosomal accessory protein 1 | ||
| Protease, serine, 21 (testisin) | |||||
| REST co-repressor 1 | |||||
| Placenta | Angiopoietin 4 | ||||
| Cyclin-dependent kinase 2 | |||||
| Envoplakin | |||||
| FLJ37478: N-acetyltransferase 8-like (GCN5-related, putative) | |||||
| 0.84 | 44/48 | Blood and Placenta, as above | All 13 genes from both experiments, combined |
1 The maximum L1-regularized regression correlation obtained for the gene model in which more than 90% of the "leave one out" cross-validations exhibited non-zero regression parameters (third column)
Figure 3Methylation levels of candidate genes versus birth weight. "Beta value" is the fraction of methyl C observed at the ANGPT4 (CpG ID: cg26540515) site (A) in placenta and PRSS21 (CpG ID: cg21085768) site (B) in cord blood on the Illumina Infinium array (n = 48). Methylation levels assayed at same site by bisulfite pyrosequencing also correlated with birth weight for ANGPT4 (C) in placenta (n = 26) and PRSS21 (D) in cord blood (n = 25).
Correlation between DNA methylation and transcription of candidate genes
| Tissue | Methylation Genes | CpG ID | Transcript ID | ||
|---|---|---|---|---|---|
| cg24697031 | ILMN_1727288 | -0.30 | 0.04 | ||
| cg06386517 | ILMN_1669617 | 0.34 | 0.02 | ||
| cg20651681 | 0.39 | 0.01 | |||
| cg06790324 | 0.29 | 0.04 | |||
| cg03104936 | 0.29 | 0.05 | |||
| cg03104936 | ILMN_1652662 | 0.37 | 0.01 | ||
| cg06386517 | ILMN_2340919 | 0.33 | 0.02 | ||
| cg20651681 | 0.34 | 0.02 | |||
| cg24183958 | 0.38 | 0.01 | |||
| cg06790324 | 0.39 | 0.01 |
1 Pearson correlation coefficient
Correlation between DNA methylation and transcription in the candidate genes
| Tissue | Methylation Candidate | CpG ID | Gene Transcript | Transcript ID | ||
|---|---|---|---|---|---|---|
| cg14167596 | APOE | ILMN_1740938 | 0.76 | < 0.001 | ||
| cg11930592 | ATP6AP1 | ILMN_1697694 | 0.32 | 0.03 | ||
| cg15755084 | 0.32 | 0.03 | ||||
| cg20891301a | -0.302 | 0.04 | ||||
| cg26615830 | 0.32 | 0.03 | ||||
| cg15696627 | 0.44 | 0.002 | ||||
| cg03717979 | 0.51 | < 0.001 | ||||
| cg15755084 | PRSS21 | ILMN_2382964 | -0.38 | 0.01 | ||
| cg15696627 | -0.37 | 0.01 | ||||
| cg20588069 | -0.42 | 0.003 | ||||
| cg06677140 | -0.29 | 0.04 | ||||
| cg09573795 | ILMN_1774256 | -0.34 | 0.02 | |||
| cg03199651 | -0.30 | 0.04 | ||||
| cg20588069 | -0.36 | 0.01 | ||||
| cg22609784 | -0.31 | 0.03 | ||||
| cg15696627 | RCOR1 | ILMN_1743421 | -0.28 | 0.05 | ||
| cg03717979 | ILMN_1743421 | -0.29 | 0.04 | |||
| cg06677140 | ILMN_1743421 | -0.38 | 0.01 | |||
| cg09106999 | GRB10 | ILMN_1667771 | -0.323 | 0.03 | ||
| cg00129774 | ILMN_1669617 | 0.51 | < 0.001 | |||
| cg00129774 | ILMN_2340919 | 0.46 | 0.001 | |||
| cg04108502 | 0.37 | 0.01 | ||||
| cg09304040 | PGRMC1 | ILMN_1684771 | -0.34 | 0.02 | ||
| cg09106999 | RGS14 | ILMN_1696828 | -0.31 | 0.03 | ||
| cg24697031 | EVPL | ILMN_1727288 | -0.30 | 0.04 | ||
| cg20651681 | CDK2 | ILMN_1653443 | -0.29 | 0.04 | ||
| cg15774495 | -0.28 | 0.05 | ||||
| cg06790324 | -0.36 | 0.01 | ||||
| cg06386517 | GRB10 | ILMN_1669617 | 0.34 | 0.02 | ||
| cg20651681 | 0.39 | 0.01 | ||||
| cg06790324 | 0.29 | 0.04 | ||||
| cg03104936 | 0.29 | 0.05 | ||||
| cg03104936 | ILMN_1652662 | 0.37 | 0.01 | |||
| cg06386517 | ILMN_2340919 | 0.33 | 0.02 | |||
| cg20651681 | 0.34 | 0.02 | ||||
| cg24183958 | 0.38 | 0.01 | ||||
| cg06790324 | 0.39 | 0.01 | ||||
| cg20651681 | PGRMC1 | ILMN_1684771 | -0.30 | 0.04 | ||
| cg03104936 | -0.29 | 0.05 | ||||
| cg20651681 | RGS14 | ILMN_1696828 | -0.34 | 0.02 | ||
| cg08211091 | GRB10 | ILMN_1669617 | -0.31 | 0.03 |
1 Pearson correlation coefficient
2 MSX1 has five CpGs that are positively correlated with ATP6AP1, however, cg20891301, which is anomalously negatively correlated is located at the end of the CpG island
3 CDK2 has three CpGs that are positively correlated with GRB10, however, one CpG, cg09106999 is negatively correlated
Candidate genes whose transcript levels are correlated with birth weight
| Data Set | L1-regularized regression | Tissue | Genes in model | Description | |
|---|---|---|---|---|---|
| 0.55 | 45/48 | Blood | HS.406106 | BX090408 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGp998E08415; IMAGE:211951 | |
| LOC255130 | PREDICTED: Homo sapiens hypothetical LOC255130 (LOC255130) | ||||
| Placenta | HS.568324 | AGENCOURT_7975600 NIH_MGC_113 Homo sapiens cDNA clone IMAGE:6215286 5 | |||
| HS.572889 | DA236664 BRAWH3 Homo sapiens cDNA clone BRAWH3033381 5 | ||||
| NBPF10 | Homo sapiens neuroblastoma breakpoint family |
Five random permutation models with higher R2 than the L1 model and the adjusted R2 when tested on the Infinium data-set
| Data Set | Adjusted R2 and stability when tested on Infinium Data | Genes in model | Gene names | |
|---|---|---|---|---|
| 0.86 | 0.59 (46/48) | ADAM metallopeptidase domain 9 | ||
| dihydropyrimidinase-like 3 | ||||
| fatty acid binding protein 5 | ||||
| homeobox B4 | ||||
| CIITA, class II, major histocompatibility complex, transactivator | ||||
| C2orf56, chromosome 2 open reading frame 56 | ||||
| 0.82 | negative | growth factor receptor-bound protein 10 | ||
| PLA2G16, phospholipase A2, group XVI | ||||
| myosin, heavy chain 14, non-muscle | ||||
| wingless-type MMTV integration site family, member 16 | ||||
| 0.81 | 0.48 (44/48) | bestrophin 1 | ||
| IMP (inosine 5'-monophosphate) dehydrogenase 2 | ||||
| oxysterol binding protein-like 5 | ||||
| paired box 3 | ||||
| proteasome (prosome, macropain) 26S subunit, ATPase, 3 | ||||
| serpin peptidase inhibitor, clade F (alpha-2 antiplasmin, pigment epithelium derived factor), member 1 | ||||
| 0.80 | 0 (45/48) | chromobox homolog 1 | ||
| eomesodermin | ||||
| poly(A) binding protein, cytoplasmic 4 (inducible form) | ||||
| phosphoinositide-3-kinase, catalytic, gamma polypeptide | ||||
| solute carrier family 16, member 1 (monocarboxylic acid transporter 1) | ||||
| thymopoietin | ||||
| 0.79 | negative | TMEM9B, TMEM9 domain family, member B | ||
| chaperonin containing TCP1, subunit 3 (gamma) | ||||
| myosin, heavy chain 9, non-muscle | ||||
| prospero homeobox 1 | ||||
| RE1-silencing transcription factor | ||||
| ribosomal protein S2 |
Each of the two random permutation six-gene models that also had positive R2 in the Infinium data set (from Table 7) were combined with each random permutation seven-gene model that achieved an R2 higher than the L1 Infinium model (25 models) for a total of 50, 13 gene models
| Data Set | Genes in model | Adjusted R2 and stability when tested on Infinium Data | Genes in resulting model | Gene name |
|---|---|---|---|---|
| 0.87 (44/48) | Cortactin | |||
| IMP (inosine 5'-monophosphate) dehydrogenase 2 | ||||
| oxysterol binding protein-like 5 | ||||
| paired box 3 | ||||
| proteasome (prosome, macropain) 26S subunit, ATPase, 3 | ||||
| regenerating islet-derived 1 beta | ||||
| RuvB-like 1 (E. coli) | ||||
| serpin peptidase inhibitor, clade F (alpha-2 antiplasmin, pigment epithelium derived factor), member 1 | ||||
| vacuolar protein sorting 52 homolog | ||||
Only one of the 50 models achieved higher R2 than the L1 13 gene model. Genes in bold have suggested role in fetal or placental growth and development
Top ten genes whose transcript levels are correlated with methylation of CpG sites in MSX1, CDK2 and GRB10
| Tissue | Methylation Gene | CpG ID | Expression Gene | Transcript ID | Gene Name | |
|---|---|---|---|---|---|---|
| cg14167596 | ILMN_1740938 | 0.76 | Apolipoprotein E | |||
| cg14167596 | ILMN_1734176 | 0.70 | Glycoprotein Hormones, Alpha Polypeptide | |||
| cg03199651 | KRT6C | ILMN_1754576 | 0.69 | Keratin 6 C | ||
| cg14167596 | PAPPA | ILMN_1721770 | 0.67 | Protein Kinase C And Casein Kinase Substrate in Neurons 1 | ||
| cg14167596 | ILMN_1693397 | 0.67 | Pregnancy Specific Beta-1-Glycoprotein 4 | |||
| cg26615830 | DCN | ILMN_2347145 | 0.65 | Decorin | ||
| cg14167596 | ILMN_2309615 | 0.65 | Pregnancy Specific Beta-1-Glycoprotein 6 | |||
| cg14167596 | ILMN_1693617 | 0.65 | Chorionic somatomammotropin hormone 1 (placental lactogen) | |||
| cg14167596 | ILMN_1659354 | 0.65 | Growth Hormone 2 | |||
| cg14167596 | ILMN_1726266 | 0.65 | ADAM Metallopeptidase Domain 12 | |||
| cg04108502 | CXCL11 | ILMN_2067890 | 0.74 | Chemokine (C-X-C Motif) Ligand 11 | ||
| cg04108502 | HLA-DPB1 | ILMN_1749070 | 0.70 | Major Histocompatibility Complex, Classii, DP Beta 1 | ||
| cg04108502 | CXCL9 | ILMN_1745356 | 0.68 | Chemokine (C-X-C Motif) Ligand 9 | ||
| cg04108502 | GBP4 | ILMN_1771385 | 0.68 | Guanylate Binding Protein 4 | ||
| cg04108502 | GBP5 | ILMN_2114568 | 0.67 | Guanylate Binding Protein 5 | ||
| cg04108502 | UBD | ILMN_1678841 | 0.66 | Ubiquitin D | ||
| cg04108502 | VCY | ILMN_1683872 | 0.66 | Variable charge, Y-linked | ||
| cg04108502 | HLA-DRB3 | ILMN_1717261 | 0.66 | Major Histocompatibility Complex, Class II, DR Beta 3 | ||
| cg04108502 | CD3D | ILMN_2261416 | 0.65 | CD3d molecule, delta (CD3-TCR complex) | ||
| cg04108502 | CETP | ILMN_1681882 | 0.65 | Cholesteryl ester transfer protein, plasma | ||
| cg20651681 | SHROOM2 | ILMN_1681777 | 0.68 | Shroom Family Member 2 | ||
| cg20651681 | ILMN_1781565 | 0.67 | Mesoderm Development Candidate 1 | |||
| cg20651681 | CCDC146 | ILMN_1790555 | 0.67 | Coiled-Coil Domain Containing 146 | ||
| cg20651681 | VANGL2 | ILMN_1715647 | 0.67 | Vang-Like2 (Vangogh, Drosophila) | ||
| cg06790324 | SCG2 | ILMN_1703178 | 0.66 | Secretogranin II | ||
| cg20651681 | STGC3 | ILMN_1807244 | 0.66 | hypothetical STGC3 | ||
| cg06790324 | ABHD14B | ILMN_2227533 | 0.66 | Ab Hydrolase Domain Containing 14B | ||
| cg20651681 | ILMN_1699814 | 0.66 | Tolloid-Like 1 | |||
| cg20651681 | SOD1 | ILMN_1662438 | -0.65 | Superoxide Dismutase 1, Soluble | ||
| cg06790324 | INPP5E | ILMN_1811301 | 0.65 | Inositol polyphosphate-5-phosphatase, 72 kDa |
Genes in bold have suggested role in fetal or placental growth and development
1 Pearson correlation coefficient (P ≤ 0.01)