| Literature DB >> 33270668 |
Lisa Uechi1, Mahjoubeh Jalali1, Jayson D Wilbur2, Jonathan L French2, N L Jumbe3, Michael J Meaney4,5, Peter D Gluckman6, Neerja Karnani7,8, Nikita A Sakhanenko1, David J Galas1.
Abstract
The genetic mechanisms of childhood development in its many facets remain largely undeciphered. In the population of healthy infants studied in the Growing Up in Singapore Towards Healthy Outcomes (GUSTO) program, we have identified a range of dependencies among the observed phenotypes of fetal and early childhood growth, neurological development, and a number of genetic variants. We have quantified these dependencies using our information theory-based methods. The genetic variants show dependencies with single phenotypes as well as pleiotropic effects on more than one phenotype and thereby point to a large number of brain-specific and brain-expressed gene candidates. These dependencies provide a basis for connecting a range of variants with a spectrum of phenotypes (pleiotropy) as well as with each other. A broad survey of known regulatory expression characteristics, and other function-related information from the literature for these sets of candidate genes allowed us to assemble an integrated body of evidence, including a partial regulatory network, that points towards the biological basis of these general dependencies. Notable among the implicated loci are RAB11FIP4 (next to NF1), MTMR7 and PLD5, all highly expressed in the brain; DNMT1 (DNA methyl transferase), highly expressed in the placenta; and PPP1R12B and DMD (dystrophin), known to be important growth and development genes. While we cannot specify and decipher the mechanisms responsible for the phenotypes in this study, a number of connections for further investigation of fetal and early childhood growth and neurological development are indicated. These results and this approach open the door to new explorations of early human development.Entities:
Year: 2020 PMID: 33270668 PMCID: PMC7714163 DOI: 10.1371/journal.pone.0242684
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Significant dependencies among neurological phenotypes (Bayley) and growth parameters (Growth) with 2-way (top sub-table) and 3-way effects (middle and bottom sub-tables) were observed.
| Bayley | Growth / Bayley | Growth | P-values |
|---|---|---|---|
| Cognitive | 1.76E-05 | ||
| Adaptive | 3.29E-04 | ||
| Social-emotional | 4.7E-04 | ||
| Language | 2.7E-04 | ||
| Social-emotional | 1.293E-03 | ||
| Adaptive | 1.905E-03 | ||
| Cognitive | 2.257E-03 | ||
| Adaptive | 2.416E-03 | ||
| Social-emotional | Adaptive | 2.22E-06 | |
| Social-emotional | Adaptive | 3.97E-06 | |
| Language | ITSEA | 5.69E-06 | |
| Adaptive | CBCL | 7.96E-06 | |
| Language | Social-emotional | 8.87E-06 | |
| Language | Adaptive | 1.02E-05 |
Note that the change in statistical significance after multiple hypothesis correction here is small. The composite Cognitive Bayley scale score and the growth rate parameter alpha showed the strongest 2-way dependencies, as measured by the p-value (described in Methods section 4.4.) There was a clear relationship between robust head growth and the Bayley phenotypes for both 2-way and 3-way dependencies. With the exception of the two shown in the bottom sub-table, the other child-specific neurological phenotype data such as Infant Toddler Social Emotional Assessment (ITSEA) and Child Behavior Check List (CBCL) were observed in 3-way dependency with weaker significance levels (p-values > 10−4) and are not reported here.
Significant SNPs associated with Bayley phenotypes using two-way dependency measures (mutual information, MI).
| Bayley phenotype | SNP | Gene | MI | p-value |
|---|---|---|---|---|
| Motor | rs10833478 | NELL1 | 0.0843 | 3.409E-07 |
| Motor | rs645026 | YEATS4 | 0.0832 | 4.398E-07 |
| Adaptive | rs7387693 | MTMR7 | 0.0847 | 9.542E-07 |
| Adaptive | rs1291359 | HTR7P1 | 0.0829 | 1.545E-06 |
| Social-Emotional | rs11628108 | C14orf177 | 0.0826 | 1.563E-06 |
| Adaptive | rs4955988 | CACNA2D3 | 0.0817 | 2.134E-06 |
| Motor | rs1449848 | CPNE8 | 0.0772 | 2.156E-06 |
| Motor | rs7155811 | TMEM260 | 0.0772 | 2.165E-06 |
| Social-Emotional | rs1161106 | LOC100507175 | 0.0811 | 2.284E-06 |
| Language | rs11658800 | ELAC2 | 0.0803 | 2.475E-06 |
| Language | rs7239403 | SMIM21 | 0.0803 | 2.514E-06 |
| Social-Emotional | rs12434723 | C14orf177 | 0.0805 | 2.608E-06 |
| Adaptive | rs7462219 | MTMR7 | 0.0807 | 2.667E-06 |
SNPs are ordered by p-values for the mutual information (see Methods section 4.3). Note that the positions of the SNP’s are indicated for the human genome build hg19. Note also that rs7239403 is closest to the non-coding RNA gene LINC01898, and SMIM21 is the closest protein-coding gene. The loci at MTMR7 are the only pair of variants in the same gene. These MI values are not adjusted for ethnicity confounding effects (see section 4.4.2).
Fig 1Manhattan plot of SNPs with Bayley phenotype dependence.
Y axis shows p-values (negative log scale) of pairwise dependence of SNPs with Bayley phenotypes at 24 months (see Table 2). SNPs with p-value<2.7x10-6 (red line) are highlighted and labeled.
Fig 8The distributions of phenotypes by genotype for three of the pairwise genetic variant effects.
The numbers of subjects with each genotype are shown under each panel. a. NELL1 shows a distribution that suggests a strong dominance of the major allele for the Bayley scale score distribution. b. PPP1R12B shows a diametrically opposite distribution between the homozygote and heterozygote. c. PLD5 shows the same as in b) but with a distinct homozygous minor distribution.
Significant SNPs associated with each of growth phenotypes (linf, lambda and alpha) using two-way dependency measures (mutual information, MI).
| Growth parameter | SNP | Gene | MI | p-value |
|---|---|---|---|---|
| Linf | rs12734338 | PPP1R12B | 0.1130 | 2.229E-09 |
| Lambda | rs6672510 | PLD5 | 0.0316 | 3.120E-08 |
| Linf | rs7071157 | PFKFB3 | 0.0276 | 4.079E-07 |
| Linf | rs6710428 | CERS6 | 0.0264 | 8.826E-07 |
| Lambda | rs4793500 | CASC17 | 0.0259 | 1.163E-06 |
| Alpha | rs9691259 | IGFBP3 | 0.0258 | 1.299E-06 |
| Linf | rs6570627 | UTRN | 0.0250 | 2.202E-06 |
| Alpha | rs7101173 | MIR6072 | 0.0248 | 2.543E-06 |
| Linf | rs6884117 | C5orf22 | 0.0241 | 3.858E-06 |
| Lambda | rs564266 | NTM | 0.0241 | 4.054E-06 |
| Lambda | rs7075547 | LRRTM3 | 0.0237 | 5.153E-06 |
| Lambda | rs773024 | OSTF1 | 0.0236 | 5.530E-06 |
| Linf | rs373680 | FBXO33 | 0.0232 | 6.936E-06 |
| Lambda | rs154444 | ZNF608 | 0.0232 | 7.081E-06 |
| Linf | rs7981995 | DACH1 | 0.0232 | 7.181E-06 |
| Alpha | rs10196354 | ERBB4 | 0.0232 | 7.228E-06 |
The SNPs are ordered by p-value of the unadjusted mutual information (see Methods section 4.3) and the 16 loci that have p-value better than 8x10-6 are shown. For convenience the nearest gene to the SNP is indicated, even when there is a significant distance between them. Note that none of these SNPs are in the coding regions.
Fig 2Manhattan plot of SNPs with growth parameter dependence.
Y-axis shows p-values (negative log scale) of pairwise dependence of SNPs with three growth variables: a) linf, b) lambda, and c) alpha. SNPs with p-value<8x10-6 (red line) are highlighted and labeled.
Statistically significant pleiotropic SNPs dependent with growth phenotypes (linf, lambda and alpha) and Bayley phenotypes using three-way dependency.
| Chr | DBSNP | Nearest Gene | Major | Minor | Bayley Phenotype | Growth Phenotype | P-Val |
|---|---|---|---|---|---|---|---|
| 17 | rs178850 | RAB11FIP4 | C | T | Adaptive | 2.02E-08 | |
| 19 | rs2228611 | DNMT1 | C | A | Adaptive | 6.46E-08 | |
| 5 | rs6878810 | LHFPL2 | T | G | Social-Emotional | 1.13E-07 | |
| 16 | rs9933359 | LINC01082 | C | T | Adaptive | 2.26E-07 | |
| 1 | rs584297 | LOC105373115 | A | G | Adaptive | 3.15E-07 | |
| 1 | rs525410 | LAMC2 | C | T | Social-Emotional | 4.56E-07 | |
| 1 | rs1798246 | PRDM16 | A | G | Adaptive | 4.68E-07 | |
| 4 | rs5020219 | ANKRD17 | T | C | Adaptive | 5.97E-07 | |
| 18 | rs2663842 | ATP8B1 | C | A | Adaptive | 6.62E-07 | |
| 14 | rs12587898 | IFI27L1 | G | T | Adaptive | 6.70E-07 | |
| 12 | rs4763682 | PRB4 | A | C | Adaptive | 6.70E-07 | |
| 6 | rs2064317 | TULP1 | A | G | Adaptive | 7.00E-07 | |
| 15 | rs936214 | PAK6 | C | T | Language | 7.54E-07 | |
| 1 | rs12030971 | DEPDC1-AS1 | A | G | Language | 7.91E-07 | |
| 8 | rs11787410 | CSMD1 | G | A | Adaptive | 8.84E-07 | |
| 11 | rs10769889 | LMO1 | T | G | Social-Emotional | 8.88E-07 | |
| 7 | rs1123227 | LINC01448 | G | A | Social-Emotional | 9.56E-07 | |
| 16 | rs4843851 | IRF8 | A | C | Adaptive | 1.02E-06 | |
| 9 | rs10869192 | PIP5K1B | G | A | Adaptive | 1.04E-06 | |
| 3 | rs1131364 | FETUB | C | A | Adaptive | 1.07E-06 | |
| 1 | rs4846400 | ESRRG | T | C | Adaptive | 1.16E-06 | |
| 9 | rs4741967 | SLC1A1 | T | C | Social-Emotional | 1.21E-06 | |
| 22 | rs5751491 | LINC01639 | A | G | Adaptive | 1.32E-06 | |
| 2 | rs2164807 | ATOH8 | A | G | Adaptive | 1.39E-06 | |
| 12 | rs11608306 | PRB2 | A | G | Adaptive | 1.50E-06 | |
| 11 | rs11224253 | JRKL-AS1 | C | T | Language | 1.51E-06 | |
| 2 | rs4668039 | STK39 | C | T | Adaptive | 1.53E-06 | |
| 9 | rs9918994 | LINC00587 | T | C | Adaptive | 1.54E-06 | |
| 5 | rs252242 | LHFPL2 | A | G | Social-Emotional | 1.54E-06 | |
| 12 | rs987547 | PPFIBP1 | G | T | Social-Emotional | 1.56E-06 | |
| 20 | rs6123702 | CTCFL | C | T | Adaptive | 1.60E-06 | |
| 16 | rs2938729 | CDH8 | G | T | Language | 1.61E-06 | |
| 13 | rs11618394 | LINC00397 | T | C | Adaptive | 1.71E-06 | |
| 10 | rs10824914 | FAM21EP | T | C | Adaptive | 1.80E-06 | |
| 8 | rs10106310 | LINC00534 | C | A | Social-Emotional | 1.92E-06 | |
| 9 | rs2150696 | TYRP1 | T | C | Language | 2.01E-06 | |
| 22 | rs4822901 | LOC105372981 | C | T | Social-Emotional | 2.12E-06 | |
| 9 | rs871981 | TYRP1 | C | A | Adaptive | 2.13E-06 | |
| 12 | rs7976966 | RIMBP2 | C | A | Adaptive | 2.15E-06 | |
| 16 | rs9933765 | LOC101928737 | G | A | Social-Emotional | 2.17E-06 | |
| 9 | rs7029138 | LINC00587 | G | T | Adaptive | 2.19E-06 | |
| 12 | rs7139030 | NTN4 | A | C | Social-Emotional | 2.27E-06 | |
| 5 | rs770172 | FBXL17 | C | T | Social-Emotional | 2.34E-06 | |
| 1 | rs2643885 | SKI | A | G | Adaptive | 2.42E-06 | |
| 12 | rs4965006 | PUS1 | T | G | Language | 2.43E-06 | |
| 16 | rs9939461 | LOC101928737 | T | C | Social-Emotional | 2.60E-06 | |
| 4 | rs2725270 | ABCG2 | T | C | Adaptive | 2.66E-06 | |
| 8 | rs7822130 | TRAPPC9 | G | A | Social-Emotional | 2.67E-06 | |
| 16 | rs7200646 | LOC146513 | A | C | Adaptive | 2.87E-06 | |
| 18 | rs7232905 | BOD1L2 | G | A | Adaptive | 2.88E-06 | |
| 5 | rs10075788 | GPR150 | T | G | Language | 2.89E-06 | |
| 2 | rs2374380 | LOC388942 | C | T | Language | 3.13E-06 | |
| 3 | rs2310229 | ACPP | T | C | Adaptive | 3.16E-06 |
All 53 loci have a p-value better than 3.2x10-6. These results were obtained based on 428 subjects. The SNPs are ordered by p-values.
Fig 3The Manhattan plots show significant SNPs for pleiotropy with growth parameters and Bayley phenotypes.
Y-axis shows p-values (negative log scale) of three-way dependencies of SNPs, Bayley phenotypes, and each of the three Growth parameters: a) linf, b) lambda, and c) alpha. SNPs with p-value<3.2x10-6 (red line) are highlighted and labeled.
Loci on X and/or Y chromosomes with p-values < 1.5x10-5.
| Chr | Gender | DBSNP | Nearest Gene | Major | Minor | Bayley Phenotype | Growth Phenotype | P-Val | N |
|---|---|---|---|---|---|---|---|---|---|
| XY | Male | rs5949162 | LOC107985677 | G | A | Social-emotional | - | 1.92E-07 | 227 |
| XY | Male | rs306875 | SPRY3 | T | G | Adaptive | 1.15E-06 | 223 | |
| X | Female | rs5972504 | DMD | T | C | Adaptive | 5.64E-06 | 205 | |
| XY | Male | rs7054955 | Intergenic | T | C | Adaptive | 8.62E-06 | 223 | |
| XY | Male | rs28562204 | DHRSX | C | A | - | 1.08E-05 | 551 | |
| X | Female | rs989011 | GLRA2 | T | C | - | 1.37E-05 | 502 | |
| X | Female | rs2445644 | DMD | G | T | Adaptive | 1.45E-05 | 205 | |
| XY | Male | rs311043 | CD99 | T | G | Language | 1.46E-05 | 223 |
The SNPs are listed according to p-values. Note that this table includes both pairwise and 3-way dependencies, which are indicated by the presence of one or two phenotypes (indicated in the labeled columns). The numbers of subjects used are different because dependencies were computed separately for males and females, and the missing values were different in each case.
Summary of neurological development variants.
| Chr | rsID | Nearest Gene | Bayley Phenotype | Growth Phenotype | P-Val | N | Relevant Expression & effects of variants | Literature Reference |
|---|---|---|---|---|---|---|---|---|
| 17 | rs178850 | RAB11FIP4 | Adaptive | 2.02E-08 | 428 | Highly brain specific (7-10-fold higher than other tissues), affects neurofibroma growth, next to NF1 and OMG | Bartelt-Kirbach | |
| 1 | rs6672510 | PLD5 | 3.12E-08 | 1053 | High expression in brain, then adrenal and ovary, previously associated with autism and multiple sclerosis. | Anney | ||
| 5 | rs6878810 | LHFPL2 | Social-Emotional | 1.13E-07 | 428 | Genetic modifier of Parkinson's age of onset, new AD susceptibility locus | Hill-Burns | |
| XY-PAR1 | rs5949162 | LOC107985677 | Social-emotional | * | 1.92E-07 | 227 (M) | high expression in brain and endometrium | |
| 1 | rs584297 | LOC105373115, (near STUM) | Adaptive | 3.15E-07 | 428 | STUM codes for highly brain specific, mechanosensory protein | Desai | |
| 1 | rs525410 | LAMC2 | Social-Emotional | 4.56E-07 | 428 | LAMC1 is just upstream, highly placenta specific, LAMC2 is near NMNAT2, brain specific, cognitive traits candidate | Bi, | |
| 8 | rs7387693 | MTMR7 | Adaptive | * | 9.54E-07 | 433 | Highly brain specific, candidate for Creutzfeldt-Jacob risk, smoking cessation | Sanchez-Juan |
| 12 | rs1291359 | HTR7P1 | Adaptive | * | 1.55E-07 | 433 | Serotonin receptor pseudogene, SNP is in promoter region of HEBP1 (Immune system related) | |
| 8 | rs11787410 | CSMD1 | Adaptive | 8.84E-07 | 428 | High brain and testis expression, linked to schizophrenia risk | Sakamoto | |
| 11 | rs10769889 | LMO1 | Social-Emotional | 8.88E-07 | 428 | Pediatric neuroblastoma susceptibility candidate, expressed in adrenal, brain and skin | Oldridge | |
| 3 | rs4955988 | CACNA2D3 | Adaptive | * | 2.13E-06 | 433 | highest expression in brain, next in adrenal, calcium channel subunit, role in regulating iron uptake | Baeza-Richer, Carlos |
| 8 | rs7462219 | MTMR7 | Adaptive | * | 2.67E-06 | 433 | Highly brain specific, candidate for Creutzfeldt-Jacob risk, smoking cessation | Sanchez-Juan |
| XY—PAR2 | rs306875 | SPRY3 | Adaptive | 1.15E-06 | 223 (M) | Brain expression, Placental expression, autism candidate | Ning | |
| 9 | rs4741967 | SLC1A1 | Social-Emotional | 1.21E-06 | 428 | Glutamate transporter implicated in epilepsy, linked to OCD risk | Afshari | |
| 22 | rs5751491 | LINC01639, just upstream of MPPED1 | Adaptive | 1.32E-06 | 428 | Almost exclusively brain expressed, region linked to schizophrenia risk, bipolar disorder and brain malignancies | Chen | |
| 17 | rs11658800 | ELAC2 | Language | * | 2.48E-06 | 433 | High expression in the brain tissues | |
| 16 | rs2938729 | CDH8 | Language | 1.61E-06 | 428 | Highly brain-specific | ||
| 11 | rs10833478 | NELL1 | Motor | * | 3.41E-07 | 433 | Brain and kidney specific, interacts with neural EGFL | Shen et al., 2016 [ |
| 14 | rs7155811 | TMEM260 | Motor | * | 2.17E-06 | 433 | Implicated in pediatric neural development | Ta-Shma |
| 3 | rs7611656 | TRANK1* | Adaptive | 3.36E-06 | 428 | High endometrial expression, bipolar candidate | Chen | |
| 14 | rs11628108 | C14orf177 | Social-Emotional | * | 1.56E-06 | 433 | Testis expression, linked to late onset Alzheimer's | Kunkle |
| 14 | rs12434723 | C14orf177 | Social-Emotional | * | 2.61E-06 | 433 | Testis expression, linked to late onset Alzheimer's | Kunkle |
Features of a collection of the genes linked to 22 SNPs that we have tied to the neurological traits, having p-values < 5x10-6 and either: high brain levels of expression or specificity of expression, or published phenotypic effects related to neurological functions in human studies. The notation (M) in the N column indicates that the dependency was determined for male subjects only. TRANK1, marked with an asterisk, falls just below the p-value threshold so does not appear in Table 4.
Summary of some variants linked to fetal and early childhood development.
| Chr | rsID | Nearest Gene | Bayley Phenotype | Growth Phenotype | P-Val | N | Relevant Expression & effects of variants | Literature Reference |
|---|---|---|---|---|---|---|---|---|
| 1 | rs12734338 | PPP1R12B | * | 2.23E-09 | 1053 | protein phosphatase 1 regulatory subunit, expression in heart, skeletal muscle, brain and endometrium, link to celiac disease & asthma in children | Moorhead | |
| 1 | rs6672510 | PLD5 | * | 3.12E-08 | 1053 | High expression in brain, adrenal and ovary | Anney | |
| 19 | rs2228611 | DNMT1 | Adaptive | 6.46E-08 | 428 | DNA methyl-transferase—Highest expression in placenta, transcript level associates with placental weight | Mukhopadhyay | |
| 10 | rs7071157 | PFKFB3 | * | 4.08E-07 | 1053 | High expression in skeletal muscles, regulates glycolysis & cyclin-dependent kinase 1 Links glucose metabolism to cell proliferation, involved in brain development (GO:0007420) | Kessler & Eschrich, 2001 [ | |
| 1 | rs525410 | LAMC2 | Social-Emotional | 4.56E-07 | 428 | Expressed in several fetal tissues and placenta | ||
| 1 | rs1798246 | PRDM16 | Adaptive | 4.68E-07 | 428 | Linked to obesity, heart function and T2D | Pérez-Belmonte | |
| 4 | rs5020219 | ANKRD17 | Adaptive | 5.97E-07 | 428 | widespread expression, interacts with cyclin-dependent kinase 2 | ||
| 14 | rs12587898 | IFI27L1 | Adaptive | 6.70E-07 | 428 | expression high in in testis, adrenal and ovary, linked to anthropometric traits (height, weight etc.) | ||
| 15 | rs936214 | PAK6 | Language | 7.54E-07 | 428 | High expression in brain tissues, Kinase involved in cell proliferation and adhesion, placental expression | ||
| 1 | rs12030971 | DEPDC1-AS1 | Language | 7.91E-07 | 428 | regulates mitotic progression, placental expression | Mi | |
| 7 | rs1123227 | LINC01448 | Social-Emotional | 9.56E-07 | 428 | Placenta and testis specific expression | ||
| 7 | rs9691259 | IGFBP3 | * | 1.30E-06 | 1053 | Prolongs half-life of IGFs, high expression in placenta. Low levels linked to aging and cell senescence | Hong and Kim, 2018 [ | |
| 3 | rs1131364 | FETUB | Adaptive | 1.07E-06 | 428 | Liver-specific expression, linked to osteogenesis and bone resorption, regulation of the insulin and hepatocyte growth factor receptors | ||
| 1 | rs4846400 | ESRRG | Adaptive | 1.16E-06 | 428 | estrogen-related receptor, regulates DNMT1, involved in bone formation and cell growth regulation | ||
| 2 | rs2164807 | ATOH8 | Adaptive | 1.39E-06 | 428 | Transcription factor, highly expressed in fat cells, linked to muscle regeneration | Güttsches | |
| 12 | rs987547 | PPFIBP1 | Social-Emotional | 1.56E-06 | 428 | tyrosine-phosphatase interacting protein, high expression in heart and placenta | ||
| 20 | rs6123702 | CTCFL | Adaptive | 1.60E-06 | 428 | ZF transcription factor, insulator factor, spermatocyte-specific expression |
Features of a collection of genes linked to 17 SNPs that we have tied to the neurological traits, having p-values < 1.6x10-6. These had either: expression in the placenta or endometrium, or published phenotypic functions related to cell growth and selected other functions that could be connected to early development functions in human studies.
Locus interaction effects detected for notable loci exhibiting single locus effects.
| SNP 1 | Coord 1 | SNP 2 | Coord 2 | Gene 1 | Gene 2 | P-value |
|---|---|---|---|---|---|---|
| rs2228611 | 19_10267077 | rs10424964 | 19_10327812 | DNMT1 | S1PR2 (-) | 5.32E-08 |
| rs1131364 | 3_186370333 | exm2249408 | 10_87772933 | FETUB | GRID1 (-) | 6.22E-07 |
| rs1291359 | 12_13157267 | rs2271025 | 16_66951783 | HTR7P1 | CDH16 (-) | 4.91E-08 |
| rs1291359 | 12_13157267 | rs9374553 | 6_115937666 | HTR7P1 | FRK (-) | 1.01E-07 |
| rs1291359 | 12_13157267 | rs10873367 | 14_86054406 | HTR7P1 | FLRT2 (+) | 1.63E-07 |
| rs1291359 | 12_13157267 | rs11100377 | 4_162530770 | HTR7P1 | FSTL5 (-) | 1.87E-07 |
| rs1291359 | 12_13157267 | rs7232315 | 18_55566232 | HTR7P1 | ATP8B1 (-) | 1.90E-07 |
| rs1291359 | 12_13157267 | rs7552143 | 1_58260994 | HTR7P1 | DAB1 (-) | 2.72E-07 |
| rs1291359 | 12_13157267 | rs220172 | 21_43556691 | HTR7P1 | UMODL1 (+) | 2.93E-07 |
| rs1291359 | 12_13157267 | rs731957 | 16_85492882 | HTR7P1 | GSE1 (+) | 2.95E-07 |
| rs1291359 | 12_13157267 | rs472771 | 1_48556821 | HTR7P1 | SKINTL (-) | 3.52E-07 |
| rs5020219 | 4_74036166 | rs1381014 | 4_73862030 | ANKRD17 | COX18 (-) | 8.23E-08 |
| rs5020219 | 4_74036166 | rs7666763 | 4_73858464 | ANKRD17 | COX18 (-) | 8.06E-07 |
| rs525410 | 1_183176430 | rs13374873 | 1_30280472 | LAMC2 | LOC101929406 | 2.18E-07 |
| rs936214 | 15_40565705 | rs11903255 | 2_167464366 | PAK6 | SCN7A (-) | 1.09E-07 |
| rs936214 | 15_40565705 | rs17025241 | 3_88053396 | PAK6 | HTR1F (+) | 6.88E-07 |
| rs987547 | 12_27715010 | rs1436125 | 12_96299676 | PPFIBP1 | CCDC38 (-) | 1.86E-07 |
| rs178850 | 17_29759235 | rs6665385 | 1_176114487 | RAB11FIP4 | RFWD2 (COP1) | 2.22E-07 |
| rs178850 | 17_29759235 | rs1993451 | 12_125137009 | RAB11FIP4 | SCARB1 (-) | 2.31E-07 |
| rs178850 | 17_29759235 | rs704834 | 1_176189141 | RAB11FIP4 | PAPPA2 (+) | 2.34E-07 |
| rs4741967 | 9_4374278 | rs17741020 | 9_4359689 | SLC1A1 | SLC1A1 (+) | 3.77E-07 |
| rs6123702 | 20_56055633 | rs2075755 | 19_6422888 | CTCFL | KHSRP (-) | 7.57E-08 |
| rs10869192 | 9_71280103 | rs715521 | 22_48454426 | PIP5K1B | LOC284930 (+) | 1.19E-07 |
Gene 1 here indicates a locus that has been noted for several reasons: expression profiles, brain or growth specific known effects, or low p-values in a single locus effect (see Tables 8 and 9). The coordinates are for genome build hg19. The p-values are calculated for the three-way interaction measure (Delta3 for two SNPs, one of which is a and the single locus effect SNP).
Fig 4Distribution of functional consequence using VEP annotation tool and RegulomeDB scores for SNPs linked to neurological and growth phenotypes identified through two-way (mutual information) and three-way (Delta 3) dependency analysis.
a) Distribution of candidate SNPs across the functional locations based on VEP annotation. Most of the SNPs are located in non-coding locations, i.e., intronic and intergenic regions of the genome. b) The RegulomeDB score for the candidate SNPs. The lower the score, the more likely it is that a SNP has a regulatory function. eQTL = expression Quantitative Trait Loci; TF = Transcription Factor.
Fig 5Regulatory interaction network.
Depicted are interactions of transcription factors connected with regulatory SNPs (noted by their nearest gene). Clustering and visualization of the network was carried out using Cytoscape v.3.3.0 (undirected network and betweenness centrality statistics). The degree of nodes (the number of edges per node) is shown with their color, ranging from orange (the highest degree), to yellow, green, and then blue (the lowest degree). In addition, larger nodes correspond to hubs with higher degree. The edges with high betweenness centrality, whose removal would partition the network into connected subnetworks, are depicted by thick, orange lines. The small blue nodes are additional factors connected to the dependent loci. “Orphan genes” (unconnected nodes) are not shown. Nodes with blue, green, and red rings correspond to loci detected with two-way and three-way analysis (see the legend).
Three SNPs identified with tissue specific gene expression of their associated genes and /or eQTLs, using the Genotype-Tissue Expression (GTEx) database (http://gtexportal.org).
| rs12587898 | IFI27L1 | Brain cortex | 0.41 |
| rs1179161 | PNPLA4 | Tibial nerve | 0.42 |
| “ | “ | Frontal cortex | 0.21 |
| “ | “ | Putamen | 0.25 |
| “ | “ | Spinal cord | 0.36 |
| “ | “ | Cortex | -0.39 |
| “ | “ | Caudate | 0.18 |
| rs645026 | YEATS4 | Caudate | -0.45 |
The normalized effect size (NES) defined as the slope of the linear regression of the effect of the alternative (minor) allele relative to the reference (major) allele, based on hg19 reference genome [34].
Fig 6FUMA circular plots of chromatin interactions and eQTLs of lead SNPs.
a) The plot of chromosome 17, showing the lead SNP, rs178850, of RAB11FIP4 gene and its interactors. b) The plot of chromosome 1, illustrating the second lead SNP, rs6672510, of PLD5 gene and its interactors. The outer ring (grey dots) shows the Manhattan plot of all the SNPs in the chromosome, with p < 0.05, and not in LD with the lead SNP. The lead SNPs are indicated with a red dot. Both inner rings indicate the chromosome, with the risk loci highlighted in blue. The links and labels indicate chromatin interactions (orange) and eQTLs (green). When the SNP is mapped by both chromatin interactions and eQTLs, as in the case of rs178850, it is highlighted as red.
Fig 7The Kolmogorov-Smirnov scores for the genetically stratified phenotype distributions (two shown in Fig 8).
The scores, indicating the similarity between the distributions, show the dominance of the major allele for NELL1 and the dominance of the minor allele for PLD5, and MTMR7 (not shown in Fig 8).
Fig 9The Kolmogorov-Smirnov (K-S) test of similarity of distributions for the growth parameter phenotype (linf) genetically stratified according to PAK6 genotype (similarly to Fig 7).
The K-S test is shown for different values of Bayley scale phenotype (Composite language score at 24 months).
Fig 10Flow chart of the process of selecting gene candidates using our dependency measures.
The measures of multi-variable dependencies and Delta are described in Section 4.1.3, and the preprocessing of phenotypes and SNP data are described in Section 4.2. The statistical evaluation is explained in Section 4.3.
Fig 11The preprocessing steps of the genotype data showing the number of SNPs removed from Delta analysis.
Fig 12Distribution of SNPs in the X and Y region for Bayley’s and growth phenotypes.
a) The distributions of Bayley phenotypes by male and female infants used for the two-variable
Parameter estimates for Gompertz model with nonlinear deceleration fit to head circumference data from children between early pregnancy and earlys childhood.
| a) | |||||
| 49.96582 | 0.03487290 | 62416 | 1432.780 | 0 | |
| 5.62027 | 0.01535200 | 62416 | 366.0939 | 0 | |
| 0.10450 | 0.00025640 | 62416 | 407.5520 | 0 | |
| 0.75950 | 0.00148836 | 62416 | 510.2935 | 0 | |
| 0.01667 | 0.00007920 | 62416 | 196.5385 | 0 | |
| b) | |||||
| 1.305845 | |||||
| 0.1648373 | |||||
| 0.0005637008 | |||||
| 0.7218403 | |||||
a) Fixed parameter estimates. The top row shows maximum likelihood estimate for the growth limit parameter linf, the estimated standard error for the maximum likelihood estimator, and the corresponding degrees of freedom (df). t-value is the test statistic (in this case a t statistic) for the null hypothesis that linf = 0. Because the degrees of freedom are so large, the null distribution of the test statistic is essentially a normal distribution, and p-value = 1−2ϕ(|t*|), where ϕ is the standard normal cdf. The additional rows follow the same pattern for the other fixed effect (i.e., population-level) parameters. b) Subject parameter estimates. The maximum likelihood estimate of the variance parameter of the distribution of the subject-level random effects of the growth limit parameter is shown in the first row. The additional rows represent estimates for the other subject-level variance components as labelled.
Results for dependencies adjusted for the confounding effect of ethnicity.
A. is for the Bayley phenotypes, and B is for the growth phenotypes.
| Motor | rs10833478 | NELL1 | 0.0843 | 3.41E-07 | 0.142 | 0.0846 | 3.25E-07 | 4.7% | 0.135 |
| Motor | rs645026 | YEATS4 | 0.0832 | 4.40E-07 | 0.180 | 0.0828 | 4.78E-07 | 8.7% | 0.197 |
| Adaptive | rs7387693 | MTMR7 | 0.0847 | 9.54E-07 | 0.359 | 0.0844 | 1.05E-06 | 9.9% | 0.384 |
| Adaptive | rs7462219 | MTMR7 | 0.0807 | 2.67E-06 | 0.693 | 0.0802 | 2.96E-06 | 11.1% | 0.735 |
| Adaptive | rs4955988 | CACNA2D3 | 0.0817 | 2.13E-06 | 0.617 | 0.0812 | 2.38E-06 | 11.5% | 0.651 |
| Adaptive | rs1291359 | HTR7P1 | 0.0829 | 1.55E-06 | 0.506 | 0.0839 | 1.17E-06 | 24.2% | 0.414 |
| Language | rs11658800 | ELAC2 | 0.0803 | 2.48E-06 | 0.670 | 0.0828 | 1.31E-06 | 46.9% | 0.437 |
| Language | rs7239403 | SMIM21 | 0.0803 | 2.51E-06 | 0.675 | 0.0853 | 7.42E-07 | 70.5% | 0.288 |
| Social-Emotional | rs12434723 | C14orf177 | 0.0805 | 2.61E-06 | 0.680 | 0.0858 | 7.26E-07 | 72.2% | 0.272 |
| Social-Emotional | rs11628108 | C14orf177 | 0.0826 | 1.56E-06 | 0.510 | 0.0881 | 3.59E-07 | 77.0% | 0.143 |
| Motor | rs7155811 | TMEM260 | 0.0772 | 2.16E-06 | 0.610 | 0.0940 | 5.45E-08 | 97.5% | 0.020 |
| Social-Emotional | rs1161106 | LOC100507175 | 0.0811 | 2.28E-06 | 0.634 | 0.0780 | 4.78E-06 | 109.2% | 0.876 |
| Motor | rs1449848 | CPNE8 | 0.0772 | 2.16E-06 | 0.607 | 0.0645 | 5.19E-05 | 2305.1% | 1.000 |
| B | |||||||||
| Linf | rs12734338 | PPP1R12B | 0.1130 | 2.23E-09 | 0 | 0.1141 | 2.23E-09 | 0.0% | 0 |
| Linf | rs7981995 | DACH1 | 0.0232 | 7.18E-06 | 0.936 | 0.0231 | 7.57E-06 | 5.5% | 0.944 |
| Alpha | rs7101173 | MIR6072 | 0.0248 | 2.54E-06 | 0.619 | 0.0247 | 2.71E-06 | 6.7% | 0.643 |
| Linf | rs373680 | FBXO33 | 0.0232 | 6.94E-06 | 0.93 | 0.0235 | 5.83E-06 | 16.0% | 0.895 |
| Lambda | rs4793500 | CASC17 | 0.0259 | 1.16E-06 | 0.358 | 0.0256 | 1.46E-06 | 25.5% | 0.427 |
| Lambda | rs564266 | NTM | 0.0241 | 4.05E-06 | 0.771 | 0.0249 | 2.47E-06 | 39.0% | 0.620 |
| Lambda | rs773024 | OSTF1 | 0.0236 | 5.53E-06 | 0.855 | 0.0229 | 8.57E-06 | 55.0% | 0.952 |
| Linf | rs6884117 | C5orf22 | 0.0241 | 3.86E-06 | 0.761 | 0.0234 | 6.18E-06 | 60.1% | 0.908 |
| Lambda | rs154444 | ZNF608 | 0.0232 | 7.08E-06 | 0.916 | 0.0257 | 1.33E-06 | 81.2% | 0.404 |
| Linf | rs7071157 | PFKFB3 | 0.0276 | 4.08E-07 | 0.157 | 0.0300 | 7.13E-08 | 82.5% | 0.031 |
| Linf | rs6570627 | UTRN | 0.0250 | 2.20E-06 | 0.576 | 0.0240 | 4.19E-06 | 90.1% | 0.789 |
| Alpha | rs10196354 | ERBB4 | 0.0232 | 7.23E-06 | 0.937 | 0.0284 | 2.14E-07 | 97.0% | 0.085 |
| Lambda | rs6672510 | PDL5 | 0.0316 | 3.12E-08 | 0.013 | 0.0298 | 9.81E-08 | 214.3% | 0.038 |
| Alpha | rs9691259 | IGFBP3 | 0.0258 | 1.30E-06 | 0.4 | 0.0232 | 7.40E-06 | 469.8% | 0.937 |
| Linf | rs6710428 | CERS6 | 0.0264 | 8.83E-07 | 0.301 | 0.0224 | 1.19E-05 | 1253.3% | 0.993 |
| Lambda | rs7075547 | LRRTM3 | 0.0237 | 5.15E-06 | 0.836 | 0.0043 | 4.07E-01 | 7893823% | 1 |
We show the population wide mutual informations and corresponding p-values, with the adjusted mutual informations and p-values, followed by the FWER value. The three sets of SNP-phenotypes separated by thick lines are those for which: upper–the absolute change in p-values is less than 25% of the original; middle–the change in p-value is less than 100%; lower the change in p-value is greater than 100%. Note that there are some dependencies for which the p-values improve on adjustment for ethnicity confounding.