| Literature DB >> 28585928 |
D van der Meer1,2, P J Hoekstra1, M van Donkelaar3, J Bralten3, J Oosterlaan4, D Heslenfeld4, S V Faraone5,6,7, B Franke3, J K Buitelaar8,9, C A Hartman1.
Abstract
Identifying genetic variants contributing to attention-deficit/hyperactivity disorder (ADHD) is complicated by the involvement of numerous common genetic variants with small effects, interacting with each other as well as with environmental factors, such as stress exposure. Random forest regression is well suited to explore this complexity, as it allows for the analysis of many predictors simultaneously, taking into account any higher-order interactions among them. Using random forest regression, we predicted ADHD severity, measured by Conners' Parent Rating Scales, from 686 adolescents and young adults (of which 281 were diagnosed with ADHD). The analysis included 17 374 single-nucleotide polymorphisms (SNPs) across 29 genes previously linked to hypothalamic-pituitary-adrenal (HPA) axis activity, together with information on exposure to 24 individual long-term difficulties or stressful life events. The model explained 12.5% of variance in ADHD severity. The most important SNP, which also showed the strongest interaction with stress exposure, was located in a region regulating the expression of telomerase reverse transcriptase (TERT). Other high-ranking SNPs were found in or near NPSR1, ESR1, GABRA6, PER3, NR3C2 and DRD4. Chronic stressors were more influential than single, severe, life events. Top hits were partly shared with conduct problems. We conclude that random forest regression may be used to investigate how multiple genetic and environmental factors jointly contribute to ADHD. It is able to implicate novel SNPs of interest, interacting with stress exposure, and may explain inconsistent findings in ADHD genetics. This exploratory approach may be best combined with more hypothesis-driven research; top predictors and their interactions with one another should be replicated in independent samples.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28585928 PMCID: PMC5537639 DOI: 10.1038/tp.2017.114
Source DB: PubMed Journal: Transl Psychiatry ISSN: 2158-3188 Impact factor: 6.222
List of genes based on our literature search
| Angiotensin-converting enzyme | 17 | 61454422 | 61675741 | 181 | |
| Alpha2B adrenergic receptor | 2 | 96678623 | 96881888 | 144 | |
| Apolipoprotein E | 19 | 45309039 | 45512650 | 481 | |
| Arginine vasopressin receptor 1A | 12 | 63436539 | 63646590 | 655 | |
| Arginine vasopressin receptor 1B | 1 | 206124283 | 206331482 | 164 | |
| Brain-derived neurotrophic factor | 11 | 27576442 | 27822600 | 368 | |
| Alpha7 nicotinic acetylcholine receptor | 15 | 32222686 | 32562384 | 394 | |
| Catechol-O-methyltransferase | 22 | 19829263 | 20057498 | 795 | |
| Corticotropin-releasing hormone binding protein | 5 | 76148680 | 76365299 | 388 | |
| Corticotropin-releasing hormone receptor 1 | 17 | 43761646 | 44013194 | 229 | |
| Corticotropin-releasing hormone receptor 2 | 7 | 30591559 | 30839719 | 575 | |
| Dopamine receptor D4 | 11 | 537305 | 740705 | 615 | |
| Estrogen receptor alpha | 6 | 152028454 | 152524408 | 1323 | |
| FK506 binding protein 5 | 6 | 35441362 | 35756719 | 820 | |
| Gamma-aminobutyric acid A receptor alpha 6 | 5 | 161012658 | 161229598 | 519 | |
| Serotonin receptor 1A | 5 | 63155875 | 63358119 | 264 | |
| Melanocortin 2 Receptor | 18 | 13782043 | 14015535 | 905 | |
| Neuropeptide S receptor | 7 | 34597897 | 35017944 | 1313 | |
| Neuropeptide Y | 7 | 24223807 | 24431484 | 905 | |
| Glucocorticoid receptor | 5 | 142557496 | 142884045 | 481 | |
| Mineralocorticoid receptor | 4 | 148899915 | 149463672 | 1147 | |
| Kappa opioid receptor | 8 | 54038276 | 54264194 | 904 | |
| Mu opioid receptor | 6 | 154260443 | 154540594 | 778 | |
| Oxytocin receptor | 3 | 8692095 | 8911300 | 538 | |
| Period circadian protein homolog 1 | 17 | 7943788 | 8155753 | 552 | |
| Period circadian protein homolog 3 | 1 | 7744714 | 8005237 | 861 | |
| Dopamine transporter | 5 | 1292905 | 1545543 | 442 | |
| Serotonin transporter | 17 | 28421337 | 28662986 | 305 | |
| Stathmin | 1 | 26110677 | 26332993 | 328 |
A total of 17 374 single-nucleotide polymorphisms (SNPs) spread out over these 29 genes were included in the analysis. Next to each gene is displayed its protein product, the chromosome (Chr.) it is located on, the start and end position (in base pairs, bp) of the region we included, and the number of SNPs in that region.
Top 25 most important predictors, based on the increase in prediction error following permutation
| 1 | Your child has a chronic illness or handicap | 0.23 | 15.01 | |||
| 2 | Your child has fewer friends than he/she would like | 0.15 | 4.64 | |||
| 3 | Your child is being bullied at school or in the neighborhood | 0.07 | 3.61 | |||
| 4 | Your child can’t get along with someone in your immediate family | 0.08 | 1.24 | |||
| 6 | Your immediate family has financial difficulties | 0.04 | 0.35 | |||
| 5 | rs4635969 | 5:1308552 | 84 kb from 3′ end | 0.20 | 0.44 | |
| 7 | rs35311906 | 7:34873557 | Intron | 0.15 | 0.23 | |
| 8 | rs985191 | 6:152283458 | Intron | 0.11 | 0.21 | |
| 9 | rs10035808 | 5:161189729 | 60 kb from 3′ end | 0.46 | 0.19 | |
| 10 | rs11587880 | 1:7822957 | 22 kb from 5′ end | 0.08 | 0.17 | |
| 11 | rs7932167 | 11:620599 | 17 kb from 5′ end | 0.19 | 0.16 | |
| 12 | rs77714417 | 6:152274190 | Intron | 0.06 | 0.13 | |
| 13 | rs56821207 | 1:7951115 | 46 kb from 3′ end | 0.18 | 0.13 | |
| 14 | rs179265 | 1:7942692 | 37 kb from 3′ end | 0.45 | 0.12 | |
| 15 | rs11587479 | 1:7823085 | 22 kb from 5′ end | 0.08 | 0.12 | |
| 16 | rs74325817 | 6:152271827 | Intron | 0.11 | 0.12 | |
| 17 | rs35365822 | 6:152270364 | Intron | 0.11 | 0.12 | |
| 18 | rs2530547 | 7:34697922 | 5′-UTR | 0.36 | 0.11 | |
| 19 | rs9340910 | 6:152272233 | Intron | 0.11 | 0.10 | |
| 20 | rs77595592 | 5:161034192 | 78 kb from 5′ end | 0.17 | 0.10 | |
| 21 | rs35953391 | 5:1312329 | 81 kb from 3′ end | 0.20 | 0.10 | |
| 22 | rs6930114 | 6:152268250 | Intron | 0.11 | 0.09 | |
| 23 | rs35527038 | 4:148970403 | 30 kb from 3′ end | 0.15 | 0.09 | |
| 24 | rs10002896 | 4:149028802 | Intron | 0.24 | 0.09 | |
| 25 | rs143748464 | 8:54146601 | Intron | 0.03 | 0.09 | |
Abbreviations: kb, kilo base pair; RS ID, reference SNP identification number; UTR, untranslated region; VIMP, Breiman–Cutler variable importance estimate.
The five stressors are listed first, followed by details on the 20 single-nucleotide polymorphisms (SNPs).
For the upper part of the table, the ‘Frequency’ column indicates the proportion of individuals that have experienced the stressor. For the lower part of the table, it displays the SNP’s minor allele frequency, the ‘Location’ column represents its genomic location by chromosome and base pair count, and the ‘Region’ column denotes the SNP’s position relative to its associated gene as documented in Table 1.
Figure 1Variable importance for prediction, for all single-nucleotide polymorphisms (SNPs) included in the analysis. SNPs are ordered on the x axis based on their genomic position, from chromosome 1 to 22, with the labels and alternating red and black sections marking the gene they belong to. The y axis indicates the variable importance, as percent increase in mean-squared error (MSE) of the out-of-bag predictions when the SNP was permuted. Those above the dashed blue line are part of the top 25 most important predictors, listed in Table 2.
Figure 2Interaction strengths for each pair of 25 top predictors from the random forest analysis. These were calculated by subtracting the sum of the pair’s individual importance estimates from their joint importance estimate. Negative numerals indicate that one predictor made it more likely that the other was selected for a split in its subtree, positive numerals indicate this was less likely. The predictors are sorted on the basis of the first principal component of their interaction strengths.
Figure 3Visualization of the interaction between SLC6A3 rs4635969 and each of the five long-term difficulties among the top predictors. The participants are grouped based on their genotype and exposure to the individual long-term difficulty shown on the x axis. On the y axis is the observed score on the Conners’ Parent Rating Scale (CPRS), subscale N. The boxes show the median, and the first and third quantiles of each group. Their width is scaled by the number of participants. ADHD, attention-deficit/hyperactivity disorder.