| Literature DB >> 30696812 |
Mauricio Arcos-Burgos1,2,3, Jorge I Vélez4,5, Ariel F Martinez4, Marta Ribasés6,7,8, Josep A Ramos-Quiroga6,7,8,9, Cristina Sánchez-Mora6,7,8, Vanesa Richarte7,8,9, Carlos Roncero7,8,9,10, Bru Cormand11,12,13,14, Noelia Fernández-Castillo11,12,13,14, Miguel Casas6,7,8,9, Francisco Lopera15, David A Pineda15, Juan D Palacio15, Johan E Acosta-López16, Martha L Cervantes-Henriquez5,16, Manuel G Sánchez-Rojas16, Pedro J Puentes-Rozo16,17, Brooke S G Molina18, Margaret T Boden19, Deeann Wallis20, Brett Lidbury21, Saul Newman21, Simon Easteal21, James Swanson22,23, Hardip Patel24, Nora Volkow25, Maria T Acosta4, Francisco X Castellanos26,27, Jose de Leon19, Claudio A Mastronardi28,29, Maximilian Muenke30.
Abstract
Genetic factors are strongly implicated in the susceptibility to develop externalizing syndromes such as attention-deficit/hyperactivity disorder (ADHD), oppositional defiant disorder, conduct disorder, and substance use disorder (SUD). Variants in the ADGRL3 (LPHN3) gene predispose to ADHD and predict ADHD severity, disruptive behaviors comorbidity, long-term outcome, and response to treatment. In this study, we investigated whether variants within ADGRL3 are associated with SUD, a disorder that is frequently co-morbid with ADHD. Using family-based, case-control, and longitudinal samples from disparate regions of the world (n = 2698), recruited either for clinical, genetic epidemiological or pharmacogenomic studies of ADHD, we assembled recursive-partitioning frameworks (classification tree analyses) with clinical, demographic, and ADGRL3 genetic information to predict SUD susceptibility. Our results indicate that SUD can be efficiently and robustly predicted in ADHD participants. The genetic models used remained highly efficient in predicting SUD in a large sample of individuals with severe SUD from a psychiatric institution that were not ascertained on the basis of ADHD diagnosis, thus identifying ADGRL3 as a risk gene for SUD. Recursive-partitioning analyses revealed that rs4860437 was the predominant predictive variant. This new methodological approach offers novel insights into higher order predictive interactions and offers a unique opportunity for translational application in the clinical assessment of patients at high risk for SUD.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30696812 PMCID: PMC6351584 DOI: 10.1038/s41398-019-0396-7
Source DB: PubMed Journal: Transl Psychiatry ISSN: 2158-3188 Impact factor: 6.222
A concise description of the cohorts’ principal demographic and clinical data.
| Paisa sample | Spanish sample | MTA sample | Kentucky sample | |||||
|---|---|---|---|---|---|---|---|---|
|
| % |
| % |
| % |
| % | |
| Sex | ||||||||
| Males | 231 | 49% | 1193 | 72% | 287 | 76% | 285 | 53% |
| Females | 241 | 51% | 454 | 28% | 89 | 24% | 248 | 47% |
| Total | 472 | 100% | 1647 | 100% | 376 | 100% | 533 | |
| ADHD | ||||||||
| Affected | 79 | 17% | 670 | 41% | 140 | 37% |
|
|
| Unaffected | 249 | 53% | 486 | 29% | 236 | 63% |
|
|
| Unknown | 144 | 30% | 491 | 30% | 376 | 100% |
|
|
| ODD | ||||||||
| Affected | 78 | 17% | 81 | 5% |
|
|
|
|
| Unaffected | 250 | 53% | 391 | 24% |
|
|
|
|
| Unknown | 144 | 30% | 1175 | 71% |
|
|
|
|
| CD | ||||||||
| Affected | 84 | 18% | 102 | 6% |
|
|
|
|
| Unaffected | 244 | 52% | 357 | 22% |
|
|
|
|
| Unknown | 144 | 30% | 1188 | 72% |
|
|
|
|
| Nicotine | ||||||||
| Affected | 102 | 22% | 646 | 39% | 97 | 26% | 372 | 70% |
| Unaffected | 226 | 48% | 613 | 37% | 40 | 11% | 161 | 30% |
| Unknown | 144 | 30% | 388 | 24% | 239 | 63% | 0 | 0% |
| Alcohol | ||||||||
| Affected | 124 | 27% | 396 | 24% | 120 | 32% | 342 | 64% |
| Unaffected | 204 | 43% | 637 | 39% | 106 | 28% | 191 | 36% |
| Unknown | 144 | 30% | 614 | 37% | 150 | 40% | 0 | 0% |
| Cannabis | ||||||||
| Affected |
|
|
|
| 94 | 25% |
|
|
| Unaffected |
|
|
|
| 71 | 19% |
|
|
| Unknown |
|
|
|
| 211 | 56% |
|
|
| Other drugs | ||||||||
| Affected | 12 | 3% |
|
|
|
| 147 | 28% |
| Unaffected | 197 | 56% |
|
|
|
| 386 | 72% |
| Unknown | 263 | 41% |
|
|
|
| 0 | 0% |
| SUD | ||||||||
| Affected |
|
| 768 | 47% |
|
| 452 | 85% |
| Unaffected |
|
| 879 | 53% |
|
| 81 | 15% |
| Unknown |
|
| 0 | 0% |
|
|
|
|
| Phobias | ||||||||
| Affected | 156 | 37% | 50 | 3% |
|
|
|
|
| Unaffected | 172 | 33% | 584 | 36% |
|
|
|
|
| Unknown | 144 | 30% | 1013 | 61% |
|
|
|
|
| Anxiety | ||||||||
| Affected | 58 | 13% | 107 | 6% |
|
|
|
|
| Unaffected | 270 | 57% | 818 | 50% |
|
|
|
|
| Unknown | 144 | 30% | 722 | 44% |
|
|
|
|
| Depression | ||||||||
| Affected | 117 | 25% | 143 | 9% |
|
|
|
|
| Unaffected | 211 | 45% | 490 | 30% |
|
|
|
|
| Unknown | 144 | 30% | 1014 | 61% |
|
|
|
|
| Mood | ||||||||
| Affected |
|
|
|
|
|
| 157 | 29% |
| Unaffected |
|
|
|
|
|
| 376 | 71% |
| Unknown |
|
|
|
|
|
| 0 | 0% |
| Schizophrenia | ||||||||
| Affected |
|
|
|
|
|
| 253 | 47% |
| Unaffected |
|
|
|
|
|
| 280 | 53% |
| Unknown |
|
|
|
|
|
| 0 | 0% |
For the Paisa cohort, only information for founder members (adults) used in the ARPA-based predictive model for SUD is shown. See Methods section for more details
ADHD attention-deficit/hyperactivity disorder, CD conduct disorder, ODD oppositional defiant disorder, SUD substance use disorders
Data not available
Fig. 1Advanced Recursive Partitioning Analysis (ARPA) for the Paisa sample.
a Derived Classification and Regression Tree (CART) for SUD status as categorical target variable (disjunctive affection status, i.e., substance use of either alcohol, or nicotine, or other drugs). Only founder individuals were included in the analysis to avoid kinship relatedness bias. Class 0 (unaffected) is indicated in red and class 1 (affected) in blue. This derived tree for the Paisa sample included demographic (age), clinical (conduct disorder (CD)), and genetic variables (markers rs5010235 and rs4860437). The T allele of the rs4860437 variant (node 4) generates a highly discriminant split in combination with age (45.5 years) to terminal node 3 of ADHD individuals without CD (see root node 1). b Variable importance scores derived by Random Forest and TreeNet analysis were compatible with the variables included in the tree derived by CART. c, d TreeNet analysis to maximize the ROC area and minimize the classification error using 200 trees. The areas under the ROC curve (AUC) were 0.954 and 0.87 for learning and testing samples (blue and red curves and values, respectively), while the proportions of misclassification for SUD cases in the cross-validation experiment were 0.124 and 0.177 for learning and testing data sets, respectively
Fig. 2ARPA for the Spanish sample.
a Derived tree by CART for SUD status as categorical target variable (disjunctive affection status, i.e., substance use of either alcohol, or nicotine, or other drugs). This derived tree for the Spanish sample included demographic (sex), clinical (CD, ODD, depression, and ADHD), and genetic variables (markers rs677642, rs4860437, rs1868790). b Variable importance scores derived by Random Forest and TreeNet analysis were compatible with the variables included in the tree derived by CART. c, d TreeNet analysis to maximize the AUC and minimize the classification error using 200 trees. The AUC were 0.911 and 0.897 for learning and testing samples while the proportions of misclassification for SUD cases in the cross-validation experiment were 0.151 and 0.175 for learning and testing data sets, respectively. Conventions as in Fig. 1
Fig. 3ARPA for the MTA sample.
a Derived tree by CART for the SUD status as categorical target variable (disjunctive affection status, i.e., substance use of either alcohol, or nicotine, cannabis, or other drugs). As the MTA is a longitudinal study, we used SUD status at 96 and 120 month follow-ups and applied a lag analysis of SUD emergence. The derived tree included demographic (site of ascertainment), and genetic variables (markers rs2172802, rs61747658, rs12509110, and rs6856328). The combination of variants rs61747658 and rs2172802 generated an important discriminant splitting of SUD affected and unaffected classes. b Variable importance scores derived by Random Forest and TreeNet analysis were compatible with the variables included in the tree derived by CART. c, d TreeNet analysis to maximize ROC area and minimize classification error using 200 trees. The AUC were 0.808 and 0.643 for learning and testing samples, while the proportions of misclassification for SUD cases in the cross-validation experiment, for learning and testing data were 0.314 and 0.358, respectively. Conventions as in Fig. 1
Fig. 4ARPA for the Kentucky sample.
a Derived CART tree for SUD status as categorical target variable (disjunctive affection status, i.e., substance use of either alcohol, or nicotine, or other drugs). This derived tree for the Kentucky sample included demographic (sex), clinical (high Body Mass index [HBMI], and schizophrenia diagnosis), and genetic variables (markers rs4860437 and rs7659636). Notably, the T allele of the rs4860437 variant generated a split in the same direction as occurred for the derived tree in the Paisa and in the Spain samples. b Variable importance scores derived by Random Forest and TreeNet analysis were compatible with the variables included in the tree derived by CART. c, d TreeNet analysis to maximize ROC area and minimize classification error using 200 trees. The AUC were 0.811 and 0.744 for learning and testing samples, respectively, while the proportions of misclassification for SUD cases in the cross-validation experiment, for learning and testing data were 0.285 and 0.252, respectively. Conventions as in Fig. 1
Fig. 5Fixed-effects meta-analysis for the prediction accuracy of the ARPA-based predictive model for SUD derived in each cohort.
The overall SUD correct classification rate is ~73%. CI Confidence Interval, SE Standard Error, Z test statistic, P P-value