| Literature DB >> 32669683 |
Ji Hye Won1,2, Mansu Kim1,2, Jinyoung Youn3,4, Hyunjin Park5,6.
Abstract
The age at onset (AAO) is an important determinant in Parkinson's disease (PD). Neuroimaging genetics is suitable for studying AAO in PD as it jointly analyzes imaging and genetics. We aimed to identify features associated with AAO in PD by applying the objective-specific neuroimaging genetics approach and constructing an AAO prediction model. Our objective-specific neuroimaging genetics extended the sparse canonical correlation analysis by an additional data type related to the target task to investigate possible associations of the imaging-genetic, genetic-target, and imaging-target pairs simultaneously. The identified imaging, genetic, and combined features were used to construct analytical models to predict the AAO in a nested five-fold cross-validation. We compared our approach with those from two feature selection approaches where only associations of imaging-target and genetic-target were explored. Using only imaging features, AAO prediction was accurate in all methods. Using only genetic features, the results from other methods were worse or unstable compared to our model. Using both imaging and genetic features, our proposed model predicted the AAO well (r = 0.5486). Our findings could have significant impacts on the characterization of prodromal PD and contribute to diagnosing PD early because genetic features could be measured accurately from birth.Entities:
Mesh:
Year: 2020 PMID: 32669683 PMCID: PMC7363828 DOI: 10.1038/s41598-020-68301-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Demographic and clinical data of the enrolled subjects.
| All | Age at onset, years | |||||
|---|---|---|---|---|---|---|
| Years | 50 | 60 | years | |||
| Age at onset of PD | 61.35 | 46.22 | 54.82 | 65.03 | 73.23 | – |
| Sex, male, % (n) | 65.07 (95) | 70.83 (17) | 45.71 (16) | 79.63 (43) | 57.58 (19) | – |
| PD duration, month | 7.55 | 6.25 | 7.09 | 8.30 | 7.79 | 0.63 |
| H&Y stage | 1.68 | 1.71 | 1.51 | 1.67 | 1.88 | 0.04 |
| MDS-UPDRS total | 32.32 | 30.00 | 32.11 | 30.59 | 37.06 | 0.20 |
| UPDRS Part1 | 5.31 | 4.71 | 6.00 | 5.04 | 5.45 | 0.66 |
| UPDRS Part2 | 5.41 | 5.88 | 5.06 | 5.11 | 5.94 | 0.69 |
| UPDRS Part3 | 21.45 | 19.21 | 20.43 | 25.42 | 0.08 | |
| UPDRS Part4 | 0.15 | 0.21 | 0.23 | 0.02 | 0.24 | 0.40 |
H&Y Hoehn and Yahr, MDS-UPDRS Movement Disorder Society-Unified Parkinson’s Disease Rating Scale.
p value computed from an ANOVA of the four groups.
*p value < 0.05 for comparing group 3 and 4.
Figure 1Schematic of our objective specific sparse canonical correlation analysis (os-SCCA) model. Our os-SCCA model is an extension of the sparse canonical correlation analysis. It includes an objective term (i.e., AAO of PD) to investigate the possible associations of the imaging–genetic, genetic–target, and imaging–target pairs simultaneously.
Selected SNPs from os-SCCA.
| CHRa | RSID for the SNP | BPb | MAc | Associated gene |
|---|---|---|---|---|
| 1 | rs823118 | 205754444 | C | NUCKS1 |
| 1 | rs4653767 | 226728377 | C | ITPKB |
| 2 | rs6430538 | 134782397 | C | ACMSD/TMEM163 |
| 2 | rs353116 | 165277122 | T | SCN3A/ SCN2A |
| 3 | rs12497850 | 48711556 | G | NCKIPSD/CDC71/IP6K2 |
| 4 | rs11724635 | 15735478 | C | BST1 |
| 4 | rs6812193 | 76277833 | T | FAM47E/STBD1 |
| 4 | rs3910105 | 89761420 | G | SNCA |
| 7 | rs199347 | 23254127 | G | GPNMP |
| 8 | rs591323 | 16839582 | A | MICU3/FGF20 |
| 9 | rs13294100 | 17579692 | T | SH3GL2 |
| 12 | rs11060180 | 122819039 | G | OGFOD2/CCDC62 |
| 14 | rs8005172 | 88006268 | T | GALC/GPR65 |
| 16 | rs14235 | 31110472 | A | ZNF646/KAT8/BCKDK |
| 22 | rs737866 | 19942586 | C | COMT |
| 22 | rs174674 | 19946502 | A | COMT |
| 22 | rs740603 | 19957654 | A | COMT |
| 22 | rs165656 | 19961340 | G | COMT |
| 22 | rs6269 | 19962429 | G | COMT |
| 22 | rs4633 | 19962712 | C | COMT |
| 22 | rs2239393 | 19962905 | G | COMT |
| 22 | rs4818 | 19963684 | G | COMT |
| 22 | rs4680 | 19963748 | A | COMT |
| 22 | rs165599 | 19969258 | G | COMT |
aCHR chromosome number.
bBP base-pair location in hg38 coordinates.
cMA minor allele of variant based on PPMI sample.
Figure 2Summary of the over-representation enrichment analysis from the identified SNPs. The plot shows the significant gene ontology analysis results from the 25 genes of the 24 identified SNPs. Genes were annotated with the three selected functional categories (corresponding to the categories written vertically), which were also in the reference list. The length of the bar represents the number of the identified genes observed in the reference gene list.
Selected SNPs from mRMR.
| CHR | RSID for the SNP | BP | MA | Associated gene |
|---|---|---|---|---|
| 1 | rs823118 | 205754444 | C | NUCKS1 |
| 2 | rs34043159 | 101796654 | C | IL1R2/MAP4K4 |
| 2 | rs6430538 | 134782397 | C | ACMSD/TMEM163 |
| 2 | rs353116 | 165277122 | T | SCN3A/SCN2A |
| 3 | rs4073221 | 18235996 | G | SATB1 |
| 3 | rs12497850 | 48711556 | G | NCKIPSD/CDC71/IP6K2 |
| 4 | rs34311866 | 958159 | C | TMEM175 |
| 4 | rs11724635 | 15735478 | C | BST1 |
| 4 | rs6812193 | 76277833 | T | FAM47E/STBD1 |
| 4 | rs356181 | 89704988 | G | SNCA |
| 4 | rs3910105 | 89761420 | G | SNCA |
| 4 | rs4444903 | 109912954 | G | EGF |
| 8 | rs2280104 | 22668467 | T | BIN3 |
| 10 | rs10906923 | 15527599 | C | FAM171A1/ITGA8 |
| 11 | rs329648 | 133895472 | T | MIR4697 |
| 12 | rs11060180 | 122819039 | G | OGFOD2/CCDC62 |
| 14 | rs11158026 | 54882151 | T | GCH1 |
| 14 | rs8005172 | 88006268 | T | GALC/GPR65 |
| 15 | rs2414739 | 61701935 | G | VPS13C |
| 16 | rs14235 | 31110472 | A | ZNF646/KAT8/BCKDK |
| 16 | rs4784227 | 52565276 | T | TOX3/CASC16 |
| 18 | rs12456492 | 43093415 | G | SYT4/RIT2 |
| 20 | rs55785911 | 3172857 | A | DDRGK1 |
| 22 | rs174674 | 19946502 | A | COMT |
| 22 | rs740603 | 19957654 | A | COMT |
| 22 | rs165599 | 19969258 | G | COMT |
Figure 3Prediction plots using different approaches. The actual and predicted AAO of PD using various approaches. Our models were compared with those using mRMR and LASSO. (a) The prediction plot using our os-SCCA; (b) the prediction plot using mRMR; (c) the prediction plot using LASSO. There were five colored lines and dots. Each represents a different left-out fold. The solid lines represent a linear fit of the corresponding data.