| Literature DB >> 36017394 |
Zehao Liu1, Songxian Zeng2, Xinglin Quan3.
Abstract
The development of neuroimaging technology and molecular genetics has produced a large amount of imaging genetics data, which has greatly promoted the study of complex mental diseases. However, because the feature dimension of the data is too high, the correlation measure assumes that the data obey Gaussian distribution, and traditional algorithms often cannot explain these two types of data well. This article mainly studies image genetics analysis and its application based on neural network. In this paper, based on the theory and application technology of neural network, the tree structure is established by prior knowledge, that is, each SNP site is used as a leaf node of the tree, and the LD block and genome formed by the linkage imbalance of multiple SNP sites are used as intermediate nodes. Then, the hierarchical relationship of features was introduced. On this basis, a sparse learning method based on tree structure guidance is used to select features from multiple features of multiple SNPs locus regression candidate brain regions. Finally, the identification of SNPs in feature selection is used to predict quantitative traits of brain regions. The distribution of the typical vector values obtained by the algorithm in the experimental data is basically consistent with the distribution of the median of the actual data, and the correlation coefficient obtained is closest to the actual correlation coefficient in the data set. The average correlation coefficient of the algorithm reaches 82.3%, which is about 4.2% higher than the control algorithm. Experimental results show that this method can not only significantly improve the regression performance but also detect the risk gene SNPs loci with spatial clustering features and functional interpretation significance. It is practical and effective to use it in clinical trials.Entities:
Mesh:
Year: 2022 PMID: 36017394 PMCID: PMC9398771 DOI: 10.1155/2022/5861928
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.246
Details of the simulation data set.
| Data set |
|
|
| Correlation coefficient |
|---|---|---|---|---|
| Data1 | 100 | 250 | 600 | 0.6214 |
| Data2 | 100 | 250 | 600 | 0.8384 |
| Data3 | 100 | 250 | 600 | 0.7525 |
| Data4 | 100 | 500 | 900 | 0.6542 |
Figure 1Spatial specific expression analysis of gene coexpression network and M8 module.
Correlation results of correlations on simulated data sets under different methods.
| Attribute | SCCA | GSCCA | TGSCCA |
|---|---|---|---|
| BL | 0.957 | 0.959 | 0.963 |
| M06 | 0.938 | 0.946 | 0.949 |
| M12 | 0.942 | 0.951 | 0.956 |
| M24 | 0.924 | 0.944 | 0.946 |
Figure 2Correlation results of correlations on simulated data sets under different methods.
Correlation coefficient results.
| Methods/data sets | Data1 | Data2 | Data3 | Data4 | AvgError |
|---|---|---|---|---|---|
| True cc | 0.63 | 0.85 | 0.73 | 0.64 | — |
| L1-SCCA | 0.57(-0.06) | 0.58(-0.26) | 0.47(-0.26) | 052(-0.12) | 0.18 |
| FL-SCCA | 0.64(+0.01) | 0.79(-0.06) | 0.64(-0.09) | 0.65(+0.01) | 0.06 |
| SC-SCCA | 0.65(+0.02) | 0.82(-0.03) | 0.75(+0.02) | 0.65(+0.01) | 0.03 |
Figure 3Comparison of correlation coefficients for each method on each data set.
Figure 4Genes are partially correlated with the volume of dorsal striatum substructure.