| Literature DB >> 34433483 |
Xue Jiang1, Weihao Pan2, Miao Chen2, Weidi Wang2, Weichen Song2, Guan Ning Lin3,4.
Abstract
BACKGROUND: Huntington's disease is a kind of chronic progressive neurodegenerative disease with complex pathogenic mechanisms. To data, the pathogenesis of Huntington's disease is still not fully understood, and there has been no effective treatment. The rapid development of high-throughput sequencing technologies makes it possible to explore the molecular mechanisms at the transcriptome level. Our previous studies on Huntington's disease have shown that it is difficult to distinguish disease-associated genes from non-disease genes. Meanwhile, recent progress in bio-medicine shows that the molecular origin of chronic complex diseases may not exist in the diseased tissue, and differentially expressed genes between different tissues may be helpful to reveal the molecular origin of chronic diseases. Therefore, developing integrative analysis computational methods for the multi-tissues gene expression data, exploring the relationship between differentially expressed genes in different tissues and the disease, can greatly accelerate the molecular discovery process.Entities:
Keywords: Artificial neuron; Differentially expressed gene; Huntington’s disease; Multi-tissues
Mesh:
Year: 2021 PMID: 34433483 PMCID: PMC8386081 DOI: 10.1186/s12920-021-00988-x
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Schematic illustration of the integrative enrichment analysis artificial neuron model
Fig. 2Pipeline of the integrative analysis of gene expression data based on the IEAAN model. Data preparation and pre-processing steps are described firstly. The training process of the IEAAN are illustrated in detail. Disease-associated genes are prioritized according to the enrichment scores finally
Experimental data description
| Age | 6-month-old | |||||
| Tissue | Striatum | Cortex | Liver | |||
| Genotype | poly Q20 | poly Q80 | poly Q92 | poly Q111 | poly Q140 | poly Q175 |
| Total sample number | 144 | |||||
Performance of the t-test method
| t-test | Normal–case | Normal–normal | Case–case | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Str_Str | Cor_Cor | Liv_Liv | Str_Cor | Str_Liv | Cor_Liv | Str_Cor | Str_Liv | Cor_Liv | |
| AUC | 0.48 | 0.52 | 0.47 | 0.545 | 0.529 | 0.512 | 0.521 | 0.514 | 0.523 |
| AUPR | 0.16 | 0.17 | 0.15 | 0.200 | 0.186 | 0.178 | 0.178 | 0.178 | 0.183 |
Performance of the FC method
| FC | Normal–case | Normal–normal | Case–case | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Str_Str | Cor_Cor | Liv_Liv | Str_Cor | Str_Liv | Cor_Liv | Str_Cor | Str_Liv | Cor_Liv | |
| AUC | 0.55 | 0.51 | 0.56 | 0.472 | 0.582 | 0.584 | 0.485 | 0.583 | 0.583 |
| AUPR | 0.18 | 0.19 | 0.23 | 0.168 | 0.218 | 0.216 | 0.174 | 0.219 | 0.216 |
Performance of the jNMFMA method
| jNMFMA | Normal–case | Normal–normal | Case–case | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Str_Str | Cor_Cor | Liv_Liv | Str_Cor | Str_Liv | Cor_Liv | Str_Cor | Str_Liv | Cor_Liv | |
| AUC | 0.567 | 0.554 | 0.585 | 0.527 | 0.534 | 0.548 | 0.537 | 0.581 | 0.563 |
| ±0.016 | ±0.005 | ±0.021 | ±0.013 | ±0.011 | ±0.029 | ±0.023 | ±0.009 | ±0.014 | |
| AUPR | 0.207 | 0.194 | 0.216 | 0.181 | 0.191 | 0.196 | 0.187 | 0.221 | 0.206 |
| ±0.015 | ±0.005 | ±0.011 | ±0.006 | ±0.008 | ±0.012 | ±0.019 | ±0.009 | ±0.009 | |
Performance of the FNMF method
| FNMF | Normal–case | Normal–normal | Case–case | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Str_Str | Cor_Cor | Liv_Liv | Str_Cor | Str_Liv | Cor_Liv | Str_Cor | Str_Liv | Cor_Liv | |
| AUC | 0.554 | 0.556 | 0.569 | 0.542 | 0.566 | 0.537 | 0.540 | 0.549 | 0.545 |
| ±0.016 | ±0.014 | ±0.029 | ±0.015 | ±0.022 | ±0.029 | ±0.016 | ±0.032 | ±0.032 | |
| AUPR | 0.199 | 0.197 | 0.194 | 0.194 | 0.192 | 0.198 | 0.188 | 0.195 | 0.191 |
| ±0.017 | ±0.010 | ±0.016 | ±0.008 | ±0.013 | ±0.024 | ±0.009 | ±0.013 | ±0.011 | |
Fig. 3The ROC curves of FC-based results
Fig. 4The PR curves of FC-based results
The overlap degree of the top 800 genes in any two ranked lists obtained by FC
| Normal–case | Normal–normal | Case–case | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Str_Str | Cor_Cor | Liv_Liv | Str_Cor | Str_Liv | Cor_Liv | Str_Cor | Str_Liv | Cor_Liv | ||
| Normal–case | Cor_Cor | 0.38 | ||||||||
| Liv_Liv | 0.22 | 0.19 | ||||||||
| Normal–normal | Str_Cor | 0.48 | 0.26 | 0.21 | ||||||
| Str_Liv | 0.25 | 0.14 | 0.44 | 0.27 | ||||||
| Cor_Liv | 0.21 | 0.13 | 0.45 | 0.21 | 0.92 | |||||
| Case–case | Str_Cor | 0.39 | 0.29 | 0.41 | 0.87 | 0.26 | 0.20 | |||
| Str_Liv | 0.25 | 0.14 | 0.44 | 0.27 | 0.96 | 0.91 | 0.26 | |||
| Cor_Liv | 0.21 | 0.12 | 0.45 | 0.20 | 0.91 | 0.96 | 0.19 | 0.92 | ||
| IEAAN | – | 0.23 | 0.14 | 0.46 | 0.23 | 0.93 | 0.95 | 0.23 | 0.93 | 0.95 |
The functional annotations of the five genes
| Gene | GOTERM_BP_DIRECT | GOTERM_CC_DIRECT | GOTERM_MF_DIRECT |
|---|---|---|---|
| Arpp21 | Cellular response to heat | Cytoplasm | Nucleic acid binding |
| Rgs4 | Inactivation of MAPK activity | Nucleus | GTPase activator activity |
| Regulation of G-protein coupled | Cytoplasm | ||
| Receptor protein signaling pathway | |||
| Rasd2 | Synaptic transmission | Intracellular | GTP binding |
| Dopaminergic | Membrane | ||
| Small GTPase mediated signal transduction | |||
| Gabrd | Transport | Plasma membrane | GABA-A receptor activity |
| Ion transport | Membrane | Extracellular ligand-gated ion channel activity | |
| Signal transduction | Integral component of membrane | ||
| Cell junction | |||
| Synapse | |||
| GABA-A receptor complex | |||
| Tmod1 | Muscle contraction | COP9 signalosome | Tropomyosin binding |
| Adult locomotory behavior | Membrance | ||
| Myofibril assembly | sarcomere | ||
| Pointed-end action filament capping | Cortical cytoskeleton | ||
| Lens fiber cell development |