| Literature DB >> 34465339 |
Xue Jiang1, Miao Chen1, Weichen Song1, Guan Ning Lin2,3.
Abstract
BACKGROUND: Clinically, behavior, cognitive, and mental functions are affected during the neurodegenerative disease progression. To date, the molecular pathogenesis of these complex disease is still unclear. With the rapid development of sequencing technologies, it is possible to delicately decode the molecular mechanisms corresponding to different clinical phenotypes at the genome-wide transcriptomic level using computational methods. Our previous studies have shown that it is difficult to distinguish disease genes from non-disease genes. Therefore, to precisely explore the molecular pathogenesis under complex clinical phenotypes, it is better to identify biomarkers corresponding to different disease stages or clinical phenotypes. So, in this study, we designed a label propagation-based semi-supervised feature selection approach (LPFS) to prioritize disease-associated genes corresponding to different disease stages or clinical phenotypes.Entities:
Keywords: Biomarkers that corresponding to clinical phenotypes; Feature selection; Label propagation clustering
Mesh:
Year: 2021 PMID: 34465339 PMCID: PMC8406783 DOI: 10.1186/s12920-021-00985-0
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Ranking of the means of gene expression values in all samples
Fig. 2Ranking of the variances of gene expression values in all samples
Gene expression data of Huntington’s disease mice
| Tissue | Striatum | ||
|---|---|---|---|
| Age | 2-Month-old | 6-Month-old | 10-Month-old |
| Genotype | Poly Q20 | Poly Q80 | Poly Q92 |
| Poly Q111 | Poly Q140 | Poly Q175 | |
Fig. 3The flowchart of LPFS with Huntington’s disease RNA-seq data
The number of key genes for each category
| Normal samples | Case samples | |||||
|---|---|---|---|---|---|---|
| 2-Month-old | 6-Month-old | 10-Month-old | 2-Month-old | 6-Month-old | 10-Month-old | |
| Num. | 133 | 73 | 101 | 38 | 22 | 30 |
The GO and KEGG pathway enrichment analysis of normal mice marker genes by LPFS
| GO | Category | Description | Log10(P) |
|---|---|---|---|
| R-MMU-176412 | Reactome Gene Sets | Phosphorylation of the APC/C | − 4.11 |
| GO:0021983 | GO Biological Processes | Pituitary gland development | − 3.16 |
| GO:0022412 | GO Biological Processes | Cellular process involved in reproduction in multicellular organism | − 3.11 |
| R-MMU-500792 | Reactome Gene Sets | GPCR ligand binding | − 2.78 |
| R-MMU-2980736 | Reactome Gene Sets | Peptide hormone metabolism | − 2.60 |
| GO:0097305 | GO Biological Processes | Response to alcohol | − 2.55 |
| R-MMU-500792 | Reactome Gene Sets | Aromatic amino acid family metabolic process | − 3.64 |
| GO:0048589 | GO Biological Processes | Steroid hormone biosynthesis | − 2.05 |
| GO:0009072 | GO Biological Processes | Arachidonic acid metabolic process | − 6.52 |
| mmu00140 | KEGG Pathway | Steroid hormone biosynthesis | − 4.54 |
| GO:0019369 | GO Biological Processes | Arachidonic acid metabolic process | − 3.97 |
| GO:0002819 | GO Biological Processes | Regulation of adaptive immune response | − 3.60 |
| mmu04610 | KEGG Pathway | Complement and coagulation cascades | − 3.35 |
| GO:0001580 | GO Biological Processes | Response to alcohol | − 3.02 |
| R-MMU-174824 | Reactome Gene Sets | Response to alcohol | − 2.76 |
| GO:0010466 | GO Biological Processes | Response to alcohol | − 2.50 |
The GO and KEGG pathway enrichment analysis of case mice marker genes by LPFS
| GO | Category | Description | Log10(P) |
|---|---|---|---|
| GO:0007605 | GO Biological Processes | Sensory perception of sound | − 2.63 |
| GO:0002021 | GO Biological Processes | Response to dietary excess | − 3.84 |
| R-MMU-2559586 | Reactome Gene Sets | DNA Damage/Telomere Stress Induced Senescence | − 2.66 |
| GO:0007568 | GO Biological Processes | Aging | − 2.22 |
| GO:0051092 | GO Biological Processes | Positive regulation of NF-kappaB transcription factor activity | − 2.17 |
| GO:0003007 | GO Biological Processes | Heart morphogenesis | − 2.04 |
| GO:0046631 | GO Biological Processes | Alpha–beta T cell activation | − 4.14 |
| GO:0050878 | GO Biological Processes | Regulation of body fluid levels | − 3.54 |
| GO:0006820 | GO Biological Processes | Anion transport | − 2.48 |
| GO:0090277 | GO Biological Processes | Positive regulation of peptide hormone secretion | − 2.39 |
| GO:0050728 | GO Biological Processes | Negative regulation of inflammatory response | − 2.13 |
Fig. 4The enrichment analysis of 397 specific gene markers
The performance of LPFS for disease gene selection and sample label prediction
| Experiment | Hamming loss | One-error | Coverage | AUC | AUPR |
|---|---|---|---|---|---|
| Q20 versus Q80 | 0.210 ± 0.027 | 0.676 ± 0.081 | 0.382 ± 0.404 | 0.513 ± 0.064 | 0.193 ± 0.024 |
| Q20 versus Q92 | 0.220 ± 0.021 | 0.707 ± 0.063 | 0.397 ± 0.313 | 0.524 ± 0.060 | 0.211 ± 0.024 |
| Q20 versus Q111 | 0.229 ± 0.024 | 0.733± 0.071 | 0.410 ± 0.353 | 0.556 ± 0.058 | 0.186 ± 0.020 |
| Q20 versus Q140 | 0.226 ± 0.016 | 0.724 ± 0.048 | 0.406 ± 0.241 | 0.570 ± 0.066 | 0.210 ± 0.031 |
| Q20 versus Q175 | 0.226 ± 0.015 | 0.726 ± 0.046 | 0.407 ± 0.232 | 0.605 ± 0.067 | 0.226 ± 0.015 |
The AUC and AUPR of different methods
| Methods | FC | t-test | DESeq2 | edgeR | limma | jNMFMA | FNMF | LPFS |
|---|---|---|---|---|---|---|---|---|
| AUC | 0.570 | 0.509 | 0.524 | 0.531 | 0.497 | 0.547 ± 0.033 | 0.548 ± 0.019 | 0.554 ± 0.063 |
| AUPR | 0.227 | 0.166 | 0.179 | 0.180 | 0.160 | 0.188 ± 0.02 | 0.196 ± 0.01 | 0.205 ± 0.023 |
Fig. 5The ROC curves of FC, t-test, DESeq2, edgeR, limma, jNMFMA, and LPFS
Fig. 6The precision recall curves of FC, t-test, DESeq2, edgeR, limma, jNMFMA, and LPFS
The overlap degree of the top 1000 genes obtained by any two methods (397 genes for LPFS)
| DESeq2 | edgeR | limma | t-test | FC | jNMFMA | FNMF | |
|---|---|---|---|---|---|---|---|
| edgeR | 523 | ||||||
| limma | 312 | 457 | |||||
| t-test | 463 | 539 | 435 | ||||
| FC | 230 | 362 | 304 | 221 | |||
| jNMFMA | 175 | 252 | 304 | 192 | 546 | ||
| FNMF | 120 | 141 | 246 | 147 | 215 | 213 | |
| LPFS | 36 | 77 | 242 | 80 | 121 | 81 | 71 |