| Literature DB >> 26044212 |
Benjamin Hur, Heejoon Chae, Sun Kim.
Abstract
RNA-sequencing is widely used to measure gene expression level at the whole genome level. Comparing expression data from control and case studies provides good insight on potential gene markers for phenotypes. However, discovering gene markers that represent phenotypic differences in a small number of samples remains a challenging task, since finding gene markers using standard differential expressed gene methods produces too many candidate genes and the number of candidates varies at different threshold values. In addition, in a small number of samples, the statistical power is too low to discriminate whether gene expressions were altered by genetic differences or not. In this study, to address this challenge, we purpose a four-step filtering method that predicts gene markers from RNA-sequencing data of mouse knockout studies by utilizing a gene regulatory network constructed from omics data in the public domain, biological knowledge from curated pathways, and information of single-nucleotide variants. Our prediction method was not only able to reduce the number of candidate genes than the differentialy expressed gene-only filtered method, but also successfully predicted significant genes that were reported in research findings of the data contributors.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26044212 PMCID: PMC4460612 DOI: 10.1186/1755-8794-8-S2-S10
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Figure 1Overview of the gene marker selection method.
Number of genes selected by each method.
| Feature | DEG | DEG+GRN | DEG+GRN+Pathway | DEG+GRN+Pathway+SNV |
|---|---|---|---|---|
| More than 1, Less than 1 | 12298 | 8834 | 3436 | 2622 |
| More than 1.1, Less than 0.91 | 8953 | 6264 | 2441 | 1861 |
| More than 1.2, Less than 0.83 | 6466 | 4322 | 1683 | 1272 |
| More than 1.3, Less than 0.77 | 4631 | 2986 | 1179 | 879 |
| More than 1.4, Less than 0.71 | 3495 | 2125 | 845 | 629 |
| More than 1.5, Less than 0.67 | 2712 | 1574 | 633 | 463 |
| More than 1.6, Less than 0.63 | 2153 | 1184 | 478 | 343 |
| More than 1.7, Less than 0.59 | 1750 | 914 | 364 | 257 |
| More than 1.8, Less than 0.56 | 1439 | 718 | 284 | 203 |
| More than 1.9, Less than 0.53 | 1235 | 562 | 233 | 165 |
| More than 2.0, Less than 0.5 | 1064 | 456 | 192 | 135 |
Figure 2Comparison of the number of candidates of each method at each fold change.
Figure 3Comparison of the number of the gene marker candidates between non-filtered and filtered method in NF-kappa B signaling pathway. (A) NF-kappa B signaling pathway mapped with non-filtered candidates. With no filtering method, too many genes are shown in the pathway which makes it difficult to find an appropriate gene marker. (B) NF-kappa B signaling pathway mapped with candidates filtered by DEG. The number of genes is greatly reduced compared to the non-filtered method. However, difficulty exists in finding significant gene marker as the number of genes are still too many. (C) NF-kappa B signaling pathway mapped with full-filtered candidates. The number of genes was greatly reduced compared to the non-filtered or DEG-only filtered candidates while keeping the genes reported by Yagi et al.(2013).
Figure 4Comparison of the number of the gene marker candidates between non-filtered and full-filtered method in TNF signaling pathway. (A) TNF signaling pathway mapped with non-filtered candidates. With no filtering method, too many genes are shown in the pathway which makes it difficult to find an appropriate gene marker. (B) TNF signaling pathway mapped with candidates filtered by DEG. The number of genes is greatly reduced compared to the non-filtered method. However, difficulty exists in finding significant gene marker as the number of genes are still too many. (C) TNF signaling pathway mapped with fullfiltered candidates. The number of genes was greatly reduced compared to non-filtered or DEG-only filtered candidates while keeping the genes reported by Yagi et al.(2013).
Figure 5Comparison of the number of the gene marker candidates between non-filtered and full-filtered method in Cell cycle pathway. (A) Cell cycle pathway mapped with non-filtered candidates. With no filtering method, too many genes are shown in the pathway which makes it difficult to find an appropriate gene marker. (B) Cell cycle pathway mapped with candidates filtered by DEG. Number of genes are greatly reduced than non-filtered method. However, difficulty exists in finding significant gene markers as the number of genes is still too great. (C) Cell cycle pathway mapped with full-filtered candidates. The number of genes was greatly reduced compared to non-filtered or DEG-only filtered candidates while keeping the genes reported by Yagi et al.(2013).
Performance comparison of gene marker prediction methods.
| Gene Symbol | NONE | DEG | DEG+GRN | DEG+GRN+Pathway | DEG+GRN+Pathway+SNV |
|---|---|---|---|---|---|
| Relb | HIT | HIT | HIT | HIT | HIT |
| Nfkb2 | HIT | HIT | HIT | HIT | HIT |
| Tnfrsf9 | HIT | HIT | HIT | HIT | HIT |
| Tnfrsf21 | |||||
| Icos | HIT | HIT | HIT | HIT | HIT |
| Il2ra | HIT | HIT | HIT | HIT | HIT |
| Cysltr1 | HIT | HIT | |||
| Kit | HIT | HIT | HIT | HIT | HIT |
| Il1r2 | HIT | HIT | HIT | HIT | |
| Il13 | HIT | HIT | HIT | HIT | HIT |
| Il5 | HIT | HIT | HIT | HIT | HIT |
| Areg | HIT | HIT | HIT | HIT | HIT |
| Il1rl1 | HIT | HIT | HIT | ||
| Ccr8 | HIT | HIT | |||
| Tph1 | HIT | HIT | |||
| Htr1b | |||||
| Cd244 | HIT | HIT | HIT | HIT | |
| Lta | HIT | HIT | HIT | HIT | |
| Il10 | HIT | HIT | HIT | HIT | HIT |
| Tnf | HIT | HIT | HIT | HIT | HIT |
| Nfkbia | HIT | HIT | HIT | HIT | |
| Cdkn2b | HIT | HIT | HIT | HIT | HIT |
| Lif | HIT | HIT | HIT | HIT | HIT |
| Il2ra | HIT | HIT | HIT | HIT | HIT |
| Il9r | HIT | HIT | |||
| Il24 | |||||
| Num of Hits | 23 | 23 | 19 | 18 | 14 |
| Selected Candidates | 12298 | 2153 | 1184 | 478 | 343 |
| Precision | 0.002 | 0.011 | 0.016 | 0.038 | 0.041 |
| Recall | 0.885 | 0.885 | 0.731 | 0.692 | 0.538 |
| F-measure | 0.004 | 0.021 | 0.032 | 0.073 | 0.076 |
| Accuracy | 0.002 | 0.827 | 0.905 | 0.962 | 0.972 |