| Literature DB >> 27776539 |
Benjamin Hur1, Sangsoo Lim1, Heejoon Chae2, Seokjun Seo3, Sunwon Lee4, Jaewoo Kang4, Sun Kim5,6,7.
Abstract
MOTIVATION: Transcriptome data from the gene knockout experiment in mouse is widely used to investigate functions of genes and relationship to phenotypes. When a gene is knocked out, it is important to identify which genes are affected by the knockout gene. Existing methods, including differentially expressed gene (DEG) methods, can be used for the analysis. However, existing methods require cutoff values to select candidate genes, which can produce either too many false positives or false negatives. This hurdle can be addressed either by improving the accuracy of gene selection or by providing a method to rank candidate genes effectively, or both. Prioritization of candidate genes should consider the goals or context of the knockout experiment. As of now, there are no tools designed for both selecting and prioritizing genes from the mouse knockout data. Hence, the necessity of a new tool arises.Entities:
Keywords: Gene prioritization; Gene selection; Knockout mouse; Web tool
Mesh:
Substances:
Year: 2016 PMID: 27776539 PMCID: PMC5078909 DOI: 10.1186/s13062-016-0158-x
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Fig. 1A Workflow of CLIP-GENE. CLIP-GENE prioritize user-interested genes that are relevant to phenotypic/functional differences of knockout mice data. CLIP-GENE takes DEG as input and filter out genes by using TF network and SNV information. Then prioritize these genes by using BEST and PPI information
Fig. 2Prioritization step 1: Prioritizing genes with Biomedical Entity Search Tool (BEST). BEST is utilized to find the relevance between knockout gene and candidate gene as well as the relationship between candidate gene and given context. Then CLIP-GENE retrieves the maximum score to represent that the candidate gene is highly relevant to the user’s interest or knockout gene. As a result, candidate gene with higher relevance score is ranked with high priority
Fig. 3Prioritization step 2: Prioritizing genes with Biomedical Entity Search Tool (BEST) and PPI information. CLIP-GENE summarizes ranks from step 1 and PPI shortest path by using Borda count
Performance of CLIP-GENE while analyzing GSE47851 (Gata3 KO)
| Methods | Precision | Recall | F-measure |
|---|---|---|---|
| DEG | 0.0105 | 1 | 0.0208 |
| IA | 0.0239 | 0.72 | 0.0463 |
| GeneFriends | 0.0038 | 0.92 | 0.0075 |
|
|
|
|
|
| CLIP-GENE (Inflammatory response) | 0.0354 | 0.76 | 0.0677 |
| CLIP-GENE (Cell differentiation) | 0.0294 | 0.72 | 0.0564 |
| CLIP-GENE (Cell proliferation) | 0.0201 | 0.72 | 0.0391 |
The best performed measurement is marked with a star (*) with a bold text
Performance of CLIP-GENE while analyzing GSE54932 (Setd2 KO)
| Methods | Precision | Recall | F-measure |
|---|---|---|---|
| DEG | 0.0099 | 0.5238 | 0.0195 |
| IA | 0.0183 | 0.1905 | 0.0333 |
| GeneFriends | 0.0015 | 0.5238 | 0.0031 |
|
|
|
|
|
| CLIP-GENE (Cell proliferation) | 0.0252 | 0.3333 | 0.0468 |
| CLIP-GENE (DNA mismatch repair) | 0.1304 | 0.1429 | 0.1364 |
| CLIP-GENE (Histone modification) | 0.0408 | 0.1905 | 0.0672 |
The best performed measurement is marked with a star (*) with a bold text
Performance of CLIP-GENE while analyzing GSE53398 (Barx2 KO)
| Methods | Precision | Recall | F-measure |
|---|---|---|---|
| DEG | 0.0071 | 0.7872 | 0.0142 |
| IA | 0.0111 | 0.3617 | 0.0215 |
| GeneFriends | 0.0036 | 0.617 | 0.0071 |
|
|
|
|
|
| CLIP-GENE (Muscle maintenance) | 0.0476 | 0.0426 | 0.0449 |
| CLIP-GENE (Chondrogensis) | 0.1667 | 0.0426 | 0.0678 |
| CLIP-GENE (Morphogenesis) | 0.0217 | 0.4255 | 0.0412 |
The best performed measurement is marked with a star (*) with a bold text
Gene prioritization results of CLIP-GENE at GSE47851 (Gata3 KO) data
| Context: | Context: | Context: | Context: | |
|---|---|---|---|---|
| Reported gene |
| Inflammatory response | Cell differentiation | Cell proliferation |
| Relb | 144 | 56 | 117 | 209 |
| Nfkb2 | - | - | - | - |
| Tnfrsf9 | 236 | 246 | 408 | 751 |
| Tnfrsf21 | - | - | - | - |
| Icos | 16 | 44 |
| 66 |
| Cysltr1 | 186 | 282 | 392 | 638 |
| Kit | - | - | - | - |
| Il1r2 | - | 203 | - | 483 |
| Il13 | 4 | 37 | 44 | 125 |
| Il5 |
| 50 | 52 | 149 |
| Areg | 71 | 87 | 154 | 96 |
| Il1rl1 | 67 | 109 | 237 | 507 |
| Ccr8 | 108 | 88 | 281 | 643 |
| Tph1 | 216 | 180 | 86 | 222 |
| Htr1b | - | - | - | - |
| Cd244 | - | - | - | - |
| Lta | 19 |
| 122 | 203 |
| Il10 |
|
| 47 | 74 |
| Tnf |
|
|
|
|
| Nfkbia | - | 33 | 341 | 563 |
| Cdkn2b | 78 | 256 | 255 | 122 |
| Lif | - | - | - | 402 |
| Il2ra | 84 | 206 | 88 | 546 |
| Il9r | 165 | 351 | 398 | - |
| Il24 | - | 436 | 343 | - |
| reproduced true positives/ | ||||
| predicted candidates | 16/260 | 19/536 | 18/613 | 18/896 |
The table represents how CLIP-GENE succeed to reproduce and prioritize the reported genes from the data produced study [19]. Genes that are ranked in top 10 are marked with bold font while star (*) represents best performed context
Gene prioritization results of CLIP-GENE at GSE54932 (Setd2 KO) data
| Context: | Context: | Context: | Context: | |
|---|---|---|---|---|
| Reported gene |
| Cell Proliferation | DNA mismatch repair | Histone modification |
| Gata6 |
| 40 | 15 | 23 |
| Sox7 |
| 72 | - | - |
| Sox17 | - | - | - | - |
| Dab2 | 16 | 44 | - | 65 |
| Cubn | - | - | - | - |
| Cdx2 |
| 19 |
| 13 |
| Psx1 | - | - | - | - |
| Fgf5 | - | - | - | - |
| Pax6 | - | - | - | - |
| T |
|
|
|
|
| Gata4 | - | - | - | - |
| Hnf1b | - | 213 | - | - |
| Colora1 | - | - | - | - |
| Myo6 | - | - | - | - |
| Pfn2 | - | - | - | - |
| Cldn1 | - | - | - | - |
| Vil1 | - | - | - | - |
| Fgfr3 | - | - | - | - |
| Fgfr4 | - | 28 | - | - |
| Arc | - | - | - | - |
| Cd97 | - | - | - | - |
| reproduced true positives/ | ||||
| predicted candidates | 5/24 | 7/278 | 3/23 | 4/98 |
The table represents how CLIP-GENE succeed to reproduce and prioritize the reported genes from the data produced study [24]. Genes that are ranked in top 10 are marked with bold font while star (*) represents best performed context
Gene prioritization results of CLIP-GENE at GSE53398 (Barx2 KO) data
| Context: | Context: | Context: | Context: | |
|---|---|---|---|---|
| Reported gene |
| Muscle maintenance | Chondrogensis | Morphogenesis |
| Gdnf | - | - | - | - |
| Id2 | - | - | - | 271 |
| Mmp9 |
| 29 |
| 129 |
| Smo | - | - | - | - |
| Sox2 | - | - | - |
|
| Wisp1 | - | - | - | - |
| Wisp2 | - | - | - | - |
| Ahr | - | - | - | - |
| Axin2 | - | - | - | - |
| Cacna2d3 | - | - | - | - |
| Ccnd1 | - | - | - | - |
| Ccnd2 | - | - | - | - |
| Ctgf | - | - | - | - |
| Dlk1 | - | - | - | 319 |
| Fgf7 | - | - | - | - |
| Fst | - | - | - | 307 |
| Fzd7 | - | - | - | - |
| Gdf5 | - | - | - | - |
| Igf2 | - | - | - | - |
| Klf5 | - | - | - | 96 |
| Pdgfra |
|
|
| 573 |
| Pitx2 | - | - | - |
|
| Tgfb3 | - | - | - | 780 |
| Wnt5a | - | - | - | - |
| Fzd4 | - | - | - | 193 |
| Fzd6 | - | - | - | - |
| Sfrp1 | - | - | - | 188 |
| Tle2 | - | - | - | - |
| Dvl1 | - | - | - | 168 |
| Nkd1 | - | - | - | - |
| Porcn | - | - | - | 367 |
| Wif1 | - | - | - | 74 |
| Wnt4 | - | - | - | - |
| Axin1 | - | - | - | - |
| Fstl1 | - | - | - | - |
| Fstl3 | - | - | - | - |
| Hey1 | - | - | - | 122 |
| Hey2 | - | - | - | - |
| Heyl | - | - | - | - |
| Hes1 | - | - | - | 34 |
| Hes6 | - | - | - | - |
| Snai1 | - | - | - | 295 |
| Snai2 | - | - | - | 399 |
| Snai3 | - | - | - | 526 |
| Fos | - | - | - | - |
| Nrap | - | - | - | - |
| Id1 | - | - | - | 205 |
| reproduced true positives/ | ||||
| predicted candidates | 2/11 | 2/42 | 2/12 | 20/923 |
The table represents how CLIP-GENE succeed to reproduce and prioritize the reported genes from the data produced study [25]. Genes that are ranked in top 10 are marked with bold font while star (*) represents best performed context