Jing Jiang, Fei Xing, Chunyu Wang, Xiangxiang Zeng.
Abstract
Rice (Oryza sativa L.) is one of the most important staple foods in the world. Candidate genes associated with rice yield can be identified by running a random walk with restart (RWR) model on a functional similarity network. A five-fold cross-validation experiment demonstrated the high performance of this approach and its robustness to the parameter r. We also assessed the strength of association between known seed genes and candidate genes on the basis of the resulting scores. The candidates ranked at the top of the results list were considered the most relevant rice yield-related genes. This study provides a valuable alternative for rice breeding and biology research. The relevant dataset and script can be downloaded from http://lab.malab.cn/jj/rice.htm.
Keywords: function; network; random walking; rice; yield
Year: 2018 PMID: 30524460 PMCID: PMC6262309 DOI: 10.3389/fpls.2018.01685
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
FIGURE 1. Illustration of the proposed method. The method takes a set of seed genes as input and returns a ranked list of candidates as output. A functional similarity network was constructed, a random walk with restart algorithm was applied to the network to obtain a score for each candidate gene, and the candidates were then ranked by score.
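The pipeline in Figure 1 can be illustrated with a minimal sketch. It assumes one common formulation of random walk with restart, p_next = (1 - r) * W * p + r * p0, where W holds the normalised transition weights and p0 spreads probability uniformly over the seeds. Whether the parameter r in the paper is the restart probability in exactly this sense is an assumption, and the gene names, edge weights, and function names below are hypothetical, not taken from the authors' script.

```python
def random_walk_with_restart(adjacency, seeds, r=0.7, tol=1e-10, max_iter=1000):
    """Score every node of a weighted, undirected network by RWR.

    adjacency: dict mapping node -> {neighbor: similarity weight}
    seeds:     set of nodes where the walker restarts
    r:         restart probability (assumed meaning of the parameter r)
    """
    nodes = sorted(adjacency)
    # Total outgoing weight of each node, used to normalise transitions.
    out_weight = {u: sum(adjacency[u].values()) for u in nodes}
    # Initial distribution: uniform over the seed genes.
    p0 = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    p = dict(p0)
    for _ in range(max_iter):
        nxt = {n: r * p0[n] for n in nodes}           # restart term
        for u in nodes:
            if out_weight[u] == 0:
                continue                               # isolated node: nothing to pass on
            for v, w in adjacency[u].items():
                nxt[v] += (1 - r) * p[u] * w / out_weight[u]   # walk term
        change = sum(abs(nxt[n] - p[n]) for n in nodes)
        p = nxt
        if change < tol:                               # converged
            break
    return p


def rank_candidates(scores, seeds):
    """Return non-seed nodes ordered from highest to lowest score."""
    return sorted((n for n in scores if n not in seeds),
                  key=lambda n: scores[n], reverse=True)


# Hypothetical toy network: a chain A - B - C - D - E with unit similarities.
network = {
    "A": {"B": 1.0},
    "B": {"A": 1.0, "C": 1.0},
    "C": {"B": 1.0, "D": 1.0},
    "D": {"C": 1.0, "E": 1.0},
    "E": {"D": 1.0},
}
seed_genes = {"A"}
scores = random_walk_with_restart(network, seed_genes)
ranking = rank_candidates(scores, seed_genes)
print(ranking)
```

In this toy chain the candidates closest to the seed receive the highest scores, which is the behaviour the ranking step relies on.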
FIGURE 2. Five-fold cross-validation of the parameter r in RWR. The abscissa represents the top 500 ranking positions, and the ordinate represents the number of matching seed nodes.
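The quantity plotted in Figure 2 (held-out seeds recovered within the top-ranked positions) can be sketched as follows. This is an illustrative outline only: the fold assignment, the helper name `five_fold_hits`, and the stand-in scorer `toy_score_fn` are all hypothetical. In a real run the scorer would execute RWR from the training seeds on the functional similarity network.

```python
import random


def five_fold_hits(seeds, score_fn, top_k, n_folds=5, seed=0):
    """Count held-out seeds found among the top_k candidates, summed over folds.

    score_fn(training_seeds) must return a dict mapping gene -> score.
    """
    shuffled = sorted(seeds)
    random.Random(seed).shuffle(shuffled)              # reproducible fold assignment
    folds = [shuffled[i::n_folds] for i in range(n_folds)]
    hits = 0
    for held_out in folds:
        training = set(shuffled) - set(held_out)
        scores = score_fn(training)
        # Training seeds are excluded; held-out seeds compete with all other genes.
        ranking = sorted((g for g in scores if g not in training),
                         key=lambda g: scores[g], reverse=True)
        hits += len(set(ranking[:top_k]) & set(held_out))
    return hits


# Hypothetical data: 20 genes, of which the first 10 are known seeds.
genes = [f"g{i}" for i in range(20)]
known_seeds = set(genes[:10])


def toy_score_fn(training_seeds):
    """Stand-in scorer that ignores the training seeds (a real one would run RWR)."""
    return {g: float(len(genes) - i) for i, g in enumerate(genes)}


print(five_fold_hits(known_seeds, toy_score_fn, top_k=2))
```

Repeating the call for several values of r and for increasing `top_k` would give curves of the kind the caption describes.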
FIGURE 3. Rankings of all seeds for different values of r after 100 repetitions of five-fold cross-validation. The ordinate of the left panel shows the sum of the ranks of all seeds, and the ordinate of the right panel shows the ranking positions of all seeds.
The top 100 candidate genes in the ranking list.
| Ranking | Gene name | P score | PubMedID |
|---|---|---|---|
| 1 | LOC_Os06g09390 | 0.001487617 | PMID: 20713616, PMID: 27555860 |
| 2 | LOC_Os06g50480 | 0.001475286 | |
| 3 | LOC_Os02g02480 | 0.00146746 | |
| 4 | LOC_Os08g42470 | 0.001461294 | |
| 5 | LOC_Os01g03340 | 0.000941415 | |
| 6 | LOC_Os01g03390 | 0.00080268 | PMID: 12972663 |
| 7 | LOC_Os01g04040 | 0.00080268 | |
| 8 | LOC_Os01g04050 | 0.00080268 | |
| 9 | LOC_Os07g02350 | 0.000775571 | PMID: 16240106, PMID: 11416158 |
| 10 | LOC_Os08g02640 | 0.000669873 | |
| 11 | LOC_Os04g37619 | 0.000640376 | PMID: 24634194 |
| 12 | LOC_Os11g35500 | 0.00062345 | PMID: 29813124, PMID: 29402905 |
| 13 | LOC_Os05g41970 | 0.000594578 | PMID: 1731968 |
| 14 | LOC_Os12g16890 | 0.000594578 | |
| 15 | LOC_Os01g03680 | 0.000584668 | |
| 16 | LOC_Os07g10580 | 0.000564849 | PMID: 28158863, PMID: 22108719 |
| 17 | LOC_Os06g50340 | 0.000561268 | PMID: 19704753, PMID: 16511358 |
| 18 | LOC_Os10g14150 | 0.000555163 | PMID: 19201764 |
| 19 | LOC_Os01g55540 | 0.000551598 | PMID: 15753104 |
| 20 | LOC_Os10g22860 | 0.00054974 | PMID: 23384860, PMID: 28101092 |
| 21 | LOC_Os10g32990 | 0.000547737 | PMID: 23384860, PMID: 28101092 |
| 22 | LOC_Osm1g00450 | 0.000540982 | |
| 23 | LOC_Os01g60670 | 0.000536737 | |
| 24 | LOC_Os07g11410 | 0.00053512 | |
| 25 | LOC_Os01g13800 | 0.000533159 | |
| 26 | LOC_Os02g13780 | 0.000533159 | |
| 27 | LOC_Os10g06760 | 0.000533159 | PMID: 23384860, PMID: 28101092 |
| 28 | LOC_Os10g13970 | 0.000533159 | PMID: 23384860, PMID: 28101092 |
| 29 | LOC_Os10g19160 | 0.000533159 | PMID: 23384860, PMID: 28101092 |
| 30 | LOC_Os02g57530 | 0.000532385 | PMID: 14754915 |
| 31 | LOC_Os10g21810 | 0.000529529 | |
| 32 | LOC_Os01g47730 | 0.000507068 | |
| 33 | LOC_Os07g11920 | 0.000505391 | PMID: 28158863, PMID: 22108719 |
| 34 | LOC_Os01g07870 | 0.00049357 | |
| 35 | LOC_Os03g54790 | 0.000492652 | |
| 36 | LOC_Os01g18670 | 0.000492651 | |
| 37 | LOC_Os07g42300 | 0.000483507 | PMID: 24466124 |
| 38 | LOC_Os11g10100 | 0.000478643 | |
| 39 | LOC_Os11g40150 | 0.000478361 | PMID: 28071676 |
| 40 | LOC_Os12g31370 | 0.000478361 | PMID: 28071676 |
| 41 | LOC_Os03g05740 | 0.000472443 | |
| 42 | LOC_Os08g38720 | 0.000468006 | |
| 43 | LOC_Os03g50330 | 0.000462237 | |
| 44 | LOC_Os04g08740 | 0.000461766 | PMID: 19417056 |
| 45 | LOC_Os01g42650 | 0.000461755 | PMID: 16263700 |
| 46 | LOC_Os03g27290 | 0.000460621 | PMID: 19217306, PMID: 15672456 |
| 47 | LOC_Os10g39670 | 0.000460227 | |
| 48 | LOC_Os01g65230 | 0.000459159 | |
| 49 | LOC_Os03g54780 | 0.000456546 | |
| 50 | LOC_Os08g03640 | 0.000456163 | |
| 51 | LOC_Os01g14830 | 0.000454589 | |
| 52 | LOC_Os01g10820 | 0.000453601 | |
| 53 | LOC_Os10g42110 | 0.000449388 | |
| 54 | LOC_Os03g26860 | 0.000448345 | |
| 55 | LOC_Os07g41750 | 0.000448221 | |
| 56 | LOC_Os03g17580 | 0.000448145 | |
| 57 | LOC_Os10g42940 | 0.000447386 | PMID: 24715026, PMID: 10873582 |
| 58 | LOC_Os03g03570 | 0.000446501 | PMID: 10364408 |
| 59 | LOC_Os12g43550 | 0.000445728 | |
| 60 | LOC_Os03g49500 | 0.000444206 | PMID: 29767552 |
| 61 | LOC_Os10g04674 | 0.000442469 | PMID: 24145853, PMID: 17986178 |
| 62 | LOC_Os10g06740 | 0.000442469 | PMID: 28154240 |
| 63 | LOC_Os01g05980 | 0.000442411 | |
| 64 | LOC_Os10g33650 | 0.000440094 | |
| 65 | LOC_Os01g18150 | 0.000438562 | |
| 66 | LOC_Os01g22490 | 0.000436139 | |
| 67 | LOC_Os02g18550 | 0.000436139 | |
| 91 | LOC_Os05g50930 | 0.000408491 | |
| 92 | LOC_Os10g39440 | 0.000408336 | PMID: 24372780, PMID: 18335199 |
| 93 | LOC_Os08g06630 | 0.000407594 | |
| 94 | LOC_Osp1g00820 | 0.000407028 | PMID: 25658309 |
| 95 | LOC_Osp1g01050 | 0.000407028 | PMID: 25658309 |
| 96 | LOC_Osp1g00420 | 0.00040642 | PMID: 25658309 |
| 97 | LOC_Os05g49320 | 0.000404017 | |
| 98 | LOC_Os12g07720 | 0.000400566 | PMID: 14756303 |
| 99 | LOC_Os10g06930 | 0.000399998 | PMID: 29356995 |
| 100 | LOC_Os03g06410 | 0.000399411 | PMID: 1731968 |
The confirmation rate of top 100 candidate genes in the ranking list.
| Top | Confirmation Number | Confirmation Rate |
|---|---|---|
| 20 | 11 | 55% |
| 30 | 16 | 53.33% |
| 40 | 20 | 50% |
| 60 | 26 | 43.33% |
| 70 | 30 | 42.86% |
| 80 | 33 | 41.25% |
| 100 | 46 | 46% |
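The confirmation rates can be reproduced from the candidate table, assuming a candidate counts as confirmed when its row lists at least one PubMed ID. That reading is an inference from the two tables, not a definition stated in the text. The sketch below covers the first 40 ranks only, and the function and variable names are illustrative.

```python
# Ranks 1-40 whose rows in the candidate table list at least one PubMed ID.
ranks_with_pmid = {1, 6, 9, 11, 12, 13, 16, 17, 18, 19, 20,
                   21, 27, 28, 29, 30, 33, 37, 39, 40}


def confirmation(top_n, supported_ranks):
    """Return (number confirmed, confirmation rate in percent) for the top_n candidates."""
    confirmed = sum(1 for rank in supported_ranks if rank <= top_n)
    return confirmed, round(100 * confirmed / top_n, 2)


for top_n in (20, 30, 40):
    confirmed, rate = confirmation(top_n, ranks_with_pmid)
    print(f"top {top_n}: {confirmed} confirmed ({rate}%)")
```

Under that assumption the counts for the top 20, 30, and 40 candidates match the corresponding rows of the confirmation table.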
FIGURE 4. GO terms in which the top 100 candidate genes are enriched. The abscissa shows GO terms, and the ordinate represents the number of GO terms.