| Literature DB >> 24312912 |
Tao Zhang1, Min Jiang, Lei Chen, Bing Niu, Yudong Cai.
Abstract
Observing what phenotype the overexpression or knockdown of gene can cause is the basic method of investigating gene functions. Many advanced biotechnologies, such as RNAi, were developed to study the gene phenotype. But there are still many limitations. Besides the time and cost, the knockdown of some gene may be lethal which makes the observation of other phenotypes impossible. Due to ethical and technological reasons, the knockdown of genes in complex species, such as mammal, is extremely difficult. Thus, we proposed a new sequence-based computational method called kNNA-based method for gene phenotypes prediction. Different to the traditional sequence-based computational method, our method regards the multiphenotype as a whole network which can rank the possible phenotypes associated with the query protein and shows a more comprehensive view of the protein's biological effects. According to the prediction result of yeast, we also find some more related features, including GO and KEGG information, which are making more contributions in identifying protein phenotypes. This method can be applied in gene phenotype prediction in other species.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24312912 PMCID: PMC3838811 DOI: 10.1155/2013/870795
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Breakdown of 1462 budding yeast proteins according to their 11 phenotypes.
| Tag | Phenotype category | Number of proteins |
|---|---|---|
|
| Conditional phenotypes | 536 |
|
| Cell cycle defects | 272 |
|
| Mating and sporulation defects | 198 |
|
| Auxotrophies, carbon, and nitrogen utilization defects | 266 |
|
| Cell morphology and organelle mutants | 535 |
|
| Stress response defects | 147 |
|
| Carbohydrate and lipid biosynthesis | 46 |
|
| Nucleic acid metabolism defects | 219 |
|
| Sensitivity to amino acid analogs and other drugs | 124 |
|
| Sensitivity to antibiotics | 43 |
|
| Sensitivity to immunosuppressants | 14 |
|
| ||
| Total | — | 2,400 |
The 11 order prediction accuracies by kNNA-based method.
| Method order | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | |
|
| 62.38 | 30.44 | 22.16 | 14.09 | 9.03 | 6.43 | 5.75 | 2.8 | 3.08 | 3.49 | 4.51 |
Figure 1The curve showing the trend of the 11 order prediction accuracies.
Figure 230 IFS curves of kNNA-based method corresponding to different values of k.
Figure 3The peak and its coordinate of these IFS curves.
The 11 order prediction accuracies by RPC-based methods (Dagging, RandomForest, SMO).
| Methods order | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | |
| Dagging | 60.05 | 33.58 | 21.96 | 13.75 | 10.53 | 8.28 | 6.57 | 3.56 | 2.6 | 1.85 | 1.44 |
| RandomForest | 58.62 | 34.2 | 22.3 | 14.7 | 9.92 | 7.66 | 5.95 | 5.2 | 3.28 | 1.5 | 0.82 |
| SMO | 56.16 | 34.68 | 21.55 | 14.84 | 10.88 | 7.8 | 6.36 | 4.65 | 3.21 | 2.26 | 1.78 |