| Literature DB >> 30515394 |
Nesrine Sghaier1,2, Rayda Ben Ayed1, Riadh Ben Marzoug1, Ahmed Rebai1.
Abstract
Auxin is a major regulator of plant growth and development; its action involves transcriptional activation. The identification of Auxin-response element (AuxRE) is one of the most important issues to understand the Auxin regulation of gene expression. Over the past few years, a large number of motif identification tools have been developed. Despite these considerable efforts provided by computational biologists, building reliable models to predict regulatory elements has still been a difficult challenge. In this context, we propose in this work a data fusion approach for the prediction of AuxRE. Our method is based on the combined use of Dempster-Shafer evidence theory and fuzzy theory. To evaluate our model, we have scanning the DORNRÖSCHEN promoter by our model. All proven AuxRE present in the promoter has been detected. At the 0.9 threshold we have no false positive. The comparison of the results of our model and some previous motifs finding tools shows that our model can predict AuxRE more successfully than the other tools and produce less false positive. The comparison of the results before and after combination shows the importance of Dempster-Shafer combination in the decrease of false positive and to improve the reliability of prediction. For an overall evaluation we have chosen to present the performance of our approach in comparison with other methods. In fact, the results indicated that the data fusion method has the highest degree of sensitivity (Sn) and Positive Predictive Value (PPV).Entities:
Mesh:
Substances:
Year: 2018 PMID: 30515394 PMCID: PMC6236769 DOI: 10.1155/2018/3837060
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Datasets.
|
|
|
|
|
|---|---|---|---|
|
| AuxRE | 16 | [ |
|
| ABRE | 12 | [ |
|
| TATA Box | 16 | [ |
|
| Ypatch | 11 | [ |
|
| DRE | 9 | [ |
Figure 1Modelling approach.
Figure 2Learning graph 1: distribution of different type of motifs in significance score/position feature space.
Figure 3Learning graph 2: distribution of different type of motifs in occurrence/density feature space.
Figure 4Learning graph 3: distribution of different type of motifs in f1/f2 feature space.
Proportion of false positive and true positive in the regions of the significance score/position feature space and associated propositions.
|
| % | % |
|
|---|---|---|---|
| Z1: R11 | 7 | 93 | P4(H2) |
| Z2: R12 | 25 | 75 | P2(H2) |
| Z3: R13 | 0 | 100 | P4(H2) |
| Z4: R14, R24, R34 | 0 | 100 | P4(H2) |
| Z5: R23 | 80 | 20 | P3(H1) |
| Z6: R21, R31, R32, R33, R22 | 100 | 0 | P4(H1) |
Proportion of false positive and true positive in the regions of the occurrence/density feature space and associated propositions.
|
| % | % |
|
|---|---|---|---|
| Z7: D11 | 43 | 57 | P1 |
| Z8: D12 | 13 | 87 | P3(H2) |
| Z9: D22 | 91 | 9 | P3(H1) |
| Z10: D41 | 100 | 0 | P4(H1) |
| Z11: D21, D31 | 0 | 100 | P4(H2) |
| Z12: D13, D23, D32, D33, D42, D43, | 0 | 100 | P4(H2) |
Proportion of false positive and true positive in the regions of the f1/f2 feature space and associated propositions.
|
| % | % |
|
|---|---|---|---|
| Z11:Q11, Q21, R12 | 0 | 100 | P4(H2) |
| Z12: Q22 | 70 | 30 | P3(H1) |
| Z13: Q13, Q23, Q33 | 0 | 100 | P4(H2) |
| Z14: Q31, Q32 | 10 | 90 | P3(H2) |
| Z15: Q41, Q42, Q43 | 0 | 100 | P4(H2) |
Association of propositions with mass values.
|
|
|
|
|
|---|---|---|---|
| P1(H1,H2) | 0 | 0 | 1 |
| P2(H1,H2) | 0,33 | 0 | 0,67 |
| P3(H1,H2) | 0,67 | 0 | 0,33 |
| P4(H1) | 1 | 0 | 0 |
| P2(H2,H1) | 0 | 0,33 | 0,67 |
| P3(H2,H1) | 0 | 0,67 | 0,33 |
| P4(H2) | 0 | 1 | 0 |
Figure 5Scanning of the DRN promoter by the data fusion method.
Figure 6Evolution of the positive and false detection as a function of credibility obtained before and after data fusion.
Figure 7ROC curves before and after data fusion (scan of DRN promoter).
Figure 8Variation of Positive Predictive Value (PPV).
Comparison between our method and the other published methods.
|
|
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|---|---|
|
|
| 99.91 |
|
| 51.83 | 0.02 |
| 1 |
| |
|
|
| 99.92 |
|
| 52.08 | 0.03 |
| 1 | 31702.8 | [ |
| Matrix-Scan | 51 | 99.94 | 46.36 | 99.95 | 53.64 | 0.05 | 0.51 | 1 | 22661.6 | [ |
| Patser | 63.64 | 99.92 | 43.45 | 99.96 | 56.55 | 0.04 | 0.64 | 1 | 27285.9 | [ |
| FIMO | 22 | 99.97 | 42.31 | 99.92 | 57.69 | 0.08 | 0.22 | 1 | 8911.9 | [ |
| PoSSuMsearch | 56.41 | 83.84 | 40.74 | 90.71 | 59.26 | 9.29 | 0.4 | 0.74 | 30 | [ |
Average sensitivities (Sn). Specificity (Sp). Positive Predictive Value (PPV). Negative Predictive Value (NPV). False Positive Rate (FPR). False Negative Rate (FNR). Youden index (YI). Q coefficient of Yule (QCY) and Χ2 test value (Χ2). The best-performing tools. Data fusion and Clover are highlighted in bold.