| Literature DB >> 30477446 |
Qi Zhao1, Qian Mao2, Zheng Zhao3, Tongyi Dou4, Zhiguo Wang1,5, Xiaoyu Cui1, Yuanning Liu6, Xiaoya Fan7.
Abstract
BACKGROUND: An increasing number of studies reported that exogenous miRNAs (xenomiRs) can be detected in animal bodies, however, some others reported negative results. Some attributed this divergence to the selective absorption of plant-derived xenomiRs by animals.Entities:
Keywords: Cross-kingdom regulation; Machine learning; Plant-derived xenomiR; Selective absorption; Statistics analysis; miRNA
Mesh:
Substances:
Year: 2018 PMID: 30477446 PMCID: PMC6258294 DOI: 10.1186/s12864-018-5227-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1The top 8 plant miRNA families that contain the most xenomiRs. More than one half of xenomiRs belong to these 8 miRNA families. In mir168 family, up to 41.2% miRNAs (7 of 17 miRNA sequences) were mapped. All redundant sequences were removed from each RNA family
Fig. 2Nucleotide position comparison between xenomiRs and non-xenomiRs. Percentage of the four kinds of nucleotide at each position of a) 166 xenomiRs and b) 942 non-xenomiRs
Sequence feature comparison between xenomiRs and non-xenomiRs
| Feature | FDR | Mean (positive) | Mean (negative) | |
|---|---|---|---|---|
|
|
|
|
|
|
| U | 1.03E-03 | 5.16E-03 | 0.242165895 | 0.276586575 |
|
|
|
|
|
|
| AU | 3.95E-04 | 3.13E-03 | 0.051751301 | 0.068725933 |
|
|
|
|
|
|
|
|
|
|
|
|
| GU | 2.35E-07 | 4.11E-06 | 0.034289003 | 0.057152059 |
| UA | 1.13E-09 | 3.97E-08 | 0.021754371 | 0.045343654 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| GGU | 5.00E-04 | 3.27E-03 | 0.00755333 | 0.015033431 |
| GUA | 5.30E-04 | 3.27E-03 | 0.003583374 | 0.01049709 |
| UAA | 4.42E-04 | 3.13E-03 | 0.003479138 | 0.010353066 |
| UAU | 7.81E-09 | 2.05E-07 | 0.001510348 | 0.013291427 |
| UUA | 4.47E-04 | 3.13E-03 | 0.003682193 | 0.01009223 |
| UUU | 1.09E-03 | 5.18E-03 | 0.010166663 | 0.021144843 |
| GU (seed) | 4.75E-05 | 4.98E-04 | 0.031124498 | 0.060686483 |
|
|
|
|
|
|
| UA (seed) | 8.28E-04 | 4.69E-03 | 0.016064257 | 0.037508846 |
Feature comparison was performed between xenomiRs and non-xenomiRs, and 23 features with FDR less than 0.01 were listed. Bold indicates the values are higher and non-bold indicates lower in xenomiRs than non-xenomiRs
Fig. 3Dimension reduction of features extracted from xenomiRs and non-xenomiRs. Dimension reduction of features extracted from xenomiRs and non-xenomiRs was performed using LDA to show the differences between them. Overall, the xenomiRs and non-xenomiRs could be separated with partial overlap in the middle, and the distribution of xenomiRs is more compact than that of non-xenomiRs
Fig. 4The architecture of our 1D-CNN model. This model consists of two convolutional layers, one flatten layer, two dense layers and one output layer
Training set and testing set
| Data set | Class | # of miRNAs | Data source |
|---|---|---|---|
| Training | Positive | 141 | Literature [ |
| Negative | 917 | miRbase, GEO | |
| Testing | Positive | 25 | Literature [ |
| Negative | 25 | miRbase, GEO |
Accuracy comparison between RF model and 1D-CNN model on test set and 5-fold cross-validation set
| Model | SN | SP | ACC | AUC | MCC | |
|---|---|---|---|---|---|---|
| Independent test set | RF | 0.920 | 0.560 | 0.740 | 0.866 | 0.547 |
| 1D-CNN | 0.880 | 0.680 | 0.780 | 0.817 | 0.574 | |
| 5-fold CV | RF | 0.901 | 0.666 | 0.736 | 0.840 | 0.538 |
| 1D-CNN | 0.859 | 0.712 | 0.766 | 0.794 | 0.543 | |
ACC, SN, SP, AUC and MCC indicate accuracy, sensitivity, specificity, area under the ROC
curve and Mathews correlation coefficient, respectively
Fig. 5ROC curves for performance comparison. ROC curves for performance comparison between RF and 1D-CNN models by (a) test set and (b) 5-fold cross validation, respectively
Fig. 6Enriched biological processes and KEGG pathways. The top 20 enriched biological processes (a) and the top 20 KEGG pathways (b) shown by Gene Ontology analysis pathway enrichment analysis, respectively