| Literature DB >> 28828382 |
Jianxiao Liu1,2, Zonglin Tian3.
Abstract
BACKGROUND ANDEntities:
Mesh:
Substances:
Year: 2017 PMID: 28828382 PMCID: PMC5554554 DOI: 10.1155/2017/1813494
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
The advantages and disadvantages of all the methods.
| ID | Methods | Advantages and disadvantages |
|---|---|---|
| 1 | Linkage analysis method genome-wide association analysis | Advantages: detecting the relationship between genetic loci and phenotypic trait |
| Disadvantages: low accuracy; time-consuming; high false positive rate; high cost | ||
|
| ||
| 2 | Expression Quantitative Trait Loci (eQTL) | Advantages: mining the relationship between genetic loci and quantitative trait |
| Disadvantages: high false positive rate; high cost | ||
|
| ||
| 3 | Mathematical and statistical methods | Advantages: high efficiency; low cost |
| Disadvantages: low accuracy; can not deal with the data with noise and nonlinear relationship; high computational complexity | ||
|
| ||
| 4 | Dimensionality Reduction Multifactor (MDR) | Advantages: detecting the epistasis interaction; low cost |
| Disadvantages: high calculation complexity | ||
|
| ||
| 5 | Clustering method | Advantages: establishing the global relationship; low cost |
| Disadvantages: cannot deal with data with nonlinear relationship; cannot detect the correlation between genes and phenotype | ||
|
| ||
| 6 | Support vector machine random forest method | Advantages: low cost; high efficiency |
| Disadvantages: the correctness depends on the quality of training set, but the training set is often hard to obtain | ||
|
| ||
| 7 | Bayesian method | Advantages: using the prior knowledge; realize the accurate calculation; low cost |
| Disadvantages: lack of network visibility; can not detect the epistasis interaction | ||
|
| ||
| 8 | Bayesian network | Advantages: processing data with noise and non-linear relationship; supporting different data types; high precision; low cost; detecting the epistasis interaction |
| Disadvantages: low learning efficiency; easy to cause local search | ||
Our dataset.
| Type | Data |
|---|---|
| Germplasm resources | 527 inbred lines for association mapping panel (AMP) with different populations (143 lines for NSS, non-stiff-stock; 33 for SS, Stiff-stock; 232 for TST, tropical and semitropical; and the left 119 are regarded as MIXED) |
|
| |
| Transcriptome quantification | 28,769a genes' quantitative expression of maize whole kernel (15 days after pollination, 15 DAP). Using Illumina high throughput sequencing technology, we get about 1 million 60 thousand high quality SNP markers and expression of 28,769 genes, which cover about 70% of the predicted genes in maize genome [ |
|
| |
| Phenotype data | 5 kinds of carotenoid component phenotype data of 482 materials in association mapping panel: |
aGenes filtered as expressed in >50% lines.
Results of different thresholds of Interval discretization method.
| Results | Parameters | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| (0.01,0.01,0.01) | (0.015,0.015,0.015) | (0.02,0.02,0.02) | (0.025,0.025,0.025) | (0.02,0.01,0.01) | ||||||
| 8-value | 2-value | 8-value | 2-value | 8-value | 2-value | 8-value | 2-value | 8-value | 2-value | |
| Number of edges (A) | 7 | 6 | 4 | 3 | 2 | 2 | 3 | 1 | 4 | 2 |
| Number of edges (B) | 18 | 13 | 10 | 9 | 6 | 7 | 6 | 4 | 11 | 8 |
| Ratio of 4 genes | 1.75 | 1.53 | 1.0 | 0.75 | 0.5 | 0.5 | 0.75 | 0.25 | 1.0 | 0.5 |
| Ratio of other 100 genes | 0.18 | 0.1 | 0.1 | 0.09 | 0.06 | 0.07 | 0.06 | 0.04 | 0.11 | 0.08 |
Results of different thresholds of Quantile discretization method.
| Results | Parameters | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| (0.01,0.01, 0.01) | (0.015, 0.015,0.015) | (0.02, 0.02,0.02) | (0.025, 0.025,0.025) | (0.02, 0.01, 0.01) | ||||||
| 8-value | 2-value | 8-value | 2-value | 8-value | 2-value | 8-value | 2-value | 8-value | 2-value | |
| Number of edges (A) | 6 | 3 | 5 | 3 | 5 | 3 | 2 | 1 | 5 | 3 |
| Number of edges (B) | 16 | 14 | 13 | 9 | 10 | 9 | 9 | 7 | 14 | 12 |
| Ratio of 4 genes | 1.5 | 0.75 | 1.25 | 0.75 | 1.25 | 0.75 | 0.5 | 0.25 | 1.25 | 0.75 |
| Ratio of other 100 genes | 0.16 | 0.14 | 0.13 | 0.09 | 0.1 | 0.09 | 0.09 | 0.07 | 0.14 | 0.12 |
Learning efficiency comparison of different thresholds and discretization values.
|
| Parameters | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| (0.01,0.01, 0.01) | (0.015, 0.015,0.015) | (0.02, 0.02,0.02) | (0.025, 0.025,0.025) | (0.02, 0.01, 0.01) | ||||||
| Interval | Quantile | Interval | Quantile | Interval | Quantile | Interval | Quantile | Interval | Quantile | |
| 8-value | 34.854 | 17.324 | 22.618 | 11.341 | 16.022 | 7.631 | 12.042 | 5.531 | 19.81 | 8.266 |
| 7-value | 35.336 | 17.78 | 23.557 | 11.351 | 14.594 | 7.508 | 8.592 | 5.34 | 10.33 | 8.124 |
| 6-value | 18.861 | 17.131 | 12.791 | 10.979 | 8.904 | 7.553 | 6.619 | 5.194 | 10.17 | 8.041 |
| 5-value | 19.644 | 16.924 | 13.017 | 10.885 | 9.16 | 7.257 | 6.717 | 5.066 | 10.564 | 7.871 |
| 4-value | 18.75 | 16.282 | 12.172 | 10.076 | 8.564 | 6.65 | 6.965 | 4.743 | 15.698 | 7.287 |
| 3-value | 29.124 | 15.083 | 18.52 | 9.133 | 11.128 | 6.022 | 8.355 | 4.031 | 13.326 | 6.324 |
| 2-value | 13.624 | 11.109 | 7.91 | 5.951 | 5.133 | 3.735 | 3.227 | 2.125 | 5.577 | 3.922 |
Figure 1Learning result comparison of Interval discretization method.
Figure 2Learning result comparison of Quantile discretization method.
Learning effect comparison of different discretization values.
| Cases | Methods | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| TPDA |
|
|
|
|
|
|
|
|
| |
| 2-value (Interval) | (1.05,0.12) | (1.53,0.06) | (0,88,0.12) | (0.85,0.07) | (1.6,0.1) | (0.98,0.04) | (0.88,0.12) | (1.0,0.07) | (0.88,0.06) | (0.8,0.05) |
| 3-value (Interval) | (1.38,0.17) | (0.0,0.05) | (0.25,0.05) | (0.05,0.08) | (0.35,0.09) | (0.0,0.04) | (0.25,0.05) | (0.4,0.04) | (0.05,0.07) | (0.0,0.04) |
| 4-value (Interval) | (1.3,0.17) | (0.0,0.08) | (0.0,0.03) | (0.38,0.08) | (0.5,0.08) | (0.0,0.03) | (0.0,0.03) | (0.0,0.04) | (0.4,0.08) | (0.0,0.03) |
| 5-value (Interval) | (1.35,0.19) | (0.0,0.04) | (0.0,0.02) | (0.0,0.04) | (0.0,0.04) | (0.0, .01) | (0.0,0.02) | (0.0,0.04) | (0.0,0.04) | (0.0,0.01) |
| 6-value (Interval) | (1.5,0.19) | (0.0,0.02) | (0.0,0.01) | (0.0,0.04) | (0.5,0.03) | (0.0,0.0) | (0.0,0.01) | (0.0,0.04) | (0.0,0.04) | (0.0,0.0) |
| 7-value (Interval) | (1.68,0.19) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) |
| 8-value (Interval) | (1.65,0.19) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) |
| 2-value (Quantile) | (0.88,0.15) | (1.25,0.07) | (0.73,0.11) | (0.68,0.08) | (1.1,0.98) | (0.85,0.04) | (0.75,0.11) | (0.75,0.07) | (0.6,0.07) | (0.55,0.05) |
| 3-value (Quantile) | (1.53,0.17) | (0.5,0.08) | (0.15,0.06) | (0.03,0.1) | (0.5,0.11) | (0.2,0.04) | (0.18,0.06) | (0.0,0.09) | (0.03,0.09) | (0.25,0.04) |
| 4-value (Quantile) | (1.38,0.18) | (0.25,0.1) | (0.0,0.03) | (0.33,0.06) | (1.35,0.08) | (0.0,0.03) | (0.0,0.03) | (0.0,0.04) | (0.28,0.06) | (0.0,0.03) |
| 5-value (Quantile) | (1.55,0.19) | (0.0,0.06) | (0.0,0.02) | (0.43,0.07) | (0.5,0.11) | (0.0,0.01) | (0.0,0.02) | (0.0,0.04) | (0.4,0.07) | (0.0,0.01) |
| 6-value (Quantile) | (1.56,0.20) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) |
| 7-value (Quantile) | (1.78,0.19) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) |
| 8-value (Quantile) | (1.44,0.20) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) | (0.0,0.0) |
Learning efficiency comparison of different discretization values (Interval).
|
| Methods | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| TPDA |
|
|
|
|
|
|
|
|
| |
| 6-value | 18.861 | 0.599 | 0.167 | 0.611 | 0.459 | 0.59 | 0.212 | 0.493 | 0.678 | 0.489 |
| 5-value | 19.644 | 0.692 | 0.208 | 0.897 | 0.463 | 0.693 | 0.246 | 0.652 | 0.880 | 0.484 |
| 4-value | 18.75 | 0.641 | 0.257 | 0.551 | 0.495 | 0.659 | 0.313 | 0.253 | 0.571 | 0.51 |
| 3-value | 29.124 | 0.728 | 0.336 | 0.416 | 0.518 | 0.749 | 0.409 | 0.398 | 0.449 | 0.543 |
| 2-value | 13.624 | 1.332 | 0.601 | 0.795 | 0.563 | 1.349 | 0.730 | 0.474 | 0.84 | 0.616 |
Learning efficiency comparison of different discretization values (Quantile).
|
| Methods | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| TPDA |
|
|
|
|
|
|
|
|
| |
| 5-value | 16.924 | 0.79 | 0.206 | 0.887 | 0.575 | 0.782 | 0.255 | 0.318 | 0.97 | 0.588 |
| 4-value | 16.282 | 1.077 | 0.252 | 1.065 | 0.576 | 1.07 | 0.325 | 0.248 | 1.211 | 0.583 |
| 3-value | 15.083 | 1.125 | 0.321 | 1.357 | 0.533 | 1.125 | 0.398 | 0.384 | 1.944 | 0.582 |
| 2-value | 11.109 | 1.462 | 0.613 | 0.814 | 0.563 | 1.463 | 0.739 | 0.431 | 0.925 | 0.666 |