| Literature DB >> 30032279 |
Yang-Jun Wen1, Ya-Wen Zhang2, Jin Zhang1, Jian-Ying Feng1, Jim M Dunwell3, Yuan-Ming Zhang1,4.
Abstract
In the genetic system that regulates complex traits, metabolites, gene expression levels, RNA editing levels and DNA methylation, a series of small and linked genes exist. To date, however, little is known about how to design an efficient framework for the detection of these kinds of genes. In this article, we propose a genome-wide composite interval mapping (GCIM) in F2. First, controlling polygenic background via selecting markers in the genome scanning of linkage analysis was replaced by estimating polygenic variance in a genome-wide association study. This can control large, middle and minor polygenic backgrounds in genome scanning. Then, additive and dominant effects for each putative quantitative trait locus (QTL) were separately scanned so that a negative logarithm P-value curve against genome position could be separately obtained for each kind of effect. In each curve, all the peaks were identified as potential QTLs. Thus, almost all the small-effect and linked QTLs are included in a multi-locus model. Finally, adaptive least absolute shrinkage and selection operator (adaptive lasso) was used to estimate all the effects in the multi-locus model, and all the nonzero effects were further identified by likelihood ratio test for true QTL identification. This method was used to reanalyze four rice traits. Among 25 known genes detected in this study, 16 small-effect genes were identified only by GCIM. To further demonstrate GCIM, a series of Monte Carlo simulation experiments was performed. As a result, GCIM is demonstrated to be more powerful than the widely used methods for the detection of closely linked and small-effect QTLs.Entities:
Keywords: adaptive lasso; genome-wide composite interval mapping; linked QTLs; mixed linear model; multi-locus model; small-effect QTL
Year: 2019 PMID: 30032279 PMCID: PMC6917223 DOI: 10.1093/bib/bby058
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622
Previously reported genes for yield/plant (YIELD), tillers/plant (TILLER), grains/panicle (GRAIN) and thousand grain weight (KGW) in rice using GCIM-random, GCIM-fixed, ICIM and CIM methods
| Trait | Gene | MSU_locus | Chr | Pos (Mb) | Marker associated | GCIM-random (A) | GCIM-fixed (B) | ICIM (C) | CIM (D) | Reference | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LOD | Add | Dom |
| LOD | Add | Dom |
| LOD | Add | Dom |
| LOD | Add | Dom |
| |||||||
| YIELD |
| LOC_Os01g10110 | 1 | 5.667 | Bin40 | 10.52 | −1.29 | 0.00 | 1.13 | Ashikari | ||||||||||||
|
| 1LOC_Os01g47900 | 1 | 28.397 | Bin135 | 12.79 | 1.47 | 0.00 | 1.71 |
1Zou | |||||||||||||
|
| 2LOC_Os01g54860 | |||||||||||||||||||||
|
2Huo | ||||||||||||||||||||||
|
| LOC_Os02g14720 | 2 | 8.810 | Bin268 | 3.00 | 0.00 | −0.58 | 0.13 | Song | |||||||||||||
|
| LOC_Os02g56610 | 2 | 34.340 | Bin339 | 12.82 | 0.00 | 1.40 | 0.77 | Li | |||||||||||||
| 34.738 | Bin344 | 10.68 | 0.00 | −1.28 | 0.64 | |||||||||||||||||
|
| LOC_Os03g08170 | 3 | 4.894A, B | Bin378A, B, D | 5.41 | 0.00 | −0.84 | 0.28 | 5.91 | 0.00 | −1.86 | 2.20 | 4.72 | 0.18 | −3.06 | 7.23 | 4.33 | 0.78 | −3.17 | 6.44 | Ramegowda | |
| 4.9D, 5C | Bin378∼Bin379C | |||||||||||||||||||||
|
| LOC_Os03g29380 | 3 | 15.597 | Bin433 | 6.21 | 0.86 | 0.00 | 0.58 | Fan | |||||||||||||
|
| LOC_Os04g33740 | 4 | 19.644 | Bin617 | 13.39 | 0.00 | −1.51 | 0.90 | Wang E | |||||||||||||
|
| LOC_Os05g09520 | 5 | 3.438 | Bin722 | 5.38 | 0.87 | 0.00 | 0.60 | Liu | |||||||||||||
|
| LOC_Os06g06050 | 6 | 3.291 | Bin855 | 6.23 | 0.93 | 0.00 | 0.68 | Ishikawa | |||||||||||||
|
| 1LOC_Os06g44100 | 6 | 24.309 | Bin936 | 12.39 | 0.00 | −1.37 | 0.74 |
1Song | |||||||||||||
|
| 2LOC_Os06g45460 | |||||||||||||||||||||
|
| LOC_Os07g05900 | 7 | 2.817 | Bin989 | 10.58 | 1.31 | 0.00 | 1.36 | Tan | |||||||||||||
|
| LOC_Os07g15770 | 7 | 8A, C | Bin1003∼Bin1004A, C | 44.84 | 2.79 | 3.41 | 10.75 | 7.38 | 2.05 | 2.21 | 8.44 | 3.29 | −0.22 | 2.52 | 5.10 | 6.91 | −2.32 | 2.63 | 17.17 | Xue | |
| 12B | Bin1007∼Bin1008B | |||||||||||||||||||||
| 12.4D | Bin1007D | |||||||||||||||||||||
|
| LOC_Os08g31470 | 8 | 20.696 | Bin1143 | 8.85 | −1.10 | 0.95 | 1.31 | Zhao | |||||||||||||
| TILLER |
| LOC_Os06g06050 | 6 | 4A | Bin859∼Bin860A | 8.18 | 0.00 | −0.56 | 2.44 | 8.28 | 0.00 | −0.63 | 3.64 | 6.51 | −0.26 | −0.84 | 0.47 | Ishikawa | ||||
| 5.164B, 5.2D | Bin867B, D | |||||||||||||||||||||
|
| LOC_Os06g49080 | 6 | 24.666 | Bin938 | 2.86 | 0.25 | 0.00 | 0.93 | Wang L | |||||||||||||
|
| LOC_Os09g35980 | 9 | 19.55A, B | Bin1262A, B | 4.34 | 0.24 | 0.80 | 5.81 | 3.39 | 0.00 | 0.35 | 1.15 | Yu | |||||||||
| 5.761B | Bin42B | |||||||||||||||||||||
| GRAIN |
| LOC_Os01g10110 | 1 | 6C | Bin43∼Bin44C | 5.45 | 4.06 | −2.88 | 3.31 | 5.44 | 4.66 | −1.18 | 3.04 | 6.11 | −6.00 | −2.69 | 6.00 | 5.58 | −6.01 | −3.08 | 3.11 | Ashikari |
| 6.04A, 6.2D | Bin44A, D | |||||||||||||||||||||
|
| LOC_Os01g54860 | 1 | 28.442 | Bin136 | 4.76 | 3.23 | 0.00 | 1.68 | Huo | |||||||||||||
|
| LOC_Os07g05900 | 7 | 7 | Bin998∼Bin999 | 10.65 | −8.53 | 2.46 | 10.89 | Tan | |||||||||||||
|
| LOC_Os07g15770 | 7 | 8.4D, 8.407A | Bin1003D, Bin1004A | 15.22 | −3.40 | 5.13 | 3.96 | 14.48 | 6.79 | 5.72 | 8.45 | 10.81 | −8.20 | 5.37 | 18.14 | Xue | |||||
| 8.756B | Bin1005B | |||||||||||||||||||||
| KGW |
| LOC_Os03g29380 | 3 | 16.224A, 16.7D | Bin437A, Bin438D | 15.02 | −0.54 | 0.00 | 4.30 | 6.39 | 0.33 | −0.38 | 2.63 | 28.06 | −0.98 | −0.15 | 16.00 | 21.06 | −1.19 | −0.30 | 16.83 | Fan |
| 17B, C | Bin440∼Bin441B, C | |||||||||||||||||||||
|
| LOC_Os05g09520 | 5 | 5A, B, C, 5.3D | Bin728∼Bin729A, B, C, Bin729D | 33.11 | 1.00 | 0.00 | 14.76 | 33.56 | 0.97 | −0.01 | 13.84 | 25.01 | 0.96 | −0.16 | 13.78 | 13.65 | 0.96 | −0.23 | 15.94 | Liu | |
|
| LOC_Os08g39890 | 8 | 25C | 1Bin1151∼Bin1152C | 15.75 | 0.40 | 0.15 | 2.87 | Jiao | |||||||||||||
| 28C | 2Bin1175∼Bin1176C | 5.08 | −0.25 | 0.00 | 0.96 | 8.25 | −0.38 | 0.00 | 2.13 | 210.11 | −0.15 | −0.17 | 4.80 | |||||||||
| 28.118A, B | Bin1176A, B | |||||||||||||||||||||
The individuals with missing phenotypes were excluded. The critical value for significance was for all the methods. The data set was derived from Zhou et al. (2012). chr: chromosome; LOD: logarithm of odds.
Figure 1.Multi-locus QTL mapping for yield per plant (YIELD) in rice using CIM, ICIM, GCIM-random and GCIM-fixed methods. The data set is derived from Zhou et al. [32].
Figure 2.Comparison of statistical powers of QTL detection in the first (A), second (B) and third (C) simulation experiments using CIM, ICIM, GCIM-random and GCIM-fixed methods.
The P-values in paired t-tests of differences for power and mean absolute deviation (MAD) between the new (GCIM-random and GCIM-fixed) and current (ICIM and CIM) methods
| QTL | GCIM-random (A) and GCIM- fixed (B) | GCIM-random (A) and ICIM (B) | GCIM-random (A) and CIM (B) | GCIM-fixed (A) and ICIM (B) | GCIM- fixed (A) and CIM (B) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Power | MAD (Add) | MAD (Dom) | Power | MAD (Add) | MAD (Dom) | Power | MAD (Add) | MAD (Dom) | Power | MAD (Add) | MAD (Dom) | Power | MAD (Add) | MAD (Dom) | |
| The first simulation experiment (phenotype = mean + 19 main-effect QTLs + residual error with normal distribution) | |||||||||||||||
| All | 3e-04*** (5.711) | 0.9645 (−0.001) | 0.0000*** (0.033) | 1e-04*** (30.026) | 0.9046 (0.007) | 0.011* (−0.141) | 0.0000*** (43.447) | 0.0232 (−0.211) | 4e-04*** (−0.328) | 1e-04*** (24.316) | 0.9020 (0.008) | 0.0043** (−0.174) | 0.0000*** (37.737) | 0.0240* (−0.211) | 2e-04*** (−0.361) |
| Small | 0.0606 (6.833) | 0.0729 (0.016) | 0.0293* (0.063) | 0.0617 (17.167) | 0.1564 (−0.138) | 0.0047** (−0.258) | 0.0260* (26.833) | 0.0732 (−0.302) | 0.0026** (−0.452) | 0.080 (10.333) | 0.1302 (−0.154) | 0.0031** (−0.321) | 0.0473* (20.000) | 0.0671 (−0.318) | 0.0011** (−0.515) |
| Large | 1.0000 (0.000) | 0.2078 (0.018) | 0.2406 (0.023) | 1.0000 (0.000) | 0.7253 (0.168) | 0.4038 (0.200) | 0.3608 (2.750) | 0.7512 (0.161) | 0.4245 (0.271) | 1.0000 (0.000) | 0.7477 (0.150) | 0.4617 (0.176) | 0.3608 (2.750) | 0.7732 (0.143) | 0.4663 (0.247) |
| Linked | 0.0016** (6.286) | 0.6592 (−0.007) | 0.0014** (0.028) | 1e-04*** (37.071) | 0.8206 (0.015) | 0.0088** (−0.164) | 0.0000*** (52.821) | 0.0288 (−0.245) | 2e-04*** (−0.386) | 2e-04*** (30.786) | 0.759 (0.022) | 0.0053** (−0.192) | 1e-04*** (46.536) | 0.0348* (−0.238) | 1e-04*** (−0.414) |
| The second simulation experiment (phenotype = mean + 19 main-effect QTLs + polygenic background + residual error with normal distribution) | |||||||||||||||
| All | 0.0029** (5.211) | 2e-04*** (0.039) | 2e-04*** (0.059) | 0.0000*** (36.158) | 0.8896 (0.009) | 0.0289* (−0.122) | 0.0000*** (50.474) | 0.0415* (−0.242) | 4e-04*** (−0.345) | 0.0000*** (30.947) | 0.6424 (−0.03) | 0.0018** (−0.181) | 0.0000*** (45.263) | 0.0179* (−0.280) | 1e-04*** (−0.404) |
| Small | 0.0397* (11.500) | 0.0394* (0.054) | 0.0517 (0.092) | 0.0788 (28.833) | 0.1865 (−0.156) | 0.0098** (−0.220) | 0.0469* (35.000) | 0.1055 (−0.363) | 0.0049** (−0.430) | 0.1382 (17.333) | 0.1405 (−0.211) | 8e-04*** (−0.312) | 0.0717 (23.500) | 0.0908 (−0.417) | 0.0085** (−0.522) |
| Large | 0.5000 (−0.250) | 0.3454 (0.060) | 0.3431 (0.150) | 0.5000 (1.250) | 0.7323 (0.187) | 0.0726 (0.325) | 0.5000 (0.250) | 0.8306 (0.140) | 0.1576 (0.421) | 0.5000 (1.500) | 0.7957 (0.127) | 0.4003 (0.175) | 0.5000 (0.500) | 0.8945 (0.080) | 0.3997 (0.270) |
| Linked | 0.0207* (4.643) | 0.0075** (0.032) | 3e-04*** (0.039) | 0.0000*** (42.714) | 0.7784 (0.018) | 0.0052** (−0.164) | 0.0000*** (60.964) | 0.0615 (−0.270) | 0.0000*** (−0.436) | 0.0000*** (38.071) | 0.8481 (−0.014) | 0.0018** (−0.203) | 0.0000*** (56.321) | 0.0367* (−0.303) | 0.0000*** (−0.475) |
| The third simulation experiment (phenotype = mean + 19 main-effect QTLs + residual error with log-normal distribution) | |||||||||||||||
| All | 0.0032** (3.789) | 0.7010 (−0.003) | 0.0082** (0.019) | 0.0000***(27.868) | 0.040** (−0.123) | 1e-04*** (−0.207) | 0.0000*** (43.947) | 0.0226* (−0.205) | 3e-04*** (−0.340) | 0.0000*** (24.079) | 0.0534 (−0.120) | 1e-04*** (−0.227) | 0.0000*** (40.158) | 0.0218* (−0.203) | 2e-04*** (−0.360) |
| Small | 0.1060 (6.167) | 0.1147 (0.015) | 0.0052** (0.024) | 0.0298* (17.000) | 0.0503 (−0.232) | 0.0075** (−0.329) | 0.0179* (24.667) | 0.084 (−0.231) | 0.0025** (−0.514) | 0.1296 (10.833) | 0.0524 (−0.247) | 0.0071** (−0.352) | 0.0743 (18.500) | 0.0826 (−0.246) | 0.0024** (−0.538) |
| Large | 0.5000 (−0.250) | 0.6107 (−0.010) | 0.4001 (−0.010) | 0.5000 (−0.250) | 0.9015 (0.042) | 0.8582 (0.061) | 1.0000 (0.000) | 0.7854 (0.121) | 0.4751 (0.262) | 1.0000 (0.000) | 0.8839 (0.053) | 0.8406 (0.071) | 0.5000 (0.250) | 0.7772 (0.131) | 0.4727 (0.271) |
| Linked | 0.0152* (3.857) | 0.5446 (−0.005) | 0.0176 (0.023) | 0.0000*** (34.214) | 0.0856 (−0.123) | 1e-04*** (−0.22) | 0.0000*** (54.357) | 0.0281* (−0.246) | 2e-04*** (−0.389) | 0.0000*** (30.357) | 0.1109 (−0.118) | 0.0000*** (−0.242) | 0.0000**** (50.5) | 0.0263* (−0.241) | 1e-04**** (−0.412) |
Note: *, ** and ***: significance at the 0.05, 0.01 and 0.001 levels, respectively.
Note: Small QTL: QTL1, QTL11 and QTL15; large QTL: QTL14 and QTL19; linked QTL: QTL2∼QTL10, QTL12∼QTL13 and QTL16∼QTL18. The differences (A−B) were in the brackets.
Figure 3.FPRs of QTL detection in the first simulation experiment plotted against Type I error (in a log10 scale) for CIM, ICIM, GCIM-random and GCIM-fixed methods.
Comparison of four QTL mapping methods and their packages
| Case | GCIM-random | GCIM-fixed | ICIM | CIM |
|---|---|---|---|---|
| Model | Multi-locus model | Multi-locus model | Single-locus model | Single-locus model |
| Model transformation | FASTmrEMMA algorithm | FASTmrEMMA algorithm | NA | Interval mapping for |
| QTL effect | Random | Fixed | Fixed | Fixed |
| Estimation of QTL effect | REML or ML | REML or ML | ML | ML |
| Polygenic background control | Polygenic additive and dominant variances via mixed model framework of GWAS | Polygenic additive and dominant variances via mixed model framework of GWAS | The associated markers (cofactors), except the two markers flanking the current mapping interval; their effects are estimated at each position of genome scanning | The cofactors except for the two markers flanking the current mapping interval; the effects for all the cofactors are estimated only one time |
| No. of variance components | Five | Three | NA | NA |
| Polygenic-to-residual variance ratio | Fixed | Fixed | NA | NA |
| Running time | Fast | Fast | Fast | Slow |
| Software | GCIM-random and GCIM-fixed: QTL.gCIMapping ( | |||
| ICIM: QTL IciMapping ( | ||||
| CIM: Windows QTL Cartographer ( | ||||