| Literature DB >> 24420573 |
Jia Wang1, Huihui Yu, Xiaoyu Weng, Weibo Xie, Caiguo Xu, Xianghua Li, Jinghua Xiao, Qifa Zhang.
Abstract
The ability to reveal the regulatory architecture of genes at the whole-genome level by constructing a regulatory network is critical for understanding the biological processes and developmental programmes of organisms. Here, we conducted an eQTL-guided function-related co-expression analysis to identify the putative regulators and construct gene regulatory network. We performed an eQTL analysis of 210 recombinant inbred lines (RILs) derived from a cross between two indica rice lines, Zhenshan 97 and Minghui 63, the parents of an elite hybrid, using data obtained by hybridizing RNA samples of flag leaves at the heading stage with Affymetrix whole-genome arrays. Making use of an ultrahigh-density single-nucleotide polymorphism bin map constructed by population sequencing, 13 647 eQTLs for 10 725 e-traits were detected, comprising 5079 cis-eQTLs (37.2%) and 8568 trans-eQTLs (62.8%). The analysis revealed 138 trans-eQTLs hotspots, each of which apparently regulates the expression variations of many genes. Co-expression analysis of functionally related genes within the framework of regulator-target relationships outlined by the eQTLs led to the identification of putative regulators in the system. The usefulness of the strategy was demonstrated with the genes known to be involved in flowering. We also applied this strategy to the analysis of QTLs for yield traits, which also suggested likely candidate genes. eQTL-guided co-expression analysis may provide a promising solution for outlining a framework for the complex regulatory network of an organism.Entities:
Keywords: Gene expression; network construction; phenotype; quantitative genomics; regulation.
Mesh:
Year: 2014 PMID: 24420573 PMCID: PMC3935569 DOI: 10.1093/jxb/ert464
Source DB: PubMed Journal: J Exp Bot ISSN: 0022-0957 Impact factor: 6.992
Fig. 1.eQTLs identified using flag leaf tissue at the heading stage of RILs from a cross between Zhenshan 97 and Minghui 63. The positions of 13 647 eQTLs for 10 725 e-traits in the genome are shown. The x-axis shows the physical positions of expressed probe sets (e-traits) in the genome, and the y-axis the physical positions of eQTLs. Each dot represents an eQTL detected. The diagonal indicates possible cis-eQTLs, which are co-located with their corresponding probe sets. All the off-diagnal points indicate trans-eQTLs. The 12 rice chromosomes are separated by grey lines. The color key reflects the LOD scores (LOD scores >30 are set to 30). Most of the dots in higher scores are located in the diagonal representing cis-eQTLs. Multiple horizontal bands (trans-eQTL hotspots) are shown on four chromosomes (1, 3, 6, and 9), which suggests that transcript levels for many of the probe sets are associated with the polymorphisms in these regions. LOD scores >4.95 were adopted as the cut-off point for eQTLs.
Statistics of the eQTLs on each chromosome
| Chr. | eQTLs on chr. |
|
| Chr. size (Mb) | Expected | SR | Chr. size (cM) | Expected | SR | cM/Mb |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2563 | 943 | 1620 | 43.3 | 1588 | 24.5 | 201 | 1684 | 21.4 | 4.6 |
| 2 | 929 | 463 | 466 | 35.9 | 1316 | –10.7 | 175 | 1473 | –14.2 | 4.9 |
| 3 | 1855 | 579 | 1276 | 36.4 | 1335 | 14.3 | 188 | 1574 | 7.1 | 5.2 |
| 4 | 706 | 379 | 327 | 35.3 | 1294 | –16.4 | 127 | 1068 | –11.1 | 3.6 |
| 5 | 608 | 373 | 235 | 29.9 | 1096 | –14.7 | 116 | 974 | –11.7 | 3.9 |
| 6 | 2453 | 410 | 2043 | 31.2 | 1144 | 38.7 | 144 | 1212 | 35.6 | 4.6 |
| 7 | 1135 | 290 | 845 | 29.7 | 1089 | 1.4 | 135 | 1137 | –0.1 | 4.6 |
| 8 | 610 | 326 | 284 | 28.4 | 1041 | –13.4 | 120 | 1010 | –12.6 | 4.2 |
| 9 | 1093 | 356 | 737 | 23 | 843 | 8.6 | 107 | 900 | 6.4 | 4.7 |
| 10 | 509 | 332 | 177 | 23.1 | 847 | –11.6 | 85 | 716 | –7.1 | 3.7 |
| 11 | 739 | 380 | 359 | 28.5 | 1045 | –9.5 | 117 | 980 | –7.7 | 4.1 |
| 12 | 447 | 248 | 199 | 27.5 | 1008 | –17.7 | 109 | 918 | –15.5 | 4.0 |
| 13647 | 5079 | 8568 | 372.2 | 13647 | 1625 | 13647 | 4.4 |
Expected number of eQTLs based on chromosome size from physical map. χ2 = 3690.2 (P < 2.2e–16) for the test of goodness of fit between the observed and expected numbers of eQTLs on the 12 chromosomes.
SR: for physical map, standardized residue [=(observed – expected)/√expected], which follows a normal distribution asymptotically. An absolute SR value >2.33 indicates statistical significance at P<0.01. A positive value indicates that the observed number is greater than expected.
Expected number of eQTLs based on chromosome size from genetic map. χ2 = 2801.18 (P < 2.2e–16).
SR: standardized residue for genetic map.
Fig. 2.Schematic diagram of the procedure for identification of master regulators and construction of regulatory network for functionally related genes. (A) Identification of master regulators in the trans-eQTL hotspots by eQTL mapping. (B) eQTL map of functionally related genes. (C) The process of seeking master regulators and their targets according to the expression correlation between putative regulators and candidate targets for constructing a regulatory network (see main text for details).
Information on the six putative master regulators genes, associated with more than 100 genes as targetsThe candidate regulators were suggested to be the gene with cis-eQTLs that co-mapped with trans-eQTLs of many co-expressed genes (P<0.01).
| Regulator |
| Bin | Chr. | LOD | Inf. Mb | Sup. Mb | Var | Annotation |
|---|---|---|---|---|---|---|---|---|
| Os.3539.1.S1_at | 412 | Bin851 | 6 | 5.61 | 2.89 | 4.50 | 7.45% | Vesicle-associated membrane protein, putative, expressed |
| Os.12583.1.S1_at | 213 | Bin389 | 3 | 14.77 | 8.66 | 9.53 | 22.27% | Outer-membrane protein, OMP85 family, putative, expressed |
| Os.53449.1.A1_at | 204 | Bin395 | 3 | 6.80 | 8.75 | 10.04 | 11.39% | Expressed protein |
| Os.9960.1.S1_at | 145 | Bin857 | 6 | 23.56 | 3.29 | 4.02 | 35.33% | WD-40 repeat family protein, putative, expressed |
| Os.16410.1.S1_s_at | 127 | Bin853 | 6 | 4.96 | 2.89 | 4.50 | 4.61% | BSD domain containing protein, expressed |
| Os.27513.1.A1_a_at | 103 | Bin36 | 1 | 5.45 | 4.75 | 6.04 | 5.17% | Phosphoenolpyruvate carboxylase, putative, expressed |
N, the number of co-expression probe sets with trans-eQTLs located in the 1.5 LOD-drop interval of corresponding cis-eQTL of the e-trait.
LOD, the LOD value for the cis-eQTL for the regulator.
Inf.Mb, the inferior position for the 1.5 LOD-drop interval of cis-eQTLs.
Sup.Mb, the superior position for the 1.5 LOD-drop interval of cis-eQTLs.
Var, the expression variation explained by the cis-eQTL for the regulator.
Annotation, the gene annotation for the regulator.
The eight regulatory groups identified in the analysis
| Group | Members | Number changed |
| Gene | MUS6.1 |
|---|---|---|---|---|---|
| Os.15230.1.S1_at | 24 | 19 | 1.14E–05 |
| LOC_Os06g06300 |
| Os.10204.1.S1_at | 10 | 7 | 1.08E–04 |
| LOC_Os01g09590 |
| Os.12795.1.S1_at | 2 | 2 | 7.68E–03 |
| LOC_Os11g05930 |
| OsAffx.28467.1.S1_at | 11 | 11 | 0.016 |
| LOC_Os07g15770 |
| Os.12674.1.S1_s_at | 2 | 2 | 0.022 |
| LOC_Os03g11340 |
| Os.1189.1.S1_at | 6 | 6 | 0.032 |
| LOC_Os06g16370 |
| Os.30077.2.S1_at | 5 | 3 | 0.033 |
| LOC_Os03g16210 |
| Os.2720.1.S1_at | 3 | 1 | 0.042 |
| LOC_Os10g41440 |
Group, All local regulatory genes were classified as group names.
Members, the number of e-traits with trans-eQTLs co-mapped with the corresponding group.
Number changed, the number of candidate targets that may be controlled by the regulator.
P value changed, the probability of change, indicating the likelihood of observing the members of the group that high up in the list by chance.
Fig. 3.The regulatory network of genes involved in flower development and the flowering time pathway. Genes in yellow are putative regulators, while other genes connected to them in black are the candidate targets. Arrows indicate positive regulation and bars indicate negative regulation.
Fig. 4.qPCR verification for the expression abundance of candidate targets in NIL(mh7) and Zhenshan 97. Flag leaves at the heading date were used for the analysis. The y-axis represents relative expression levels. Means ±standard error are based on six biological replicates.