| Literature DB >> 21283795 |
Fan Zhang1, Hu-Qu Zhai, Andrew H Paterson, Jian-Long Xu, Yong-Ming Gao, Tian-Qing Zheng, Rong-Ling Wu, Bin-Ying Fu, Jauhar Ali, Zhi-Kang Li.
Abstract
Great progress has been made in genetic dissection of quantitative trait variation during the past two decades, but many studies still reveal only a small fraction of quantitative trait loci (QTLs), and epistasis remains elusive. We integrate contemporary knowledge of signal transduction pathways with principles of quantitative and population genetics to characterize genetic networks underlying complex traits, using a model founded upon one-way functional dependency of downstream genes on upstream regulators (the principle of hierarchy) and mutual functional dependency among related genes (functional genetic units, FGU). Both simulated and real data suggest that complementary epistasis contributes greatly to quantitative trait variation, and obscures the phenotypic effects of many 'downstream' loci in pathways. The mathematical relationships between the main effects and epistatic effects of genes acting at different levels of signaling pathways were established using the quantitative and population genetic parameters. Both loss of function and "co-adapted" gene complexes formed by multiple alleles with differentiated functions (effects) are predicted to be frequent types of allelic diversity at loci that contribute to the genetic variation of complex traits in populations. Downstream FGUs appear to be more vulnerable to loss of function than their upstream regulators, but this vulnerability is apparently compensated by different FGUs of similar functions. Other predictions from the model may account for puzzling results regarding responses to selection, genotype by environment interaction, and the genetic basis of heterosis.Entities:
Mesh:
Year: 2011 PMID: 21283795 PMCID: PMC3024316 DOI: 10.1371/journal.pone.0014541
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Molecular quantitative genetics models underlying expression of complex traits.
(A) A generalized molecular quantitative genetics model (1) underlying expression of complex traits. E, , , and represent five major levels {G(I) - G(V)} of the genetic system at the signal transduction regulatory level, epigenetic regulatory level, transcriptional-posttranscriptional regulatory level, and biosynthesis-transportation level. P(I), P(II), P(III) and P(IV) represent the four levels of the phenotypic system with P(I) = metabolites at the biochemical level (Mijk), P(II) = component traits (CTs), P(III) = integrated traits (ITs), and P(IV) = fitness. SS and DS represent the two major types of selection - the stabilizing selection and directional selection defined in the population genetics theory. E and E represent two types of environmental components. E represents the physical environment of a multicellular organism encounters, which contains two parts, E (the basic or average elements in an environment required for the organism to survive) and E (the unique physical features of the environment that deviate from E and require expression of specific signaling pathways for survival). Thus, E is part of the genetic system. E is the random and non-heritable part of any phenotypes measured in the environment [31]–[33]; (B) A simplified molecular quantitative genetics model (2) of a single signaling pathway consisting of a single unit, 2 units, and 6 downstream units underlying expression of trait .
Seven different scenarios under which QTL parameters (main and epistatic effects), in addition to , to be simulated in ideal F2 and RI (DH) populations regarding the number of segregating loci in different layers of a hypothetical signaling pathway defined in model (2) containing 9 FGUs, 1 unit, 2 ( and ) units and 6 downstream (, , , , , and ) units that affect a complex trait, (Fig. 1B).
| Scenario | Number ( |
| Number of FD | N1 | Population size (N2) | Simulated QTL parameters | ||||||||||
|
|
|
|
|
|
|
|
|
| I | II | RI | F2 | ||||
|
| 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | x | 2 | 0 | 0 | 2 | 4 | 9 |
|
|
| 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 4 | 0 | 0 | 4 | 16 | 81 |
|
|
| 0 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 6 | 4 | 0 | 10 | 64 | 729 |
|
|
| 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 7 | 14 | 0 | 21 | 128 | 2187 |
|
|
| 1 | 2 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 6 | 10 | 5 | 21 | 64 | 729 |
|
|
| 0 | 1 | 2 | 0 | 0 | 1 | 3 | 0 | 0 | 7 | 10 | 5 | 22 | 128 | 2187 |
|
|
| 0 | 0 | 0 | 1 | 1 | 1 | 3 | 1 | 1 | 8 | 9 | 4 | 21 | 256 | 6561 |
|
|
| 32 | 16 | 4 | 8 | 4 | 16 | 8 | 4 | 4 | |||||||
‘0’ in the cells indicates the situations where parents are not segregating and have the same functional alleles at all loci within the unit (the unit is functional), whereas ‘x’ indicates the situation where both parents share the same mutant allele(s) at one or more loci within the unit such that the unit is non-functional (the segregating locus or loci in the parents are non-complementary). is the total number of the segregating loci in the populations.
“Effect” is the maximum pathway phenotypic value on trait of the corresponding , and loci (or units) specified in model (2) (Fig. 1B).
FD is functional dependency. There are two types of FD: (I) functional dependency of the downstream FGU loci on loci of their upstream FGUs, and (II) mutual functional dependency of loci from one another in the same FGUs.
N1 is the total number of genetic parameters simulated in the populations; N2 is the population size of the simulated populations and the expected number of possible multilocus genotypes in each simulated population under Hardy-Weinberg equilibrium.
Formula for estimating pathway effects (a) based on QTL additive and epistatic effects (A) and their corresponding portions in the total genotypic variance, in an ideal F2 (under complete dominance at all segregating loci) or RI (DH) population.
| F2 population | RI (DH) population | ||
|
| % of |
| % of |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
is the expected genotypic variance for trait in the population. In an F2, the QTL epistatic effects, , , and represent 1st, 2nd and 3rd order additive by additive epistasis parameters, respectively; % of is the proportion of the total genotypic variation explained by , , , and , respectively. p and q are allelic frequencies of the two alleles at each locus involved, which is 0.5 in the ideal F2 and RI (DH) populations.
Figure 2The putative genetic network underlying plant height (PH) of rice.
It contains 3 major groups of functional genetic units (FGUs) or QTLs controlling rice PH. I - SD1 (GA) mediated FGUs for increased PH; II-1 - a SD1 (GA) repressed FGU for reduced PH; II-2 - SD1 (GA) repressed FGUs with effects on PH of uncertain direction; and III – SD1 (GA) independent FGU. The number under each FGU is its pathway effect estimated using the relevant QTL parameter of Table S11 and its genetic expectation (Tables S12, S13).
Figure 3The putative genetic network underlying submergence tolerance (ST) of rice.
(A) The multilocus structure consisting of 19 FGUs (14 loci and 5 AGs) in 3 major groups plus 3 independent loci identified in the 71 ST NPT/Khazar BC3 ILs (Table S14); (B) the graphic genotypes of the 71 ST NPT/Khazar BC3 ILs at the identified FGUs. The color boxes are homozygous donor (Khazar) alleles and patched boxes are the heterozygotes. An AG is a group of unlinked but perfectly associated loci identified in the selected ST ILs. Different orbits marked with different colors represent different FGUs either as single loci or as AGs. The number under each FGU is the source bin (marker) in the rice genome or the number of loci included in the AG.