| Literature DB >> 16885241 |
Raphaël Kiekens1, Annelies Vercauteren, Beatrijs Moerkerke, Els Goetghebeur, Hilde Van Den Daele, Roel Sterken, Martin Kuiper, Fred van Eeuwijk, Marnik Vuylsteke.
Abstract
Large-scale screening studies carried out to date for genetic variants that affect gene regulation are generally limited to descriptions of differences in allele-specific expression (ASE) detected in vivo. Allele-specific differences in gene expression provide evidence for a model whereby cis-acting genetic variation results in differential expression between alleles. Such gene surveys for regulatory variation are a first step in identifying the specific nucleotide changes that govern gene expression differences, but they leave the underlying mechanisms unexplored. Here, we propose a quantitative genetics approach to perform a genome-wide analysis of ASE differences (GASED). The GASED approach is based on a diallel design that is often used in plant breeding programs to estimate general combining abilities (GCA) of specific inbred lines and to identify high-yielding hybrid combinations of parents based on their specific combining abilities (SCAs). In a context of gene expression, the values of GCA and SCA parameters allow cis- and trans-regulatory changes to be distinguished and imbalances in gene expression to be ascribed to cis-regulatory variation. With this approach, a total of 715 genes could be identified that are likely to carry allelic polymorphisms responsible for at least a 1.5-fold allelic expression difference in a total of 10 diploid Arabidopsis thaliana hybrids. The major strength of the GASED approach, compared to other ASE detection methods, is that it is not restricted to genes with allelic transcript variants. Although a false-positive rate of 9/41 was observed, the GASED approach is a valuable pre-screening method that can accelerate systematic surveys of naturally occurring cis-regulatory variation among inbred lines for laboratory species, such as Arabidopsis, mouse, rat and fruitfly, and economically important crop species, such as corn.Entities:
Mesh:
Year: 2006 PMID: 16885241 PMCID: PMC1540733 DOI: 10.1093/nar/gkl510
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Experimental design, consisting of 30 two-dye CATMA v2.2 microarray (24) experiments to examine transcript levels in RNA samples collected from a diallel experiment in A.thaliana with 5 parental lines and 10 F1 hybrids. The five hybridized parental lines were the homozygous accessions Col, Ler, Cvi, Ws and C24. F1 hybrids were obtained by making all pairwise crosses between the five parental lines. The 10 hybrid samples hybridized consisted of pooled progeny from reciprocal crosses. The microarray contained 22 494 unique GSTs from Arabidopsis and is represented by an arrow, connecting the two sampled genotypes hybridized to it. The samples at the tail and head of each arrow were labeled with Cy3 and Cy5, respectively. The two replicates are represented by differently colored arrows.
Figure 2GASED strategy to differentiate between cis- and trans-regulatory changes on a large scale and to assign imbalances in allelic expression to cis-regulatory variation using a classical diallel crossing scheme.
Number of features with a non-significant SCA effect, a significant difference in allelic mean expression, and a significant 1.5- and 2-fold change in allelic expression detected in each of the 10 hybrids
| Hybrid | ||||
|---|---|---|---|---|
| ( | ( | |Δ1| = 0.292 | |Δ1| = 0.5 | |
| ( | ( | |||
| Col×L | 2051 | 245 | 87 | 31 |
| Col×Cvi | 2804 | 448 | 218 | 77 |
| Col×Ws | 1196 | 191 | 89 | 41 |
| Col×C24 | 1222 | 262 | 108 | 39 |
| L | 1944 | 368 | 151 | 41 |
| L | 2796 | 257 | 96 | 29 |
| L | 2292 | 375 | 175 | 51 |
| Cvi×Ws | 1718 | 368 | 155 | 46 |
| Cvi×C24 | 1301 | 340 | 150 | 46 |
| Ws×C24 | 2346 | 614 | 210 | 60 |
| Total | 4047 | 1574 | 715 | 236 |
Figure 3Volcano plot contrasting the significance (−log10Q on the ordinate) and the magnitude of the difference in allelic mean expression between Ler and Cvi (data on log2 scale). The bottom horizontal dashed line corresponds to the FDR acceptance level of Q = 0.001 (−log10 = 3). The vertical dashed lines demarcate the 0-, 2- and 4-fold change in allelic expression (data on the log2 scale). Triangles represent the 50 genes with sij = 0 at the FNR acceptance level of 0.0005 and carrying evidence in favor of a 2-fold difference in allelic mean expression, of which 41 genes met the FDR acceptance level of Q = 0.001. Circles represent genes of which the allelic expression is not affected by trans-regulatory variants (s = 0; FNR < 0.0005), but do not carry evidence in favor of a 2-fold difference in allelic mean expression.
Figure 4Mapping eQTL that modulate the expression of 41 genes identified in the Ler×Cvi hybrid as having an allelic polymorphism responsible for a 2-fold difference in allelic mean expression. Variation in the transcript levels across 18 Ler/Cvi RIL samples was correlated with the presence of the Ler or Cvi allele at the target genomic bin. Each cross represents a single transcript. The physical position of each transcript is indicated on the ordinate, and the position of the locus most strongly associated with the variation of the transcript levels on the abscissa. Transcripts on the diagonal display the strongest association to their own locus at P < 0.05, confirming an allelic polymorphism responsible for the change in allelic mean expression. Transcripts off the diagonal display the strongest association to a locus distant to their own locus at P < 0.05, failing to confirm their cis-regulated ASE as judged by the GASED procedure. To represent the data graphically, the Arabidopsis genome was divided into 69 genomic bins. Actual chromosomal positions are indicated at the top.