| Literature DB >> 16110339 |
Cathal Seoighe1, Chris Gehring, Laurence D Hurst.
Abstract
Why do highly expressed genes have small introns? This is an important issue, not least because it provides a testing ground to compare selectionist and neutralist models of genome evolution. Some argue that small introns are selectively favoured to reduce the costs of transcription. Alternatively, large introns might permit complex regulation, not needed for highly expressed genes. This "genome design" hypothesis evokes a regionalized model of control of expression and hence can explain why intron size covaries with intergene distance, a feature also consistent with the hypothesis that highly expressed genes cluster in genomic regions with high deletion rates. As some genes are expressed in the haploid stage and hence subject to especially strong purifying selection, the evolution of genes in Arabidopsis provides a novel testing ground to discriminate between these possibilities. Importantly, controlling for expression level, genes that are expressed in pollen have shorter introns than genes that are expressed in the sporophyte. That genes flanking pollen-expressed genes have average-sized introns and intergene distances argues against regional mutational biases and genomic design. These observations thus support the view that selection for efficiency contributes to the reduction in intron length and provide the first report of a molecular signature of strong gametophytic selection.Entities:
Mesh:
Year: 2005 PMID: 16110339 PMCID: PMC1186733 DOI: 10.1371/journal.pgen.0010013
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Mean Intron Length as a Function of Intron Position, Counting from the 5′ End of the Gene
Intron length was nearly constant for introns 5 to 10. Proximity to the 3′ end of the gene was not correlated with intron length. Error bars show ± twice the standard error. The data shown are for genes with exactly ten introns so that positional effects from the 3′ ends can also be assessed.
Summary of the Dataset
Each cell represents the mean value of the quantity in the column for the subset of genes indicated in the row. The complete dataset used is available as Dataset S1 with this article.
aSAGE data
bMicroarray data
Figure 2Histograms and Quantile–Quantile Plots of Mean Distal Intron Length
(A) Histogram for genes expressed in the sporophyte microarray datasets.
(B) Histogram for genes expressed in the gametophyte but not the sporophyte microarray datasets.
(C) Quantile–quantile plot of introns from all pollen-expressed and all sporophyte-expressed genes derived from the SAGE dataset. Quantiles of the intron length distributions for genes expressed in the gametophyte and sporophyte are on the x- and y-axes, respectively.
(D) Quantile–quantile plot of introns from pollen-specific and sporophyte-specific genes derived from the microarray dataset.