| Literature DB >> 23405862 |
Jean-Luc Da Lage1, Manfred Binder, Aurélie Hua-Van, Stefan Janeček, Didier Casane.
Abstract
BACKGROUND: Increasing genome data show that introns, a hallmark of eukaryotes, already existed at a high density in the last common ancestor of extant eukaryotes. However, intron content is highly variable among species. The tempo of intron gains and losses has been irregular and several factors may explain why some genomes are intron-poor whereas other are intron-rich.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23405862 PMCID: PMC3584928 DOI: 10.1186/1471-2148-13-40
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Tree of glycosyl hydrolases of the GH13 family. Full gene and species names and taxonomic positions are given in Additional file 3: Table S3. GH13 subfamilies [58] are colour-labelled and indicated by their numbers next to the species names. The fungal genes studied in this work are indicated by an asterisk. Bootstrap values are shown along the branches.
Figure 2Intron positions in the transferred α-amylase gene homologous to Phchr1|7087| found in Fungi. Pink dots: phase 0 introns; green dots: phase 1 introns; blue dots: phase 2 introns. Shaded regions represent unknown sequences. The black bar is a region of uncertain annotation. Asterisks indicate possible cases of intron sliding. CBM20 indicates the presence of an additional carbohydrate binding module of the CBM20 family. The number of introns is meant without the CBM20 extension.
Figure 3Reconstitution of intron gain and loss events mapped on a species tree. Branch lengths are proportional to time, the time scale in million years is shown below. Numbers along the branches are intron positions. HGT: horizontal transfer event. Black numbers: intron gains; red numbers: intron losses in all gene copies in a clade; green numbers: intron loss in some, but not all gene copies in a clade; blue numbers: intron gains specific to Stehi1|78757|. Black crosses show complete gene losses.
Figure 4Apparent intron gain and loss rates per million year and per lineage. Gain rates are in blue, loss rates are in red. The X axis is graduated in million years from the present.
Figure 5Average intron sizes at positions where more than ten values were available. Error bars indicate standard deviations. The long bar at position 6 is due to a single long intron in Stehi1|83072|. Red line : average intron size for the whole genomes of all the species studied. Dashed lines show standard deviation.
Figure 6Consensus sequences at positions −2 and −1, and +1 and +2 around intron positions, drawn with Weblogo 3.2[72](http://weblogo.threeplusone.com/create.cgi). A: all positions, intron present. B: all positions, intron absent. C: sequence at the 17 oldest positions when an intron is present; D: sequence at the 17 oldest positions in the absence of intron, i. e. after intron loss; E: sequence at the recent positions 2, 3, 4, 11, 12, 13, 14, 15, 16, 21, 25, 31, 37, 39a, 47, 48, 51, 52, 55, 57 in the presence of intron; F: same positions as E, in the absence of intron. Introns of Stehi_78757, of Jaapia argillacea and of Bjerkandera adusta were not included. n is the number of sequences. Y-axis is graduated in bits of information. Error bars are Bayesian 95% confidence intervals.