| Literature DB >> 17651506 |
Uffe Hellsten1, Mustafa K Khokha, Timothy C Grammer, Richard M Harland, Paul Richardson, Daniel S Rokhsar.
Abstract
BACKGROUND: Ancient whole genome duplications have been implicated in the vertebrate and teleost radiations, and in the emergence of diverse angiosperm lineages, but the evolutionary response to such a perturbation is still poorly understood. The African clawed frog Xenopus laevis experienced a relatively recent tetraploidization ~40 million years ago. Analysis of the considerable amount of EST sequence available for this species together with the genome sequence of the related diploid Xenopus tropicalis provides a unique opportunity to study the genomic response to whole genome duplication.Entities:
Mesh:
Year: 2007 PMID: 17651506 PMCID: PMC1949811 DOI: 10.1186/1741-7007-5-31
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
Figure 1Four-fold synonymous transversion rates. (a) X. tropicalis-X. laevis mutual-best hits (LT MBH) show 4 DTv distances sharply peaked around 0.09 corresponding to the species divergence. The few hits in the high-end tail (4 DTv > 0.2) are due to the incompleteness of the gene sets and/or gene losses. The line marked LL doublets shows two-member clusters of recent (4 DTv < 0.15) X. laevis paralogs. Assuming uniform transversion rates across vertebrates, and dating the last common human-mouse ancestor at 75 Mya, the laevis-tropicalis and laevis-laevis divergence is ~50 and ~40 Mya, respectively. For comparison, paralogs from the much more ancient teleost duplication in zebrafish are also shown. After correcting for multiple transversions, the fish duplication is about eight times older than the X. laevis event, consistent with timings based on total synonymous substitution rates [13,14]. (b) 4 DTv distributions for orthologs in mouse-rat (red), mouse-human (blue), rat-human (green), and mouse-X. tropicalis (purple). Only orthologs supported by conserved synteny are considered. Using the same molecular clock as panel (a), the mammal-frog divergence is 350 Mya.
Figure 2Symmetric evolution of paralogs. Scatter plot of relative evolution between X. tropicalis peptides and their co-orthologous sequences in X. laevis. A total of 578 gene triples with 16 or more highly-conserved positions are shown (see text for details). L1 and L2 refer to co-orthologous genes 1 and 2 in X. laevis. The diagonal line represents a null model assuming symmetric evolution of L1 and L2. Black boxes are L1–L2 pairs inconsistent with this model at P < 0.01.
X. laevis paralogs show an enhanced rate of amino acid change relative to X. laevis-X. tropicalis orthologs.
| 0.0598(2) | 0.0973(5) | 0.615(4) | 0.118(1) | |
| 0.0563(3) | 0.0707(6) | 0.796(8) | 0.147(1) | |
| 94.1% | 72.7% | 129.4% | 124.6% | |
This is demonstrated using both the P-dist/4 DTv measure described in the text, and the conventional ratio of non-synonymous to synonymous substitution rates dN/dS. Both show a 25–30% enhancement in amino acid change. These results are derived from a 513 188 amino acid concatenated, gap-free, multiple sequence alignment (~250 aligned amino acid positions per gene) produced from the 2 135 triplets possessing at least a 50 amino acid aligned block.
Figure 3Normalized peptide to nucleotide evolutionary rates show an accelerated divergence of duplicated . The chart shows the ratio of peptide evolution (P-distance) to synonymous transversion rates (4 DTv), normalized by the human-mouse P-distance/4 DTV value of 0.242 ± 0.004, for three sets of multiple alignments corresponding to genes found in single copy in each of human, mouse, rat, and X. tropicalis, and two copies in X. laevis (sextuplets); pentuplets obtained by randomly selecting one X. laevis paralog from each sextuplet (5 A), and pentuplets in which only a single X. laevis sequence is known (5B).
Gene content in X. tropicalis genes within LLT-triplets compared to the reference set of all X. tropicalis genes with X. laevis orthologs.
| 123 | 46 | 29.11 | + | 0.06 | |
| 165 | 57 | 39.05 | + | 0.107 | |
| 90 | 10 | 21.3 | - | 0.146 | |
| 105 | 42 | 24.85 | + | 0.154 | |
| 24 | 14 | 5.68 | + | 0.44 | |
| 479 | 135 | 113.36 | + | 0.625 | |
| 309 | 57 | 73.13 | - | 0.787 | |
| 35 | 8 | 8.28 | - | ~1 | |
| 10 | 2 | 2.37 | - | ~1 | |
| 431 | 91 | 102.00 | - | ~1 |
Genes were categorized by assigning PANTHER classification terms to our X. tropicalis gene set by HMM scanning of the peptides and then grouping the terms into high-level categories of molecular function. We were able to assign categories to 6393 genes in the reference set and to 1513 genes in the subset. Out null hypothesis is that the second copy of genes following the X. laevis tetraploidization are lost in a random fashion. None of the 10 categories shown here show significant deviation from the null hypothesis, and the remaining molecular categories all have p-values close to 1. All p-values were Bonferroni-corrected for multiple tests.
Differential expression levels measured using the four largest X. laevis EST sets show that a significant fraction doublets show differential expression.
| EST library | N > = 16 | ||||
| ANE | 69917 | 9988 | 1092 | 130 | 53 |
| NIBBegast | 40476 | 5424 | 1199 | 40 | 20 |
| NICH_brain1 | 11005 | 1278 | 478 | 12 | 2 |
| XGC_Kid1 | 9662 | 1504 | 573 | 9 | 3 |
A total of 2070 matched pairs of antisense "probes" were computed as described in Methods, and applied to the EST data by in silico hybridization. Genes with 16 or more hits to ESTs were used to test the null hypothesis that expression levels are the same between paralogs. The four libraries are: ANE (anterior neuroectoderm) [36]; NIBBegast (early gastrulation; Kityama, A, Terasaka, C, Mochii, M, Ueno, N, Shin-i, T, unpublished results); NICH_brain1 (brain; NIH Mammalian collection, unpublished results); and XGC_kid1 (kidney; Heil, O, Neubert, P, Peters, M, Radelof, U, Schneider, D, Schroth, A, Korn, B, Landgrebe, J, unpublished results).
Figure 4Expression of specific . Panels depict the expression of skp1a (a-c), isocitrate dehydrogenase (isoD) (d-f), foxA1 (g-i), sorcin (j-l). X. laevis paralogs were arbitrarily assigned as a (a,d,g,j) or b (b,e,h,k) and are compared to the X. tropicalis ortholog (c, f, i, l). All views are lateral with anterior to the left. Embryos (a-f, j-l) are at stages 31 while embryos (g-i) are at stage 37–38. The arrowhead in (b) indicates kidney expression of skp1a in X. laevis paralog b that is not seen in the a paralog. Insets in (d) and (e) magnify somite expression revealing the differential expression between X. laevis paralogs ((d) with narrow expression, (e) with broad expression). The arrow in (g) highlights posterior expression of foxA1 seen in paralog a but absent in paralog b. The arrow in (k) indicates weak lateral expression of sorcin in X. laevis paralog b that is not seen in paralog a. X. tropicalis embryos are shown at a higher magnification than X. laevis embryos, reflecting their smaller size.
Figure 54DS distances identify genome duplication event. Histograms of the 4DS distances for the 9905 mutual highest scoring L-T pairs (blue line) as well as for the 3358 unambigous L-L pairs (red bars). The L-L pairs with 0.05 < 4DS < 0.25, peaking around 0.16, are selected as originating from the genome duplication event.
Description of triplets selected for in situ hybridizations
| Gene | Species paralog | Clone | Cut/transcribe |
| Skp1a | L1 | IMAGE:6946267 | |
| Skp1a | L2 | IMAGE:7202221 | |
| Skp1a | IMAGE:6995134 | ||
| Isocitrate dehydrogenase | L1 | IMAGE:3474748 | |
| Isocitrate dehydrogenase | L2 | IMAGE:5542876 | |
| Isocitrate dehydrogenase | IMAGE:6995129 | ||
| foxA1 | L1 | IMAGE:5572849 | |
| foxA1 | L2 | IMAGE:4203644 | |
| foxA1 | TGas068H09 | ||
| Sorcin | L1 | IMAGE:4957318 | |
| Sorcin | L2 | IMAGE:7204932 | |
| Sorcin | IMAGE:4461879 |