| Literature DB >> 17506875 |
Ana C Fierro1, Raphaël Thuret, Laurent Coen, Muriel Perron, Barbara A Demeneix, Maurice Wegnez, Gabor Gyapay, Jean Weissenbach, Patrick Wincker, André Mazabraud, Nicolas Pollet.
Abstract
BACKGROUND: The western African clawed frog Xenopus tropicalis is an anuran amphibian species now used as model in vertebrate comparative genomics. It provides the same advantages as Xenopus laevis but is diploid and has a smaller genome of 1.7 Gbp. Therefore X. tropicalis is more amenable to systematic transcriptome surveys. We initiated a large-scale partial cDNA sequencing project to provide a functional genomics resource on genes expressed in the nervous system during early embryogenesis and metamorphosis in X. tropicalis.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17506875 PMCID: PMC1890556 DOI: 10.1186/1471-2164-8-118
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Xenopus tropicalis EST project statistics
| xthr | xtbs | xthr and xtbs | |
| Number of sequences reads obtained | 30272 | 27602 | 57874 |
| Number of clone sequences obtained | 16548 | 14901 | 31449 |
| Number of valid sequences | 26440 | 22345 | 48785 |
| Number of clones with valid sequences | 15540 | 12266 | 27806 |
| Number of clones with 5' and 3' EST | 9354 | 11486 | 20840 |
| Number of clones with 5' EST only | 4831 | 0 | 4831 |
| Number of clones with 3' EST only | 1355 | 780 | 2135 |
| Average trimmed EST length | 522 | 546 | 534 |
| Number of contigs | 4327 | 4002 | 8756 |
| Number of contigs groups | 497 | 289 | 842 |
| Number of contigs grouped | 1210 | 649 | 2209 |
| Number of unique contigs | 3117 | 3353 | 6547 |
| Nulmber of clones in contigs | 9616 | 9268 | 15642 |
| Number of singletons | 7958 | 4262 | 17018 |
| Number of clones in singletons | 5924 | 2998 | 12164 |
| Number of putative transcripts | 9538 | 6640 | 19553 |
| Max. assembled sequence length | 3028 | 3144 | 3028 |
| Average assembled sequence length | 732 | 782 | 745 |
| Max. assembled sequence size | 147 | 144 | 159 |
| Average assembled sequence size | 6 | 5 | 5 |
| Number of contigs containing | |||
| 1 | 509 | 97 | 1152 |
| 2 | 1779 | 2155 | 3730 |
| 3 | 452 | 362 | 905 |
| 4–5 | 587 | 545 | 1159 |
| 6–10 | 505 | 476 | 952 |
| 11–20 | 283 | 207 | 479 |
| 21–30 | 91 | 90 | 174 |
| 31–50 | 80 | 41 | 118 |
| 50–100 | 31 | 26 | 76 |
| >100 | 10 | 3 | 11 |
Figure 1Assessment of normalization effectiveness. Histogram showing the percentage of sequence matching each oligonucleotide used in the procedure of normalization by hybridization. Bars represent percentages of positive clones calculated before (grey bars) and after normalization (black bars). Data before normalization were obtained after partial sequencing of 1,989 cDNAs from xthr and 1,694 cDNAs from xtbs. Note the relatively high abundance of cell-type specific transcript such as gamma crystallin (crystallinG1) or neurogranin (underlined on the figure).
Figure 2Added-value of xthr and xtbs 5'ESTs. 5' cDNA sequences were compared to 4,908 complete X. tropicalis cDNA sequences from XGC and Sanger Institute. When an EST matched unambiguously (>95% id over more than 50 nt on the same orientation) one of these cDNAs, the position of its first residue (X axis) was plotted as a function of the cDNA size (Y axis). Each dot represents the result of an alignment. A position of 0 on the x axis indicates identical 5' ends between the EST and cDNA. Negative values indicate that the EST extends further 5', positive values superior to the cDNA length indicate that the EST extends further 3', and positive values inferior to the cDNA size indicate the 5' EST position relative to the cDNA.
Results of sequence comparisons
| All hits | 4426 | 51% | 5506 | 63% | 5447 | 62% | 7175 | 82% | 7865 | 90% | 8703 | 99% | 3877 | 44% | All hits |
| >= 90% | 1594 | 18% | 2753 | 31% | 744 | 8% | 864 | 10% | 7199 | 82% | 8371 | 96% | 3660 | 42% | >95% |
| 70 – 90% | 1705 | 19% | 1917 | 22% | 2963 | 34% | 3843 | 44% | 454 | 5% | 302 | 3% | 217 | 2% | 90 – 95% |
| < 70% | 1127 | 13% | 836 | 10% | 1740 | 20% | 2468 | 28% | 212 | 2% | 30 | 0% | 0 | 0% | < 90% |
| No similarity | 4330 | 49% | 3250 | 37% | 3309 | 38% | 1581 | 18% | 891 | 10% | 53 | 1% | 4879 | 56% | No similarity |
GO Molecular function classification.
| N | |
| All molecular function terms | 1775 |
| ->antioxidant activity | 14 |
| ->binding | 623 |
| -->calcium ion binding | 89 |
| -->carbohydrate binding | 7 |
| -->lipid binding | 10 |
| -->nucleic acid binding | 350 |
| --->DNA binding | 135 |
| ---->chromatin binding | 0 |
| ---->transcription factor activity | 31 |
| --->RNA binding | 40 |
| --->translation factor activity, nucleic acid binding | 58 |
| -->nucleotide binding | 51 |
| -->oxygen binding | 0 |
| -->protein binding | 52 |
| --->cytoskeletal protein binding | 25 |
| ---->actin binding | 23 |
| -->receptor binding | 28 |
| ->catalytic activity | 455 |
| -->hydrolase activity | 110 |
| --->nuclease activity | 9 |
| --->peptidase activity | 66 |
| --->phosphoprotein phosphatase activity | 6 |
| -->kinase activity | 56 |
| --->protein kinase activity | 33 |
| -->transferase activity | 112 |
| ->chaperone regulator activity | 0 |
| ->enzyme regulator activity | 37 |
| ->motor activity | 15 |
| ->nutrient reservoir activity | 0 |
| ->signal transducer activity | 48 |
| -->receptor activity | 13 |
| -->receptor binding | 28 |
| ->structural molecule activity | 398 |
| ->transcription regulator activity | 40 |
| ->translation regulator activity | 58 |
| ->transporter activity | 120 |
| -->electron transporter activity | 39 |
| -->ion channel activity | 11 |
| -->neurotransmitter transporter activity | 3 |
| ->triplet codon-amino acid adaptor activity | 0 |
Figure 3Digital expression profiles of . Each line gives the expression profile of a given transcript represented by a UniGene cluster. The expression is deduced from counting the occurence of ESTs derived from a given cDNA library. The level of expression is colour coded in blue shades, dark blue means evidence for high levels of transcripts and white means no evidence for expression. On the left, clusters of expression profiles are delineated by a vertical bar labelled with the associated characteristic domain of expression. On the right, the cluster name and its annotation (i.e. the corresponding gene product description as deduced from sequence similarity analysis) are given. Each column corresponds to a category of tissue or stage of development: 8 adult tissues and 6 stages of development. Note that a given category may correspond to several cDNA libraries. Here, only clusters for which evidence of differential expression were used to build the matrix of expression. This matrix was analysed by hierarchical clustering on the expression profile dimension using CLUSTER 3.0 as described in the methods section.
Highly expressed metamorphic transcripts.
| Xl.24656_a | 710 | 1405 | b;h;l;li;ot;sk;t;wb | R | ubiquitous | tail | |
| Xl.1115__b | 248 | 653 | et;h;he;l;ot;sk;t;wb | R | tail, limb | tail | |
| Xl.17432 | 560 | 4290 | b;et;ey;fb;h;he;k;l;li;lu;ot;ov;s;sk;t;te;th;wb | U | ubiquitous | - | eukaryotic translation factor 1 alpha, somatic form |
| Xl.5860 | 397 | 1008 | b;he;l;ot;s;sk;t;te;wb | U | ubiquitous | - | creatine kinase, muscle |
| Xl.11405 | 280 | 751 | b;et;l;t;wb | R | tail, limb, tailbud | tail | Calcium ATPase at 60A, cardiac muscle |
| Xl.24815 | 238 | 510 | b;l;t;th;wb | R | limb, tadpole | limb | myosin, heavy polypeptide 13, skeletal muscle |
| Xl.47042 | 227 | 468 | b;et;ey;fb;he;k;l;li;s;sk;t;th;wb | R | limb | metamorphosis | collagen, type I, alpha 1 |
| Xl.25492 | 217 | 833 | l;ot;sk;wb | R | limb, tadpole | limb metamorphosis | oncomodulin |
| Xl.7551 | 211 | 1333 | b;et;ey;fb;h;he;k;l;li;lu;ot;ov;s;sk;t;te;th;wb | U | ubiquitous | - | eukaryotic translation elongation factor 2 |
| Xl.28832 | 205 | 474 | et;h;h;l;t;wb | R | tail, limb | tail metamorphosis | actin alpha skeletal muscle |
| Xl.4138__a | 198 | 1376 | b;et;ey;fb;h;k;l;li;lu;ot;ov;s;sk;t;te;th;wb | U | ubiquitous | - | |
| Xl.29221_b | 98 | 644 | b;et;ey;fb;he;k;l;li;lu;ot;ov;s;sk;t;te;th;wb | U | thymus | - | |
| Xl.5146 | 168 | 360 | b;et;ey;fb;he;k;l;li;s;t;th;wb | R | limb | limb metamorphosis | freeze tolerance-associated protein FR47 |
| Xl.8842 | 164 | 1033 | b;et;ey;fb;h;he;k;l;li;lu;ot;ov;s;sk;t;te;th;wb | N | limb, thymus, spleen_PHA | - | ribosomal protein L3 |
| Xl.1032 | 158 | 243 | b;l;ot;sk;t;wb | R | tadpole | limb metamorphosis | fast skeletal troponin C beta |
| Xl.1055 | 151 | 288 | b;et;h;l;ot;sk;t;th;wb | R | limb, tailbud, tadpole | limb metamorphosis | myosin light chain 1, fast skeletal muscle isoform |
| Xl.24699 | 134 | 426 | b;ey;he;l;lu;t;th;wb | R | limb, heart | heart metamorphosis | actin alpha cardiac |
| Xl.7875 | 120 | 975 | b;et;ey;fb;h;he;k;l;lu;ot;ov;s;sk;t;te;th;wb | D | ubiquitous | - | solute carrier family 25 (mitochondrial carrier; adenine nucleotide translocator), member 5 |
| Xl.1728 | 120 | 884 | b;et;ey;fb;h;he;k;l;li;lu;ot;ov;s;sk;t;te;th;wb | N | limb, tadpole, olfactory epith. | - | acidic ribosomal protein P0 |
| Xl.8672 | 116 | 756 | b;et;ey;fb;h;he;k;l;li;lu;ot;ov;s;sk;t;te;th;wb | N | limb | - | ribosomal protein L4 |
| Xl.49126 | 115 | 272 | b;ey;fb;l;lu;ot;s;sk;t;te;wb | D | ?ubiquitous | limb metamorphosis | procollagen, type I, alpha 2 |
| Xl.995 | 113 | 440 | b;et;ey;fb;h;he;k;l;lu;ot;ov;s;sk;t;te;th;wb | N | limb | - | glyceraldehyde-3-phosphate dehydrogenase |
| Xl.3463 | 109 | 554 | b;et;ey;fb;h;he;k;l;li;lu;ot;ov;s;sk;t;te;th;wb | R | limb | metamorphosis | guanine nucleotide binding protein, beta 2, related sequence 1 |
| Xl.118 | 102 | 274 | b;et;fb;h;k;l;lu;ot;s;sk;t;wb | R | limb | metamorphosis | tropomyosin 1 alpha chain |
| Xl.1464 | 100 | 537 | fb;l;li;ot;s;sk;t;wb | R | tail, tadpole | tail | myosin, heavy polypeptide 4, skeletal muscle |
| Xl.395 | 94 | 479 | li;lu;th;wb | R | liver | liver | serum albumin 74 kDa |
| Xl.938 | 94 | 221 | b;ey;fb;l;ot;sk;t;te;wb | R | limb | limb metamorphosis | tropomyosin beta chain, skeletal muscle |
| Xl.23399 | 94 | 156 | b;l;ot;sk;t;wb | D | ?ubiquitous | tail metamorphosis | aspartyl beta-hydroxylase |
b: brain;et: embryonic tissue;ey: eye;fb: fat body;h: head;he: heart;k: kidney;l: limb;li: liver;lu: lung;ot: other;ov: ovary;s: spleen;sk:skin; t: tail;te: testis;th: thymus;wb: whole body. Allogenes are written in italics. U: ubiquitous; D: no differential; N: no restricted expression; R: restricted expression.
Figure 4Digital expression profiles of . Using the same representation as in Fig. 3 three clusters are depicted that are associated with differential expression at metamorphosis (top cluster), tadpole stage (middle) and in the forming limb (down).