| Literature DB >> 18611272 |
Heesun Shin1, Martin Hirst, Matthew N Bainbridge, Vincent Magrini, Elaine Mardis, Donald G Moerman, Marco A Marra, David L Baillie, Steven J M Jones.
Abstract
BACKGROUND: We have applied a high-throughput pyrosequencing technology for transcriptome profiling of Caenorhabditis elegans in its first larval stage. Using this approach, we have generated a large amount of data for expressed sequence tags, which provides an opportunity for the discovery of putative novel transcripts and alternative splice variants that could be developmentally specific to the first larval stage. This work also demonstrates the successful and efficient application of a next generation sequencing methodology.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18611272 PMCID: PMC2474577 DOI: 10.1186/1741-7007-6-30
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
Figure 1454 ESTs mapping to . (A) Histogram showing the distribution of 454 expressed sequence tags (ESTs) mapping to Caenorhabditis elegans transcripts. Coordinate 0 on the x-axis represents the 5'-end of the transcripts. (B) Summary of 454 EST mapping result to the C. elegans transcriptome and genome. (C) Categorization of genomic 454 EST hits.
Figure 2The intergenic region on chromosomes with unique 454 expressed sequence tags. (A) The intergenic region on chromosome III with 49 unique 454 expressed sequence tags (ESTs). (B) The intergenic region on chromosome I with the most number of unique 454 ESTs (99) in this analysis. (C) The intergenic region on chromosome II with 62 unique 454 ESTs. The 454 EST clusters in the middle of these intergenic regions with black vertical bars represent deep EST coverage, and conservation of these regions between Caenorhabditis elegans and C. briggsae is shown. These ESTs may represent a novel gene or extension of the neighboring gene. Note that the genomic regions shown are not to the same scale.
Figure 3Examples of intronic and gapped expressed sequence tags. (A) An example of intronic expressed sequence tags (ESTs) showing 454 ESTs mapped to the gene, K09E2.3, which is added to a recent WormBase release (WS180) within the intron of K09E2.2. There are also other ESTs recently added that confirm K09E2.3. (B) An example of a gapped-EST suggesting alternative splicing or correction of the current gene model. Note that the genomic regions shown are not to the same scale.
Summary of gapped-expressed sequence tag matches and putative novel structures
| 98 | 480 | I | 390514 to 390558 | 391038 to 391059 | Skipped exon |
| 96 | 489 | I | 5761023 to 5761058 | 5761547 to 5761592 | Novel end exon confirmed* |
| 98 | 101 | I | 7642453 to 7642487 | 7642588 to 7642631 | Novel end exon |
| 98 | 62 | I | 9724862 to 9724908 | 9724970 to 9725025 | Alternate exon-intron boundary |
| 98 | 157 | I | 11931105 to 11931163 | 11931320 to 11931345 | Novel internal exon |
| 92 | 45 | II | 2782924 to 2782962 | 2783007 to 2783086 | Novel end exon confirmed* |
| 98 | 224 | II | 10828328 to 10828404 | 10828628 to 10828667 | Confirmed intron* |
| 91 | 461 | II | 13635576 to 13635653 | 13636114 to 13636163 | Confirmed intron* |
| 98 | 548 | II | 14747166 to 14747225 | 14747773 to 14747821 | Novel end exon |
| 98 | 306 | III | 8552492 to 8552538 | 8552844 to 8552868 | Alternate exon-intron boundary |
| 92 | 277 | III | 11751557 to 11751606 | 11751883 to 11751921 | Novel end exons confirmed* |
| 92 | 1364 | III | 12639534 to 12639558 | 12640922 to 12640967 | Novel internal exon |
| 98 | 926 | IV | 9350535 to 9350565 | 9351491 to 9351529 | Confirmed intron* |
| 97 | 900 | V | 1981345 to 1981373 | 1982273 to 1982364 | Novel end exons |
| 98 | 50 | X | 11824129 to 11824180 | 11824230 to 11824259 | Novel transcript/novel end exons |
* A recent WormBase release (WS180) has since confirmed these exon-intron boundaries; the original mapping analysis was performed using WB160.
Figure 4Comparative analyses of 3' splice sites, GC contents, and cross-species sequence conservation. (A) The conservation of consensus 3'-end splice sites (TTTTCAG) of confirmed transcripts and transcripts with exon-intron boundary 454 expressed sequence tag hits. (B) A comparison of 454 expressed sequence tags and Caenorhabditis elegans whole genome for guanine-cytosine content of different genomic regions. (C) Average ClustalW alignment score comparisons for intergenic regions with or without 454 expressed sequence tags for different numbers of orthologous sequences. (D) Chart showing 3524 unique Caenorhabditis elegans intergenic 454 expressed sequence tags mapped to C. briggsae, C. remanei, and C. brenneri.
Most highly conserved 454 expressed sequence tags loci in intergenic regions
| 1 | I:12824792..12825228 | 436 | 10 | 6 |
| 2 | I:1841201..1842444 | 1243 | 107 | 13 |
| 3 | II:11162247..11171658 | 9411 | 4258 | 5 |
| 4 | II:15226621..15230280 | 3659 | 763 | 13 |
| 5 | II:5956644..5961678 | 5034 | 612 | 16 |
| 6 | II:9773798..9774904 | 1106 | 150 | 9 |
| 7 | III:11170439..11171970 | 1531 | 377 | 17 |
| 8 | III:12130215..12152775 | 22560 | 7576 | 5 |
| 9 | III:1733092..1743238 | 10146 | 4249 | 10 |
| 10 | III:1743309..1752172 | 8863 | 1819 | 6 |
| 11 | III:2768655..2770841 | 2186 | 687 | 6 |
| 12 | III:3664472..3667602 | 3130 | 79 | 19 |
| 13 | III:4098100..4121436 | 23336 | 1088 | 8 |
| 14 | III:8935324..8944721 | 9397 | 1136 | 18 |
| 15 | IV:9934182..9935039 | 857 | 229 | 25 |
| 16 | V:1075619..1081451 | 5832 | 939 | 18 |
| 17 | V:1361139..1373418 | 12279 | 403 | 5 |
| 18 | V:1980421..1982526 | 2105 | 6 | 11 |
| 19 | V:5932735..5933473 | 738 | 180 | 5 |
| 20 | V:6336709..6357288 | 20579 | 1448 | 6 |
| 21 | X:15012843..15026225 | 13382 | 4845 | 8 |
| 22 | X:15616440..15620416 | 3976 | 928 | 7 |
| 23 | X:16927937..16944791 | 16854 | 1211 | 22 |
| 24 | X:837757..838677 | 920 | 315 | 6 |
Figure 5Examples of 454 ESTs mapped to known or predicted ncRNAs. (A), (B) Representative 454 expressed sequence tag data, which identify known non-coding RNAs. (C) The most conserved cross-species intronic 454 expressed sequence tags hit mapping to a RNAz non-coding RNA prediction. Note that the genomic regions shown are not to the same scale.
Non-coding RNA genes identified by 454 expressed sequence tags (incomplete)
| miRNA | Y105E8A.31 | I | Predicted | 3'-UTR of Y105E8A.16 |
| miRNA | F08F3.11 | V | Predicted | Intergenic |
| Non-coding RNA | F54D7.7 | I | Predicted | Overlap with 3'-UTR of F54D7.4 |
| 21URNA | C46G7.7 | IV | RNAs | Intergenic |
| 21URNA | C08F11.38 | IV | RNAs | Intron of C08F11.13 |
| 21URNA | T23G4.18 | IV | RNAs | Intergenic |
| 21URNA | T23G4.24 | IV | RNAs | Intergenic |
| 21URNA | F55B11.13 | IV | RNAs | Intergenic |
| 21URNA | Y105C5A.159 | IV | RNAs | Intergenic |
| 21URNA | Y51H4A.106 | IV | RNAs | Intergenic |
| Non-coding RNA | F09E10.10 | X | Expressed sequence tags | Intergenic |
| Non-coding RNA | C30E1.9 | X | Expressed sequence tags | Intergenic |
| RNA pseudogene | D1005.t1 | X | Predicted | Intergenic |
| RNA pseudogene | ZK380.t2 | X | Expressed sequence tags | Intergenic |
| rRNA | F31C3.11 | I | Expressed sequence tags | Intergenic |
| rRNA | F31C3.9 | I | Expressed sequence tags | Intergenic |
| snoRNA | R12E2.17 | I | RNAs | Intron of R12E2.3 |
| snoRNA | F25H5.9 | I | RNAs | Intron of F25H5.3 |
| snoRNA | T10B9.11 | II | RNAs | Intergenic |
| snoRNA | M106.6 | II | RNAs | Intron of M106.1 |
| snoRNA | H06I04.9 | III | Predicted | Intron of H06I04.4 |
| snoRNA | ZK643.9 | III | RNAs | Intergenic |
| snoRNA | Y43B11AR.7 | IV | Expressed sequence tags | Intron of Y43B11AR.4 |
| snoRNA | F17C11.14 | V | RNAs | Intron of F17C11.9 |
| snoRNA | K09E9.5 | X | mRNA | Intergenic |
Intergenic 454 expressed sequence tags overlapping with RNAz non-coding RNA predictions
| 062385_1158_0389 | I:9863279..9863316 | I:9863183..9863302 | I:9862287..9863587 |
| 004771_1566_3251 | II:15165861..15165910 | II:15165800..15165950 | II:15165619..15166104 |
| 074784_3939_2069 | III:11474420..11474530 | III:11474520..11474625 | III:11474411..11479792 |
| 313519_0405_2992 | IV:3168808..3168913 | IV:3168772..3168891 | IV:3155983..3169942 |
| 320686_1504_2153 | V:5440646..5440778 | V:5440615..5440736 | V:5438973..5446748 |
| 322033_3470_0718 | V:11801636..11801764 | V:11801572..11801690 | V:11800350..11803726 |
| 240717_0095_3913 | V:12296851..12296944 | V:12296911..12297022 | V:12293608..12302290 |
| 093179_3480_2283 | X:17010136..17010216 | X:17010094..17010200 | X:17008275..17013376 |
| 104650_3438_2367 | X:6961371..6961474 | X:6961430..6961548 | X:6956349..6967668 |
Intronic 454 expressed sequence tags overlapping with RNAz non-coding RNA predictions
| III:4689667..4689728 | III:4689566..4689715 | III:4689399..4690021 | |
| III:7160222..7160329 | III:7160256..7160375 | III:7159674..7160467 | |
| V:18041446..18041591 | V:18041383..18041481 | V:18041071..18041867 | |
| V:18041446..18041593 | V:18041501..18041624 | V:18041071..18041867 | |
| V:6881222..6881352 | V:6881154..6881264 | V:6881209..6881399 | |
| X:9091394..9091461 | X:9091460..9091597 | X:9090452..9092219 |