| Literature DB >> 20700453 |
Hideo Matsumura1, Kentaro Yoshida, Shujun Luo, Eiji Kimura, Takahiro Fujibe, Zayed Albertyn, Roberto A Barrero, Detlev H Krüger, Günter Kahl, Gary P Schroth, Ryohei Terauchi.
Abstract
We established a protocol of the SuperSAGE technology combined with next-generation sequencing, coined "High-Throughput (HT-) SuperSAGE". SuperSAGE is a method of digital gene expression profiling that allows isolation of 26-bp tag fragments from expressed transcripts. In the present protocol, index (barcode) sequences are employed to discriminate tags from different samples. Such barcodes allow researchers to analyze digital tags from transcriptomes of many samples in a single sequencing run by simply pooling the libraries. Here, we demonstrated that HT-SuperSAGE provided highly sensitive, reproducible and accurate digital gene expression data. By increasing throughput for analysis in HT-SuperSAGE, various applications are foreseen and several examples are provided in the present study, including analyses of laser-microdissected cells, biological replicates and tag extraction using different anchoring enzymes.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20700453 PMCID: PMC2917361 DOI: 10.1371/journal.pone.0012010
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Scheme of high-throughput SuperSAGE.
Details of the experimental procedure are described in Results and Material and Methods.
Summary of all the analyzed samples.
| Sample code | Sample name | Index seq | Number of total tags | Number of unique tags | Number of non-singleton tags |
| a | rice leaf(3-cyclePCR) |
| 353,524 | 51,314 | 18,956 |
| b | rice leaf (5-cycle PCR) |
| 517,891 | 69,055 | 24,706 |
| c | rice leaf (10-cycle PCR) |
| 295,439 | 45,846 | 16,790 |
| d | 5 PCR rice seedling-1 |
| 367,798 | 74,506 | 22,902 |
| e | 5 PCR rice seedling-2 |
| 483,836 | 78,205 | 26,379 |
| f | 5 PCR rice seedling-3 |
| 388,658 | 71,455 | 23,549 |
| g |
|
| 729,542 | 91,311 | 31,954 |
| h |
|
| 383,022 | 69,448 | 21,736 |
| i | rice pollen cells |
| 348,370 | 67,317 | 24,002 |
| j | rice anther wall tissue |
| 301,118 | 69,498 | 22,162 |
| k | rice mutant seedling (l |
| 537,192 | 92,133 | 29,377 |
| l |
|
| 420,326 | 84,014 | 27,586 |
| m | CM552 seedling (allele of |
| 311,581 | 63,781 | 21,120 |
| n | SG0807 seedling (allele of |
| 321,433 | 67,868 | 21,817 |
| o | rice germinating seed (c.v.Dunghan shali at low temp) |
| 489,818 | 91,408 | 28,873 |
| p | rice germinating seed (c.v. Kakehashi at low temp) |
| 92,410 | 29,779 | 7,214 |
| q | rice germinating seed (c.v.Dunghan shali ,submerged) |
| 394,443 | 65,955 | 22,510 |
| r | rice germinating seed (c.v. Kakehashi ,submerged) |
| 784,859 | 114,234 | 36,423 |
| u | zebrafish embryo 10.5h after fertilization |
| 484,471 | 85,922 | 26,788 |
| v | zebrafish embryo 12h after fertilization |
| 665,730 | 106,953 | 30,791 |
| w | zebrafish embryo 13.5h after fertilization |
| 582,844 | 113,931 | 29,656 |
| x | zebrafish embryo 15h after fertilization |
| 620,402 | 105,314 | 30,772 |
| y | zebrafish embryo 16.5h after fertilization |
| 466,332 | 82,625 | 24,741 |
| z | wild type rice (cv. Sasanishiki) leaf |
| 530,176 | 71,640 | 23,806 |
| ex1 |
|
| 456,718 | 71,591 | 25,389 |
| s1 | Arabidopsis leaves (NlaIII; CATG) |
| 399,106 | 67,949 | 23,903 |
| s2 | Arabidopsis leaves (DpnII; GATC) |
| 260,326 | 51,671 | 18,669 |
| s3 | Arabidopsis leaves (BfaI;CTAG) |
| 233,475 | 43,168 | 16,069 |
| t1 | Arabidopsis stems (NlaIII; CATG) |
| 303,549 | 63,738 | 23,318 |
| t2 | Arabidopsis stems (DpnII; GATC) |
| 351,677 | 66,435 | 25,207 |
| t3 | Arabidopsis stems (BfaI;CTAG) |
| 239,537 | 45,630 | 18,064 |
| Total | 13,115,603 |
Figure 2Influence of PCR cycle numbers on tag abundance.
Tags from mature rice leaf samples were ligated to adapter-2 sequences, separated into three tubes, and then three differently indexed adapters-1 (a, b and c, respectively) were ligated. The adapter-1 ligated fragments were PCR amplified for three (sample a), five (sample b) and ten cycles (sample c), and subsequently directly sequenced. After sequencing, tag abundance data was obtained for each sample. Individual tag counts are plotted for sample a versus b (A), and sample a versus c (B). Correlation coefficient in each plot is shown as inset (R2), and regression line is indicated as curved line due to the plot on logarithmic scale.
Figure 3Comparison of tag abundance between HT-SuperSAGE and original SuperSAGE.
Synthesized cDNAs from rice seedling RNA (sample f) and M. grisea-infected rice leaf sheath RNA (sample l) were divided into half and both applied to HT-SuperSAGE and original SuperSAGE, respectively, using 454-pyrosequencing. Obtained counts of individual tags from the two methods are plotted (panel A for sample f, and panel B for sample l). Correlation coefficient in each plot is shown as inset (R2), and regression line is indicated as curved line due to the plot on logarithmic scale.
Missing transcripts from abundantly expressed Arabidopsis genes studied by three anchoring enzymes (NlaIII, DpnII, BfaI) in leaf and stem tissues.
| Number of missing tags | |||
| NlaIII | DpnII | BfaI | |
| Leaf (sample s) | 64 | 81 | 188 |
| Stem (sample t) | 76 | 59 | 205 |
*Missing transcripts in 1000 of the most abundantly expressed genes in each Arabidopsis leaf or stem tissue samples (see text).