| Literature DB >> 23496936 |
Andreas Petzold1, Kathrin Reichwald, Marco Groth, Stefan Taudien, Nils Hartmann, Steffen Priebe, Dmitry Shagin, Christoph Englert, Matthias Platzer.
Abstract
BACKGROUND: The African annual fish Nothobranchius furzeri has over recent years been established as a model species for ageing-related studies. This is mainly based on its exceptionally short lifespan and the presence of typical characteristics of vertebrate ageing. To substantiate its role as an alternative vertebrate ageing model, a transcript catalogue is needed, which can serve e.g. as basis for identifying ageing-related genes.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23496936 PMCID: PMC3605293 DOI: 10.1186/1471-2164-14-185
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
transcriptome libraries and sequencing methods
| 1 | one male GRZ, 9 weeks, whole body; normaliseda | Sanger | 131,808 | 734 | 97 |
| 2 | one male GRZ, 10 weeks, whole body; normaliseda | 454/Roche GS-FLX | 751,885 | 223 | 168 |
| 3 | one male GRZ, 10 weeks, whole body | 454/Roche GS-FLX | 415,426 | 225 | 94 |
| 4 | six male and female GRZ, 8 weeks, whole body; normalisedb | 454/Roche GS-Titanium | 2,351,236 | 369 | 869 |
| 5 | three GRZ, 1 week, whole body | Solexa/Illumina, 2 x 150 bp | 89,730,910 | 151 | 6,774 |
| 6 | five male GRZ, 5 weeks, skin | Solexa/Illumina, 2 x 101 bp | 55,410,042 | 101 | 5,596 |
| | | Solexa/Illumina, 76 bp | 33,947,044 | 76 | 2,579 |
| 7 | five male GRZ, 5 weeks, brain | Solexa/Illumina, 2 x 101 bp | 60,997,820 | 101 | 6,161 |
| | | Solexa/Illumina, 76 bp | 36,117,265 | 76 | 2,744 |
| 8 | three male GRZ, 14 weeks, skin | Solexa/Illumina, 76 bp | 35,963,921 | 76 | 2,733 |
| 9 | three male GRZ, 14 weeks, brain | Solexa/Illumina, 76 bp | 27,369,310 | 76 | 2,080 |
| 10 | four male MZM-0403, 5 weeks, skin | Solexa/Illumina, 76 bp | 31,727,935 | 76 | 2,411 |
| 11 | five male MZM-0403, 5 weeks, brain | Solexa/Illumina, 76 bp | 34,720,289 | 76 | 2,638 |
| 12 | five male MZM-0403, 31 weeks, skin | Solexa/Illumina, 76 bp | 30,268,649 | 76 | 2,300 |
| 13 | three male MZM-0403, 31 weeks, brain | Solexa/Illumina, 76 bp | 31,423,327 | 76 | 2,388 |
| 467,676,512 | 39,632 |
aNormalisation was done by Evrogen.
bNormalisation was done by Vertis.
Assembly metrics
| | | | ||
|---|---|---|---|---|
| Sanger, 454/Roche | Sanger, 454/Roche | Sanger, 454/Roche | PAVE contigs, Solexa/Illumina | |
| 134,225 | 141,973 | 118,795 | 213,621 | |
| 85.1 Mb | 78.1 Mb | 86.9 Mb | 252.9 Mb | |
| 634 bp | 549 bp | 731 bp | 1,183 bp | |
| 474 bp | 434 bp | 495 bp | 676 bp | |
| 9,221 bp | 7,841 bp | 9,241 bp | 64,116 bp | |
| 15,516 | 11,756 | 23,534 | 79,035 | |
| 24.9 Mb | 19.1 Mb | 38.3 Mb | 183.0 Mb |
a ‘First assembly’ shows the results of three assembly programs that were tested using Sanger and 454/Roche data. PAVE produced the best assembly with respect to total length, number of contigs and average contig length. b ‘Second assembly’: Solexa/Illumina data were assembled onto a backbone of PAVE contigs using an iterative assembly procedure outlined in Methods.
Figure 1Transcript contig length distribution. On the x-axis, contigs sorted according to their length in bins of 200 bp are shown. Black and grey bars on the logarithmic y-axis indicate the fractions of annotated (black) and unannotated (grey) transcript contigs, respectively. The majority of all contigs larger than 1,000 bp is annotated.
BLAST based annotation of transcript contigs
| Ensembl fish proteins | 98,941 | 38,355 | 73% | 37,739 (81%) | 92,857 (85%) |
| UniProt | 98,053 | 42,361 | 71% | 2,643 (6%) | 3,425 (4%) |
| NCBI nr proteins | 96,548 | 41,500 | 71% | 956 (2%) | 1,450 (1%) |
| Refseq human proteins | 80,671 | 18,071 | 61% | 305 (1%) | 497 (1%) |
| NCBI UniGene transcripts | 107,901 | 36,593 | 77% | 3,206 (7%) | 7,113 (6%) |
| Ensembl fish transcripts | 98,099 | 39,301 | 77% | 1,369 (3%) | 3,690 (3%) |
| Total | 122,177 | 46,218 (100%) | 109,032 (100%) |
Figure 2Fractions of putative CDS represented in the longest transcript contig per gene. Predicted CDS fractions were binned into deciles. The histogram bars show the number of respective transcript contigs (genes) per decile. The line shows the cumulative number. More than 70% of the protein-coding genes are represented by a transcript contig with a complete (> 90%) CDS.
BLASTx comparison of transcript contigs to the protein/gene annotation of four other fish genomes
| 24,661 / 19,686 | 27,576 / 20,787 | 23,118 / 19,602 | 41,478 / 26,095 | |
| 87,378 | 84,761 | 79,594 | 87,219 | |
| 19,272 / 17,173 | 20,534 / 17,795 | 18,613 / 16,744 | 22,669 / 18,620 | |
| 78% / 87% | 74% / 86% | 81% / 85% | 55% / 71% | |
| 73.5% | 74.6% | 73.7% | 67.5% | |
| 2,791 | 1,256 | 531 | 2,791 | |
| 914 / 914 | 690 / 690 | 356 / 356 | 942 / 942 | |
| 55.7% | 57.0% | 62.0% | 49.5% |
Summary of RNA-seq analysis
| | ||||||||
|---|---|---|---|---|---|---|---|---|
| Age (weeks) | 5 | 14 | 5 | 31 | ||||
| Organ | Skin | Brain | Skin | Brain | Skin | Brain | Skin | Brain |
| Reads | 33,947,044 | 36,117,265 | 35,963,921 | 27,369,310 | 31,727,935 | 34,720,289 | 30,268,649 | 31,423,327 |
| Mapped / Unique (%) | 58 / 46 | 42 / 37 | 53 / 43 | 41 / 36 | 65 / 52 | 54 / 48 | 64 / 53 | 52 / 46 |
| Transcribed genes | 17,350 | 17,638 | 17,456 | 17,535 | 17,170 | 17,590 | 17,375 | 17,644 |
| Median transcript level (rpkm a) | 2.3 | 4.7 | 2.6 | 5.7 | 2.5 | 5.4 | 3.5 | 5.8 |
| Range of transcript levels (rpkm) | 0.005 – 13,043 | 0.006 – 40,801 | 0.005 – 39,533 | 0.01 – 5,587 | 0.006 – 10,409 | 0.003 – 5,288 | 0.005 – 3,878 | 0.009 – 3,946 |
a Reads per Kilobase per Million mapped reads.
Figure 3Cluster heat map of transcript-level data. Correlation analysis of eight N. furzeri RNA-seq datasets, which were derived from skin and brain of young and old specimens of the short-lived strain GRZ and the longer-lived strain MZM-0403. Young age represents 5 weeks in both strains, whereas old age is reached at 14 weeks in GRZ and 31 weeks in MZM-0403. Analysis is based on Spearman correlation coefficients of log-transformed RPKM transcript levels. Tissues show the highest correlation, irrespective of age and strain.
Figure 4Principal Component Analysis of transcript levels. PCA results for all N. furzeri RNA-seq datasets are shown as biplots of the first vs. the second (A) and first vs. third component (B). The first component preferentially reflected the variation introduced by age whereas second and third components represented differences in strain and tissue, respectively.
Figure 5Validation of selected DEGs by qRT-PCR. Three up- and downregulated genes detected by RNA-seq (blue) were selected from skin (A) and brain (B), respectively, and qRT-PCR (red) was performed on the same RNA-samples. Except for VDRB, fold changes (log-transformed) did show good agreement between RNA-seq and qRT-PCR.
Figure 6Fold change differences between DEGs of the short- and the longer-lived strains. A scatterplot of fold changes (log-transformed) in the short-lived strain GRZ and the longer-lived strain MZM403 is shown. The 43 DEGs in skin are given in blue, and the 43 DEGs in brain in red. In (A), fold changes are shown as determined over the time span of 9 and 26 weeks for GRZ and MZM-0403, respectively. In (B), MZM-0403 fold changes were normalised to the GRZ time span of 9 weeks. P-values are calculated to test for differences in fold changes between GRZ and MZM-0403 using two-sample, paired Wilcoxon test. In contrast to standard fold changes (A), time-normalised fold changes (B) were significantly different between GRZ and MZM-0403 in skin and brain.