Allison Piovesan1, Francesca Antonaros1, Pierluigi Strippoli1, Lorenza Vitale1, Maria Chiara Pelleri1, Maria Caracausi1.
Abstract
Caenorhabditis elegans is a nematode widely used as a model organism in biology and genomics. We provide an integrated, quantitative reference map for the transcriptome of whole, wild-type Bristol N2 strain C. elegans worms. The map was obtained by meta-analysis of 110 gene expression profiles available in the Gene Expression Omnibus (GEO) repository, integrated using the computational biology tool Transcriptome Mapper (TRAM). Following assignment of each probe to its corresponding locus and intra- and inter-sample normalization (in particular using the scaled quantile method), a mean consensus reference value is provided for 45,932 transcripts, along with its standard deviation. All expression values are mapped in the context of genomic coordinates. The map provides easy access to relationships among the expression values of different genes in this standard condition, highlights genomic segments with relatively high over-/under-expression, and may serve as a reference to test for gene expression variation, for both individual genes and the whole transcriptome, in specific biological conditions (e.g. mutated strains or differently grown worms).
Keywords: Adult worms; C. elegans; Gene expression; Meta-analysis; Transcriptome map
Year: 2019 PMID: 31440537 PMCID: PMC6700341 DOI: 10.1016/j.dib.2019.104152
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
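The abstract names a scaled quantile method for inter-sample normalization but does not describe it. As a rough illustration of the general idea only, the sketch below implements plain quantile normalization, which is a simpler, well-known relative and not TRAM's scaled variant. The function name and the toy values are hypothetical.

```python
def quantile_normalize(samples):
    """Plain quantile normalization of equal-length expression profiles.

    samples: list of lists, one list of expression values per sample.
    Returns new lists in which every sample shares the same distribution.
    NOTE: this is NOT the scaled quantile method used by TRAM.
    """
    n = len(samples[0])
    if any(len(s) != n for s in samples):
        raise ValueError("all samples must have the same number of values")

    # Reference distribution: mean of the k-th smallest value across samples.
    sorted_samples = [sorted(s) for s in samples]
    reference = [sum(s[k] for s in sorted_samples) / len(samples) for k in range(n)]

    normalized = []
    for s in samples:
        # Indices of the values in ascending order (ties keep input order).
        order = sorted(range(n), key=lambda i: s[i])
        out = [0.0] * n
        for rank, i in enumerate(order):
            out[i] = reference[rank]
        normalized.append(out)
    return normalized


# Toy data (hypothetical values, not taken from the map).
profiles = [[5.0, 2.0, 3.0], [4.0, 1.0, 6.0]]
normalized = quantile_normalize(profiles)
```

After this step each profile contains the same set of values, so differences between samples reflect only the rank of each transcript within its sample.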
The genomic segments significantly over-/under-expressed in the C. elegans transcriptome map. Over-expressed segments are listed first (rows 1 to 19), followed by under-expressed segments, whose numbering restarts at 1. Over-expressed genes are in bold; under-expressed genes are in bold and marked with an asterisk. A "+" or "-" sign after a gene name indicates a value above or below the genome median, respectively. For simplicity, segments whose over-/under-expressed gene content is fully included in a segment listed here are not shown.
| # | Chromosome | Segment Start | Segment End | Expression Value | q-value | Genes in the segment |
|---|---|---|---|---|---|---|
| 1 | chrIV | 11,330,001 | 11,350,000 | 2,308.35 | 0.00003906 | F54E12.2+ |
| 2 | chrIII | 10,970,001 | 10,990,000 | 2,113.98 | 0.00002970 | dhc-4- |
| 3 | chrV | 11,070,001 | 11,090,000 | 1,762.53 | 0.00002970 | |
| 4 | chrIV | 11,390,001 | 11,410,000 | 1,588.97 | 0.00010384 | tag-89- dsl-6- |
| 5 | chrIV | 7,470,001 | 7,490,000 | 1,482.27 | 0.00000041 | |
| 6 | chrIV | 11,320,001 | 11,340,000 | 1,223.15 | 0.00002952 | B0035.18+ B0035.6+ his-47+ |
| 7 | chrI | 2,060,001 | 2,080,000 | 1,035.73 | 0.00014806 | Y37E3.1+ rpb-10+ moag-4+ arl-13- |
| 8 | chrV | 8,880,001 | 8,900,000 | 1,029.50 | 0.00000060 | K06C4.1- |
| 9 | chrMT | 1 | 20,000 | 975.75 | 0.00000001 | |
| 10 | chrIII | 7,170,001 | 7,190,000 | 917.83 | 0.00043732 | acs-4+ srb-8- srb-7- |
| 11 | chrI | 10,550,001 | 10,570,000 | 899.64 | 0.00033432 | F25H2.4+ |
| 12 | chrX | 7,300,001 | 7,320,000 | 772.99 | 0.00019922 | sur-5+ |
| 13 | chrII | 13,810,001 | 13,830,000 | 755.25 | 0.00019922 | ZK131.11+ |
| 14 | chrV | 2,310,001 | 2,330,000 | 722.00 | 0.00005651 | Y19D10B.1- Y19D10B.6- |
| 15 | chrII | 8,560,001 | 8,580,000 | 634.59 | 0.00025710 | |
| 16 | chrIV | 5,050,001 | 5,070,000 | 618.41 | 0.00014806 | |
| 17 | chrIV | 8,330,001 | 8,350,000 | 611.61 | 0.00002612 | his-29+ |
| 18 | chrV | 8,530,001 | 8,550,000 | 604.67 | 0.00025710 | otpl-5- |
| 19 | chrIII | 9,740,001 | 9,760,000 | 528.29 | 0.00001188 | T05G5.1+ |
| 1 | chrIV | 5,870,001 | 5,890,000 | 5.03 | 0.00045937 | |
| 2 | chrI | 12,380,001 | 12,400,000 | 4.89 | 0.00338533 | gly-16- |
| 3 | chrV | 3,060,001 | 3,080,000 | 4.79 | 0.00257570 | srt-15- srt-16- |
| 4 | chrII | 3,690,001 | 3,710,000 | 4.73 | 0.00448442 | |
| 5 | chrIV | 5,860,001 | 5,880,000 | 4.57 | 0.00198431 | spe-27- |
| 6 | chrV | 15,300,001 | 15,320,000 | 4.56 | 0.00257570 | |
| 7 | chrIV | 9,480,001 | 9,500,000 | 4.35 | 0.00018020 | |
| 8 | chrV | 16,670,001 | 16,690,000 | 4.27 | 0.00338533 | str-61- F14F8.8- |
| 9 | chrI | 12,690,001 | 12,710,000 | 4.21 | 0.00001683 | |
| 10 | chrI | 13,100,001 | 13,120,000 | 3.94 | 0.00134398 | Y26D4A.21- C17H1.2- |
| 11 | chrV | 2,930,001 | 2,950,000 | 3.82 | 0.00257570 | C31B8.16- C31B8.1- srh-247- |
| 12 | chrV | 9,820,001 | 9,840,000 | 3.81 | 0.00134398 | |
| 13 | chrIV | 14,140,001 | 14,160,000 | 3.57 | 0.00134398 | srz-31- |
| 14 | chrV | 2,740,001 | 2,760,000 | 3.54 | 0.00002042 | |
| 15 | chrV | 16,680,001 | 16,700,000 | 3.35 | 0.00001183 | srw-44- |
| 16 | chrV | 16,460,001 | 16,480,000 | 3.23 | 0.00018020 | srh-142- T08G3.7- sru-44- |
| 17 | chrV | 2,940,001 | 2,960,000 | 3.21 | 0.00030016 | srw-143- srw-137- |
| 18 | chrV | 2,950,001 | 2,970,000 | 3.15 | 0.00198431 | srw-122- |
| 19 | chrV | 6,800,001 | 6,820,000 | 3.11 | 0.00134398 | |
| 20 | chrIV | 9,280,001 | 9,300,000 | 3.01 | 0.00040976 | nhr-267- |
| 21 | chrI | 13,110,001 | 13,130,000 | 2.73 | 0.00023681 | |
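The segments in the table above are 20,000 bp long and some overlap by half (for example chrIV 11,320,001 to 11,340,000 and 11,330,001 to 11,350,000), which suggests a sliding window shifted by 10,000 bp. The sketch below shows how a segment expression value could be computed as the mean of the genes it contains. The step size is inferred from the table, assigning each gene by a single representative position is an assumption, and the gene records are hypothetical.

```python
from collections import defaultdict

WINDOW = 20_000  # segment length seen in the table
STEP = 10_000    # assumed shift between consecutive segments


def segments_covering(position):
    """Return (start, end) of every window containing a 1-based position."""
    k_max = (position - 1) // STEP                 # last window starting at or before position
    k_min = max(0, -((WINDOW - position) // STEP)) # first window still reaching position
    return [(k * STEP + 1, k * STEP + WINDOW) for k in range(k_min, k_max + 1)]


def segment_means(genes):
    """genes: iterable of (chromosome, position, expression value).

    Returns {(chromosome, start, end): mean expression of the genes inside}.
    """
    totals = defaultdict(lambda: [0.0, 0])
    for chrom, pos, value in genes:
        for start, end in segments_covering(pos):
            acc = totals[(chrom, start, end)]
            acc[0] += value
            acc[1] += 1
    return {key: total / count for key, (total, count) in totals.items()}


# Hypothetical gene records, for illustration only.
genes = [("chrIV", 11_335_000, 100.0), ("chrIV", 11_345_000, 300.0)]
means = segment_means(genes)
```

With these toy records, the window chrIV 11,330,001 to 11,350,000 contains both genes, while each neighbouring window contains only one.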
List of the best predicted reference genes from the whole adult C. elegans quantitative transcriptome map. Chr = chromosome; SD = standard deviation.
| Gene name | Chr | Expression Value | Sample Number | SD as % of Expression | Description |
|---|---|---|---|---|---|
| rpl4 | chrI | 2603.77 | 102 | 19.85 | 60S ribosomal protein L4 |
| riok-3 | chrIII | 165.73 | 61 | 18.99 | Serine/threonine-protein kinase RIO3 |
| Y48G1C.1 | chrI | 149.21 | 55 | 19.88 | hypothetical protein |
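The "SD as % of Expression" column can be converted back to an absolute standard deviation, and a simple filter can shortlist candidate reference genes. The sketch below uses the three rows of the table above. The cut-offs (`max_sd_percent`, `min_samples`) are hypothetical and are not the selection criteria used by the authors, which are not stated here.

```python
# Rows copied from the table: name -> (mean expression, sample number, SD as % of mean)
reference_genes = {
    "rpl4": (2603.77, 102, 19.85),
    "riok-3": (165.73, 61, 18.99),
    "Y48G1C.1": (149.21, 55, 19.88),
}


def absolute_sd(mean, sd_percent):
    """Convert an SD given as a percentage of the mean into expression units."""
    return mean * sd_percent / 100.0


def is_stable(sd_percent, n_samples, max_sd_percent=20.0, min_samples=50):
    """Hypothetical filter: low relative SD, measured in enough samples."""
    return sd_percent <= max_sd_percent and n_samples >= min_samples


rpl4_sd = absolute_sd(reference_genes["rpl4"][0], reference_genes["rpl4"][2])
stable = [name for name, (_, n, sd_pct) in reference_genes.items()
          if is_stable(sd_pct, n)]
```

Tightening `max_sd_percent` or raising `min_samples` narrows the shortlist; the appropriate values depend on the experiment.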
Specifications table
| Subject area | |
| More specific subject area | |
| Type of data | |
| How data was acquired | |
| Data format | |
| Experimental factors | |
| Experimental features | |
| Data source location | |
| Data accessibility | |
| Related research article | |
- Reference table of a quantitative gene expression value for each of the 45,932 Caenorhabditis elegans transcripts, allowing the quantitative expression ratio of any pair of genes to be established immediately and global expression patterns to be analyzed with any gene expression profiling tool.
- Benchmark for identifying variation in individual gene expression values by comparison with profiles derived from worms in different biological conditions, e.g. different developmental stages, different feeding conditions or treatments, strains with knockdown of specific genes or with any other type of genetic difference.
- Possibility to select genes with desired expression features (high or low expression, high or low standard deviation from the mean among a large number of individuals, usefulness as a reference gene in gene expression studies).
- Possibility to select genomic segments with high or low expression values (the mean of the expression values of the genes contained in the segment), thus also identifying genomic open chromatin domains.
- The quantitative reference values of enzyme mRNAs might be used in metabolic network models to validate hypotheses about the relationships among mRNA levels, the corresponding enzymatic proteins and the quantities of their substrates or products obtained in metabolome experiments.
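The pairwise expression ratio and the comparison against another condition mentioned above reduce to simple arithmetic on the reference values. The sketch below uses two values from the reference-gene table; the "observed" value is hypothetical, and the comparison assumes it was normalized in the same way as the map.

```python
import math

# Reference values taken from the reference-gene table.
reference = {"rpl4": 2603.77, "riok-3": 165.73}


def expression_ratio(gene_a, gene_b, values):
    """Relative expression of gene_a over gene_b within one map."""
    return values[gene_a] / values[gene_b]


def log2_fold_change(observed, expected):
    """log2 ratio of a value in a test condition to the reference value."""
    return math.log2(observed / expected)


ratio = expression_ratio("rpl4", "riok-3", reference)

# Hypothetical measurement of riok-3 in a test condition (twice the reference).
observed_riok3 = 2 * reference["riok-3"]
change = log2_fold_change(observed_riok3, reference["riok-3"])
```

In the reference map, rpl4 is expressed at roughly 15.7 times the level of riok-3; a log2 fold change of 1 corresponds to a doubling relative to the reference value.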