| Literature DB >> 22931377 |
Elie Raherison1, Philippe Rigault, Sébastien Caron, Pier-Luc Poulin, Brian Boyle, Jukka-Pekka Verta, Isabelle Giguère, Claude Bomal, Jörg Bohlmann, John MacKay.
Abstract
BACKGROUND: Conifers have very large genomes (13 to 30 Gigabases) that are mostly uncharacterized although extensive cDNA resources have recently become available. This report presents a global overview of transcriptome variation in a conifer tree and documents conservation and diversity of gene expression patterns among major vegetative tissues.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22931377 PMCID: PMC3534630 DOI: 10.1186/1471-2164-13-434
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Development a large-scale oligonucleotide array for spruces (Picea spp): sequence information used to design oligonucleotide probes from Picea glauca and P. sitchensis sequences
| 1 | 11,214 | 35% | Pgl | Confirmed in Pgl and Psi cDNAs |
| 2 | 12,251 | 39% | Pgl | Confirmed by Pgl 454 seqs or Psi cDNAs |
| 3 | 4,840 | 15% | Psi | Confirmed by Pgl 454 seq |
| 4 | 1,629 | 5% | Pgl | Unconfirmed but Pgl clones ≥ 2 |
| 5 | 1,670 | 5% | Psi | Psi full-length cDNA only (not found |
| All | 31,604 |
1Each probe was designed based on the sequence of P. glauca (Pgl) or P. Sitchensis (Psi).
Analysis of probes: sequence similarity of probes aligned to Picea glauca and P. sitchensis sequences
| 67-70 (>95.7%) | 67-70 (>95.7%) | 17,279 | 54% |
| 67-70 (>95.7%) | NA | 11,999 | 38% |
| 67-70 (>95.7%) | 63-66 (>90%;<95%) | 923 | 3% |
| 63-66 (>90%;<95%) | 67-70 (>95.7%) | 529 | 2% |
| NA | 67-70 (>95%) | 869 | 3% |
Figure 1Interspecific comparison of hybridization intensities in secondary xylem.A-F: Pair-wise comparison of white spruce and six other species based on the number of shared positive probes indicated in the plots. The squared correlation coefficients (r2) are as follows 0.86 (A), 0.85 (B), 0.89 (C), 0.29 (D), 0.22 (E) and 0.27 (F). G: Analysis of signal intensity variation between species; the fold change (FC) was determined from the average normalized signal intensities (log2 scale). An FC of 1 or −1 represents a two-fold signal increase or decrease, respectively. For phelloderm results, see Additional file 4: Figure S3.
Figure 2Preferential expression in secondary vascular tissues of three spruce species. The FC data in the plot represent genes with differential expression in all three spruces. The scale is the log2 fold change.
RNA-Seq data
| Total reads (HQ) (millions) | 29.5 | 29.9 | 59.5 |
| Reads mapped (millions) | 17.3 | 20.4 | 37.7 |
| Genes - RPKM1 > 1 | 19,604 | 21,366 | 22,012 |
| Genes - RPKM > 3 | 16,168 | 18,297 | 19,108 |
1 RPKM, Number of reads per kilobase of mapped sequences per megabase based on Mortazavi et al. [24].
Validation of microarray results by RNA-Seq
| 0.05 | 0.5 | 5,666 | 5,592 | 99% | 4,181 | 75% | 4,155 | 99% | 776 | 19% |
| 0.05 | 1.0 | 2,614 | 2,588 | 99% | 2,265 | 88% | 2,253 | 99% | 677 | 30% |
| 0.01 | 0.5 | 5,526 | 5,466 | 99% | 3,608 | 66% | 3,585 | 99% | 542 | 15% |
| 0.01 | 1.0 | 2,608 | 2,582 | 99% | 2,171 | 84% | 2,160 | 99% | 491 | 23% |
1 DE positive genes determined by MA.
2 Number of DE genes detected by RNA-Seq among the DE genes found by MA hybridization; Tissue pref., indicates that the tissue preference for xylem or phelloderm was the same.
3 DE, differential expression in RNA-Seq, determined by a Chi-squared test (df = 1) with Benjamini-Hochberg correction for multiple testing.
4 RNA-Seq results for non-DE genes from MA (considering genes represented on the MA only).
Tissue specificity in RNA-Seq
| | ||||
|---|---|---|---|---|
| Genes RPKM > 1 | 44 | 16 (36%) | 186 | 79 (42%) |
| Genes RPKM > 3 | 17 | 6 (35%) | 81 | 41 (50%) |
The PiceaGenExpress database: sample characteristics, hybridizations and detected genes
| | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | Embryogenic cells | Pgl | a, a | 1 | 6 | SQ | 10,066 | 100 | 2,456 |
| 2 | Vegetative buds | Pgl | b, c | 2 | 10 | SQ | 12,361 | 128 | 3,216 |
| 3 | Xylem (Mature) | Pgl | d, d | 60 | 60 | SQ | 14,686 | 176 | 4,232 |
| 4 | Xylem (juvenile) | Pgl | e, f | 30 | 20 | SQ | 13,807 | 56 | 3,701 |
| 5 | Phelloderm | Pgl | e, f | 30 | 20 | SQ | 15,803 | 214 | 4,391 |
| 6 | Young needles | Pgl | e, f | 30 | 10 | SQ | 12,819 | 167 | 3,025 |
| 7 | Megagametophytes | Pgl | g, c | 3 | 3 | PA | 17,056 | 1,111 | 5,205 |
| 8 | Adventitious roots | Pab | h, c | 8 | 20 | PA | 15,718 | 393 | 4,696 |
| | | | | | Total detected | 21,241 | 2,345 | | |
| Not detected | 2,612 | ||||||||
1 Pgl: Picea glauca (White spruce); Pab: Picea abies (Norway spruce).
2 The source of materials and the sampling method are from the following studies: a, [43]; b, [37]; c, this paper, see methods; d, [23]; e, O-P seedlot, this paper see methods; f, [22]; g, C2856 parent from [38]; h, [44].
3 Total numbers of genotypes analyzed (either individually or in pools).
4 Microarray Scanner and Image Processing: SQ, ScanArray Express (Perkin Elmer, Waltham MA, USA) and QuantArray v3.0 (Packard BioChip Technologies, Billerica, MA, USA); PA, PowerScanner (Tecan Group Ltd., Männedorf, Switzerland) and ArrayPro Analyser v6.3 (Media Cybernetics, Bethesda, MD, USA).
5 Total: number of genes above background (see methods); Unique: genes detected only in one tissue.
Figure 3The PiceaGenExpress database reveals tissue preferential and conserved expression patterns within three gene families.A: Cellulose synthases. B: Photosystem I and II proteins. C: Ubiquitins. NA: Not detected. Tissues: B (Vegetative buds), F (Foliage), X-M (Xylem - mature), X-J (Xylem - juvenile), P (Phelloderm), R (Adventitious roots), M (Megagametophytes), E (Embryogenic cells).
Figure 4Expression classes and numbers of tissue of annotated and non annotated sequences.A-C: Number of annotated and nonannotated sequences per expression class for xylem from juvenile trees (A), roots (B) and young foliage (C). Other tissues are shown in Additional file 8: Figure S5. D: Number of tissues in which each annotated and non-annotated sequence was detected. Frequency, number of genes in a given intensity class or detected in a given number of tissues types.
Figure 5Gene expression patterns in two osmotic regulation protein families based on the PiceaGenExpress database.A: Dehydrins. B: Late Embryogenesis Abundant proteins. C: Distribution of Euclidean distances between the members of each protein family. The order of the bars is not representative of the order of the genes in panels A and B. A, B: Tissues: B (Vegetative buds), F (Foliage), X-M (Xylem - mature), X-J (Xylem - juvenile), P (Phelloderm), R (Adventitious roots), M (Megagametophytes), E (Embryogenic cells).
Figure 6Expression patterns of LTR retrotransposons based on the PiceaGenExpress database.A: Sequences containing protein domains of LTR retroelements. Pfam annotations: a: Integrase core domain; b: Retrotransposon gag protein; c: Retroviral aspartyl protease; d: Reverse transcriptase (RNA-dependent DNA polymerase); e: RNAse. Tissues: B (Vegetative buds), F (Foliage), X-M (Xylem - mature), X-J (Xylem - juvenile), P (Phelloderm), R (Adventitious roots), M (Megagametophytes), E (Embryogenic cells). B: Expression levels are not correlated with the number of genomic copies.