| Literature DB >> 24821410 |
Samuel E Fox1, Matthew Geniza2, Mamatha Hanumappa1, Sushma Naithani3, Chris Sullivan3, Justin Preece1, Vijay K Tiwari4, Justin Elser1, Jeffrey M Leonard4, Abigail Sage1, Cathy Gresham5, Arnaud Kerhornou6, Dan Bolser6, Fiona McCarthy7, Paul Kersey6, Gerard R Lazo8, Pankaj Jaiswal9.
Abstract
BACKGROUND: Triticum monococcum (2n) is a close ancestor of T. urartu, the A-genome progenitor of cultivated hexaploid wheat, and is therefore a useful model for the study of components regulating photomorphogenesis in diploid wheat. In order to develop genetic and genomic resources for such a study, we constructed genome-wide transcriptomes of two Triticum monococcum subspecies, the wild winter wheat T. monococcum ssp. aegilopoides (accession G3116) and the domesticated spring wheat T. monococcum ssp. monococcum (accession DV92) by generating de novo assemblies of RNA-Seq data derived from both etiolated and green seedlings. PRINCIPALEntities:
Mesh:
Year: 2014 PMID: 24821410 PMCID: PMC4018402 DOI: 10.1371/journal.pone.0096855
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Transcriptome assembly statistics.
| Transcriptome assemblies | Total number of reads | Number of Transcripts | Largest sequence (bp) | Average length (bp) | Median length (bp) |
|
| 435,806,374 | 87,972 | 21,251 | 1633 | 1393 |
|
| 435,806,374 | 82,185 | 13,427 | 1699 | 1460 |
|
|
|
|
|
| |
|
| 366,215,814 | 84,491 | 21,999 | 1579 | 1316 |
|
| 366,215,814 | 79,936 | 13,528 | 1624 | 1372 |
|
|
|
|
|
|
Transcriptome assembly statistics for T. monococcum ssp. monococcum (DV92) and T. monococcum ssp. aegilopoides (G3116) generated by Velvet/Oases. The statistics describe the sequence input to the assembler and the number of assembled transcripts and relative transcript length in base pairs (bp). The merged assembly is a feature of Oases that merges transcript isoforms into putative gene loci.
BLAST results.
| Target | Query | |||
| DV92 (120,911) | G3116 (117,969) | |||
|
|
|
|
| |
|
| - | - | 116,227 | 98.50% |
|
| 117,872 | 97.50% | - | - |
|
| 118,618 | 98.10% | 115,498 | 97.90% |
|
| 102,176 | 84.50% | 99,148 | 84.00% |
|
| 120,061 | 99.30% | 117,090 | 99.25% |
|
| 104,932 | 86.70% | 101,749 | 86.25% |
|
| 115,528 | 95.50% | 113,064 | 95.80% |
|
| 115,244 | 95.30% | 112,786 | 95.60% |
|
| 112,442 | 92.30% | 109,816 | 93.10% |
|
| 93,369 | 77.20% | 91,411 | 77.50% |
|
| 83,775 | 69.30% | 82,176 | 69.70% |
|
| 84,836 | 70.20% | 83,291 | 70.60% |
|
| 88,655 | 73.30% | 86,990 | 73.70% |
Source: *GigaBD; #Gramene; ∧Plant GDB GenBank release 175; § DFCI release 12.0.
BLASTn (E-value 1e−5) nucleotide sequence comparisons of T. monococcum ssp. Monococcum (DV92) and T. monococcum ssp. aegilopoides (G3116) transcripts against gene models and genomes from other sequenced grass species suggesting the coverage represented in the T. monococcum transcriptome.
Figure 1Mappings DV92 and G3116 transcripts to the sequenced Hordeum vulgare (barley) genome v1.0 (source: Gramene/Ensembl Plants).
(A) A hive plot shows comparison between Triticum monococcum accessions G3116 and DV92 vs. the barley genome. (B) A density plot view of the Ensembl Plants genome browser showing barley chromosome-1H karyotype view (track-1) with annotated barley genes (track-2; maroon) and the mapped G3116 transcripts (track-3; blue) and DV92 transcripts (track-4; red).
The coverage and mapping of T. urartu, A. tauschii and H. vulgare transcripts on DV92 and G3116 transcriptomes using BLASTn (E-value 1e−5).
| Target | Query | |||||
|
|
|
| ||||
| (Transcripts #34,879)* | (Transcripts #43,150)* | (Transcripts #62,240)∧ | ||||
|
|
|
|
|
|
| |
|
| 29,784 | 85.40% | 35,618 | 82.50% | 57,781 | 92.80% |
|
| 29,108 | 83.40% | 34,783 | 80.60% | 56,609 | 90.90% |
Source: *GigaBD; ∧Gramene.
The number of transcripts and percent of transcripts from each query that hit a transcript from DV92 and G3116 are shown.
Figure 2Analyses of the differentially expressed transcripts.
A scatter plot of light up- regulated (red colored) and down-regulated (green colored) transcripts from G3116 (A) and DV92 (B) accessions of T. monococcum. Each spot represents a single transcript. (C) The table lists counts of differentially expressed transcripts from the DV92 and G3116 accessions shown in the adjacent scatter plots and their barley homologs. The four-way Venn diagram shows the distribution of barley homolog counts with reference to the mapped light up-regulated (red shaded boxes) and light down-regulated (green shaded boxes) transcripts. (D) Barley homologs from various unique sets identified in the Venn diagram (C) and their selected molecular function enrichment.
Figure 3Genetic marker discovery.
Polymorphic sites identified in the transcriptome of DV92 (blue) and G3116 (red). (A) Number of SSR identified in the transcriptomes. (B) Number of SNPs identified in the two genotypes by aligning against the sequenced barley reference genome. 9,808 out of 340,250 common SNP sites have polymorphism between DV92 and G3116. (C) Mapping of common, variable 9,808 SNP and 148 SSR sites identified in the DV92 and G3116 transcriptomes on the karyotype view of the reference barley genome hosted by the Ensembl Plants. The SNP sites are shown as red colored density plot and SSR sites are depicted as black triangles along the length of the respective barley chromosomes.
Prediction of SNP variant consequence with reference to the annotated barley genome.
| Predicted variant effect | Number of SNP sites with consequences | Unique | ||
| DV92 | G3116 | DV92 | G3116 | |
| 3 prime UTR variant | 131,758 | 165,696 | 6,918 | 9,022 |
| 5 prime UTR variant | 86,389 | 127,854 | 4,450 | 9,371 |
| coding sequence variant | 21,545 | 30,920 | 2,422 | 3,704 |
| downstream gene variant | 328,112 | 440,765 | 19,120 | 26,060 |
| initiator codon variant | 364 | 507 | 22 | 49 |
| initiator codon variant, splice region variant | 6 | 8 | none | None |
| intergenic variant | 35,753 | 54,722 | 2,682 | 24,136 |
| intron variant | 46,901 | 111,413 | 4,717 | 14,217 |
| missense variant | 198,794 | 258,081 | 9,929 | 17,763 |
| missense variant, splice region variant | 1,145 | 1,866 | 89 | 188 |
| non coding exon variant, nc transcript variant | 7 | 11 | 1 | 1 |
| splice acceptor variant | 10,094 | 18,609 | 572 | 2,103 |
| splice donor variant | 18,433 | 34,503 | 1,145 | 3,985 |
| splice region variant, 3 prime UTR variant | 681 | 962 | 25 | 51 |
| splice region variant, 5 prime UTR variant | 685 | 1,137 | 50 | 80 |
| splice region variant, coding sequence variant | 136 | 272 | 15 | 39 |
| splice region variant, downstream gene variant | 2 | 3 | none | None |
| splice region variant, intron variant | 31,692 | 63,891 | 2,270 | 8,074 |
| splice region variant, synonymous variant | 3,500 | 5,083 | 176 | 448 |
| stop gained | 462 | 732 | 40 | 62 |
| stop gained, splice region variant | 4 | 8 | none | 2 |
| synonymous variant | 451,169 | 538,115 | 18,687 | 29,046 |
SNP Variant Consequence Prediction based on the T. monococcum SNPs identified by aligning the sequenced reads from DV92 and G3116 to the reference barley genome and the barley gene models (v1.0) available from Ensembl Plants database. Listed variant effect types are based on the categories adopted by the Ensembl Plants database.