| Literature DB >> 25202520 |
Samuel E Fox1, Justin Preece1, Jeffrey A Kimbrel2, Gina L Marchini3, Abigail Sage1, Ken Youens-Clark4, Mitchell B Cruzan3, Pankaj Jaiswal1.
Abstract
PREMISE OF THE STUDY: We report the de novo assembly and characterization of the transcriptomes of Brachypodium sylvaticum (slender false-brome) accessions from native populations of Spain and Greece, and an invasive population west of Corvallis, Oregon, USA. • METHODS ANDEntities:
Keywords: Brachypodium sylvaticum; comparative genomics; de novo transcriptome; invasive species; simple sequence repeat (SSR); single-nucleotide polymorphism (SNP)
Year: 2013 PMID: 25202520 PMCID: PMC4105277 DOI: 10.3732/apps.1200011
Source DB: PubMed Journal: Appl Plant Sci ISSN: 2168-0450 Impact factor: 1.936
Statistics of data sets from the sequencing and de novo transcriptome assemblies of the Brachypodium sylvaticum samples from Corvallis (Brasy-Cor), Spain (Brasy-Esp), and Greece (Brasy-Gre) and their comparison with the transcriptome of B. distachyon Bd21 sequenced genome.
| Raw sequences | Assembled contigs | |||||
| Transcriptomes | No. of reads | Gb | Total no. | Longest sequence (bp) | Average length (bp) | Median length (bp) |
| Brasy-Cor | 120,443,086 | 12.16 | 120,091 | 16,713 | 1616 | 1380 |
| Brasy-Esp | 109,942,266 | 11.1 | 104,950 | 21,443 | 1696 | 1456 |
| Brasy-Gre | 127,927,820 | 12.92 | 177,682 | 15,289 | 1566 | 1317 |
| — | — | 31,029 | 14,577 | 1280 | 1086 | |
Fig. 1.Sequence comparisons. (A) Histogram of frequency distribution of contig lengths of Brachypodium sylvaticum samples from Corvallis (Brasy-Cor), Spain (Brasy-Esp), and Greece (Brasy-Gre) de novo transcriptome assemblies compared with B. distachyon cDNA lengths (X-axis has been truncated at 6 kb). The frequency distribution of the B. sylvaticum transcriptomes closely mirrors that of the B. distachyon transcriptome. (B) Comparisons of number of contigs and the average length of B. sylvaticum assembled contigs and B. distachyon transcripts. Although the overall number of contigs is far greater than the number of gene loci in B. distachyon, the average length observed in B. sylvaticum transcriptomes is larger than B. distachyon.
BLAST comparisons of Brachypodium sylvaticum transcriptomes against various databases. Shown are the number and percentage of contigs from B. sylvaticum samples from Corvallis (Brasy-Cor), Spain (Brasy-Esp), and Greece (Brasy-Gre) that hit a gene from the respective database. BLASTx comparisons were made to the GenBank nonredundant peptide database (nr), while BLASTn nucleotide comparisons were made against the B. distachyon, Oryza sativa, and Sorghum bicolor cDNA databases (E-value threshold cutoff of 1e-10−5).
| Brasy-Cor (120,091) | Brasy-Esp (104,950) | Brasy-Gre (177,682) | ||||
| Database compared | No. of hits | % hits | No. of hits | % hits | No. of hits | % hits |
| GenBank peptide (nr) | 96,140 | 80.1 | 86,791 | 82.7 | 149,178 | 84.0 |
| 103,752 | 86.4 | 93,291 | 88.9 | 160,309 | 90.2 | |
| 96,375 | 80.3 | 86,577 | 82.5 | 148,617 | 83.6 | |
| 93,751 | 78.1 | 84,023 | 80.1 | 144,884 | 81.5 | |
Fig. 2.Sequence comparisons between contigs from Brachypodium sylvaticum samples from Corvallis (Brasy-Cor), Spain (Brasy-Esp), and Greece (Brasy-Gre) and B. distachyon transcripts and genome. (A) B. distachyon genes hit by B. sylvaticum contigs. Greater than 87% of B. distachyon loci were hit by all three B. sylvaticum transcriptomes, and more than 96% were hit by a minimum of one contig from at least one B. sylvaticum transcriptome. (B) RPKM values averaged over 0.5 megabase intervals across the B. distachyon genome. This image shows the uniform coverage of B. sylvaticum reads aligned to the five B. distachyon chromosomes. Blue histograms indicate RPKM values averaged along the positive strand while green histograms indicate RPKM values of minus strand alignments.
Fig. 3.Comparison of SNPs identified by mapping sequence reads from Brachypodium sylvaticum genotypes from Corvallis (Brasy-Cor), Spain (Brasy-Esp), and Greece (Brasy-Gre) to the B. distachyon Bd21 reference genome. (A) Venn diagram showing the overlap of SNPs within the three B. sylvaticum genotypes. The numbers inside the Venn diagram exclude the 628 common sites exhibiting variation among the B. sylvaticum genotypes. We identified more than 157,835 nucleotide variants common among the three B. sylvaticum genotypes when compared to B. distachyon. Brasy-Gre had the greatest number of SNPs identified. (B) Genotype-specific SNPs were mapped to show the SNP density and chromosome-wide distribution against the B. distachyon genome. (C) Phylogenetic construction of a maximum likelihood tree based on the 628 common sites exhibiting variation among the B. sylvaticum genotypes indicates the proposed evolutionary relationship among the B. sylvaticum genotypes and the B. distachyon Bd21 reference genome.
SNP consequence predictions. After aligning all of the 394,654 SNPs from the Brachypodium sylvaticum samples from Corvallis (Brasy-Cor), Spain (Brasy-Esp), and Greece (Brasy-Gre) on the B. distachyon (Bd21) genome v1.0, we predicted the potential effects of SNP variant loci on the Bd21 genes. We identified a diverse set of consequences for the various gene loci.
| Predicted variant effect | No. of SNP sites |
| 3′ UTR variant | 46,871 |
| 3′ UTR variant, splice region variant | 48 |
| 5′ UTR variant | 6636 |
| 5′ UTR variant, splice region variant | 34 |
| Downstream gene variant | 231,969 |
| Initiator codon variant | 38 |
| Intron variant | 36,334 |
| Intron variant, splice region variant | 13,336 |
| Missense variant | 92,532 |
| Missense variant, splice region variant | 310 |
| Splice acceptor variant | 3222 |
| Splice donor variant | 5502 |
| Splice region variant, initiator codon variant | 1 |
| Stop gained | 234 |
| Stop gained, splice region variant | 3 |
| Stop lost | 117 |
| Stop lost, splice region variant | 2 |
| Stop retained variant | 180 |
| Synonymous variant | 185,986 |
| Synonymous variant, splice region variant | 958 |
| Upstream gene variant | 174,506 |
Characterization of the number of transitions and transversions that were predicted from our SNP analysis, showing a much higher prevalence of transitions over transversions as expected. The SNPs were identified in the Brachypodium sylvaticum samples from Corvallis (Brasy-Cor), Spain (Brasy-Esp), and Greece (Brasy-Gre).
| SNP | Substitution type | Brasy-Cor | Brasy-Esp | Brasy-Gre |
| A → C | Transversion | 14,208 | 13,836 | 14,695 |
| A → G | Transition | 43,635 | 42,985 | 43,135 |
| A → T | Transversion | 9,695 | 9,834 | 10,118 |
| C → A | Transversion | 11,998 | 12,241 | 11,459 |
| C → G | Transversion | 18,834 | 18,072 | 17,862 |
| C → T | Transition | 36,201 | 37,250 | 34,717 |
| G → A | Transition | 35,918 | 36,980 | 34,383 |
| G → C | Transversion | 18,962 | 18,103 | 17,813 |
| G → T | Transversion | 11,892 | 12,143 | 11,393 |
| T → A | Transversion | 9,647 | 9,970 | 10,039 |
| T → C | Transition | 44,121 | 43,302 | 43,790 |
| T → G | Transversion | 14,077 | 13,912 | 14,371 |
Summary of SSR sites identified in the transcriptomes of Brachypodium sylvaticum samples from Corvallis (Brasy-Cor), Spain (Brasy-Esp), and Greece (Brasy-Gre). We identified di-, tri-, tetra-, penta-, and hexanucleotide SSRs with a minimum of nine, six, six, five, and five repeat units, respectively. More than 20,000 SSRs were identified in the B. sylvaticum transcriptomes; trinucleotide repeats were the largest class of SSRs.
| Population | Total | Dimer | Trimer | Tetramer | Pentamer | Hexamer |
| Brasy-Cor | 23,535 | 6011 | 16,346 | 750 | 183 | 245 |
| Brasy-Esp | 20,303 | 5449 | 14,093 | 513 | 141 | 107 |
| Brasy-Gre | 32,847 | 8194 | 23,280 | 858 | 276 | 239 |