| Literature DB >> 26540077 |
Tae-Sung Kim1, Sebastin Raveendar2, Sundan Suresh3,4, Gi-An Lee5, Jung-Ro Lee6, Joon-Hyeong Cho7, Sok-Young Lee8, Kyung-Ho Ma9, Gyu-Taek Cho10, Jong-Wook Chung11.
Abstract
The vetch (Vicia sativa) is one of the most important annual forage legumes globally due to its multiple uses and high nutritional content. Despite these agronomical benefits, many drawbacks, including cyano-alanine toxin, has reduced the agronomic value of vetch varieties. Here, we used 454 technology to sequence the two V. sativa subspecies (ssp. sativa and ssp. nigra) to enrich functional information and genetic marker resources for the vetch research community. A total of 86,532 and 47,103 reads produced 35,202 and 18,808 unigenes with average lengths of 735 and 601 bp for V. sativa sativa and V. sativa nigra, respectively. Gene Ontology annotations and the cluster of orthologous gene classes were used to annotate the function of the Vicia transcriptomes. The Vicia transcriptome sequences were then mined for simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) markers. About 13% and 3% of the Vicia unigenes contained the putative SSR and SNP sequences, respectively. Among those SSRs, 100 were chosen for the validation and the polymorphism test using the Vicia germplasm set. Thus, our approach takes advantage of the utility of transcriptomic data to expedite a vetch breeding program.Entities:
Keywords: 454 pyrosequencing; SNPs; SSRs; the vetch (Vicia sativa); transcriptome
Year: 2015 PMID: 26540077 PMCID: PMC4690033 DOI: 10.3390/genes6041164
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Summary of the functional annotation using the Gene Ontology (GO) approach for Vicia sativa unigenes by the classification (a); GO functional categories (b) and individual GO annotations (c) in between the Vicia sativa nigra and sativa transcriptome.
Gene Ontology (GO) annotation results assigned to functional categories in the Vicia sativa nigra and sativa transcriptomes.
| Functional Category | GO Annotations | Sativa | Nigra | ||
|---|---|---|---|---|---|
| No. of Unigenes | Proportion a | No. of Unigenes | Proportion | ||
| Biological process | Unclassified | 2194 | 0.29 | 1050 | 0.29 |
| Metabolic process | 1433 | 0.19 | 749 | 0.19 | |
| Response to stimulus | 1055 | 0.14 | 703 | 0.18 | |
| Biological regulation | 597 | 0.08 | 235 | 0.06 | |
| Cellular process | 553 | 0.07 | 271 | 0.07 | |
| Developmental process | 503 | 0.07 | 188 | 0.05 | |
| Establishment of localization | 358 | 0.05 | 210 | 0.05 | |
| Cell component organization | 122 | 0.02 | 61 | 0.02 | |
| Others | 827 | 0.11 | 405 | 0.10 | |
| Cellular component | Cell part | 5357 | 0.60 | 3227 | 0.62 |
| Unclassified | 1797 | 0.20 | 902 | 0.17 | |
| Organelle | 1203 | 0.14 | 714 | 0.14 | |
| Organelle part | 278 | 0.03 | 161 | 0.03 | |
| Macromolecular complex | 189 | 0.02 | 128 | 0.02 | |
| Extracellular region | 73 | 0.01 | 61 | 0.01 | |
| Others | 10 | 0.00 | 46 | 0.01 | |
| Molecular function | Catalytic activity | 3775 | 0.42 | 1836 | 0.41 |
| Binding | 2757 | 0.30 | 1207 | 0.27 | |
| Unclassified | 1317 | 0.15 | 759 | 0.17 | |
| Transporter activity | 604 | 0.07 | 361 | 0.08 | |
| Transcription regulator activity | 239 | 0.03 | 70 | 0.02 | |
| Structural molecule activity | 146 | 0.02 | 111 | 0.02 | |
| Enzyme regulator activity | 98 | 0.01 | 34 | 0.01 | |
| Others | 148 | 0.02 | 114 | 0.03 | |
a No. of singletons in a GO annotation/total No. of singletons.
Figure 2Cluster of orthologous groups (COG) classification between Vicia sativa nigra and sativa transcriptomes.
The distributions of simple sequence repeats (SSRs) identified from the Vicia sativa nigra and sativa transcriptomes.
| Repeat Type | SSR Motif | Sativa | Nigra | ||
|---|---|---|---|---|---|
| Count | Frequency a | Count | Frequency | ||
| AC/CA | 85 | 1.8% | 27 | 1.1% | |
| AG/GA | 205 | 4.4% | 58 | 2.4% | |
| AT/TA | 202 | 4.3% | 172 | 7.0% | |
| CG/GC | 4 | 0.1% | 0 | 0.0% | |
| CT/TC | 234 | 5.0% | 86 | 3.5% | |
| GT/TG | 31 | 0.7% | 39 | 1.7% | |
| Subtotal | 761 | 14.9% | 398 | 15.7% | |
| AAC/ACA/CAA | 137 | 2.9% | 46 | 1.8% | |
| AAG/AGA/GAA | 228 | 4.9% | 92 | 3.6% | |
| AAT/ATA/TAA | 67 | 1.4% | 105 | 4.1% | |
| ACC/CCA/CAC | 573 | 12.2% | 492 | 19.4% | |
| ACG/CGA/GAC | 34 | 0.7% | 3 | 0.1% | |
| ACT/CTA/TAC | 26 | 0.6% | 2 | 0.1% | |
| AGC/GCA/CAG | 63 | 1.3% | 14 | 0.6% | |
| AGG/GGA/GAG | 76 | 1.6% | 40 | 1.6% | |
| AGT/GTA/TAG | 15 | 0.3% | 0 | 0.0% | |
| ATC/TCA/CAT | 173 | 3.7% | 91 | 3.6% | |
| ATG/TGA/GAT | 228 | 4.9% | 68 | 2.7% | |
| ATT/TTA/TAT | 94 | 2.0% | 57 | 2.3% | |
| CCG/CGC/GCC | 60 | 1.3% | 35 | 1.4% | |
| CCT/CTC/TCC | 95 | 2.0% | 9 | 0.4% | |
| CGG/GGC/GCG | 35 | 0.7% | 6 | 0.2% | |
| CGT/GTC/TCG | 22 | 0.5% | 1 | 0.0% | |
| CTG/TGC/GCT | 124 | 2.6% | 8 | 0.3% | |
| CTT/TTC/TCT | 338 | 7.2% | 64 | 2.5% | |
| GGT/GTG//TGG | 907 | 19.4% | 739 | 29.2% | |
| GTT/TTG/TGT | 220 | 4.7% | 26 | 1.0% | |
| Subtotal | 3515 | 76.3% | 1898 | 75.1% | |
| Other (Tetra/Penta/Hexa) | Subtotal | 405 | 8.7% | 235 | 9.3% |
| Total | 4681 | 2531 | |||
a The proportion of a certain repeat type or motif out of total SSRs.
Figure 3Frequency distribution of SSRs of Vicia sativa nigra and sativa transcriptomes, by the repeat types (a) and the motifs (b).
Figure 4Empirical validations of the polymorphism of SSRs in Vicia sativa nigra and sativa using eight randomly-selected accessions. (a) Gel images showing the PCR products from the eight accessions for the representative SSR loci in Vicia sativa sativa; (b) those images for Vicia sativa nigra; (c) graph showing the overall average polymorphism index content (PIC) values; (d) average PICs by the SSR motif; (e) correlation between PIC and SSR repeat number. The numbers in gel images in (a,b) represent the names of the SSR loci; (c,d) n represents the number of SSR loci used to estimate PIC, and error bars indicate standard deviations; (d) different letters represent one(s) that is (are) significantly different from the others (determined by comparison of all pairs using the Tukey-Kramer HSD test); (e) p-values are determined by simple linear regression analysis.
Figure 5Distributions of single nucleotide polymorphism (SNPs) in Vicia sativa nigra and sativa transcriptomes by the types (a) and by the number of reads assembled in each SNP type (b).
Summary of SNPs discovered from the genome sequencer (GS) reference mapper.
| SNP Types | All Differences | HCD a | NOR b for All Differences | NOR for HCD | ||||
|---|---|---|---|---|---|---|---|---|
| Number | Percentage c | Number | Percentage | Number | Percentage | Number | Percentage | |
| A/C | 102 | 0.04 | 35 | 0.03 | 413 | 0.03 | 224 | 0.03 |
| A/G | 400 | 0.16 | 170 | 0.16 | 1974 | 0.15 | 1206 | 0.15 |
| A/T | 147 | 0.06 | 66 | 0.06 | 949 | 0.07 | 645 | 0.08 |
| C/A | 101 | 0.04 | 43 | 0.04 | 456 | 0.03 | 298 | 0.04 |
| C/G | 92 | 0.04 | 42 | 0.04 | 477 | 0.04 | 258 | 0.03 |
| C/T | 449 | 0.17 | 187 | 0.17 | 2168 | 0.16 | 1441 | 0.18 |
| G/A | 382 | 0.15 | 149 | 0.14 | 2064 | 0.16 | 1170 | 0.14 |
| G/C | 100 | 0.04 | 50 | 0.05 | 518 | 0.04 | 327 | 0.04 |
| G/T | 113 | 0.04 | 50 | 0.05 | 577 | 0.04 | 359 | 0.04 |
| T/A | 173 | 0.07 | 76 | 0.07 | 887 | 0.07 | 592 | 0.07 |
| T/C | 392 | 0.15 | 154 | 0.14 | 2016 | 0.15 | 1146 | 0.14 |
| T/G | 120 | 0.05 | 58 | 0.05 | 648 | 0.05 | 438 | 0.05 |
| Total | 2571 | 1.01 | 1080 | 1.00 | 13,147 | 0.99 | 8104 | 0.99 |
a Highly-confident difference; b NOR, number of reads; c percentage of a certain SNP type in total SNPs.
| Sample | Large Contig (Length ≥100 bp) | Singletons after Sequence Cleanings (SeqClean, Lucy) | Total Valid Unigenes (Isotigs c + Singletons) | ||||
|---|---|---|---|---|---|---|---|
| Contigs | Bases | ACZ a | N50 Contig Size b | Largest Contig Size | |||
| sativa | 2698 | 1,983.375 | 735.13 | 782 | 3849 | 31,504 | 34,202 |
| nigra | 837 | 503,641 | 601.72 | 619 | 3345 | 17,971 | 18,808 |
| Sample ( | Total No. of Reads | Total No. of Bases | Assembled | Partial | Singleton | Repeat | Singletons after SeqClean | Singletons after Lucy |
|---|---|---|---|---|---|---|---|---|
| sativa | 86,532 | 28,429.544 | 42,405 | 5923 | 34,938 | 24 | 31,744 | 31,504 |
| nigra | 47,103 | 16,060.539 | 24,242 | 2309 | 19,646 | 9 | 18,091 | 17,971 |
a Average contig size; b contig size means that half of all bases reside in contigs of this size or longer; c isotig includes transcript variant mainly from alternative splicing.