| Literature DB >> 28510893 |
Yul-Kyun Ahn1, Swati Tripathi2, Young-Il Cho2, Jeong-Ho Kim2, Hye-Eun Lee2, Do-Sun Kim2, Jong-Gyu Woo2, Myeong-Cheoul Cho2.
Abstract
BACKGROUND: Pepper, Capsicum annuum L., Solanaceae, is a major staple economically important vegetable crop worldwide. Limited functional genomics resources and whole genome association studies could be substantially improved through the application of molecular approach for the characterization of gene content and identification of molecular markers. The massive parallel pyrosequencing of two pepper varieties, the highly pungent, Saengryeg 211, and the non-pungent, Saengryeg 213, including de novo transcriptome assembly, functional annotation, and in silico discovery of potential molecular markers is described. We performed 454 GS-FLX Titanium sequencing of polyA-selected and normalized cDNA libraries generated from a single pool of transcripts obtained from mature fruits of two pepper varieties.Entities:
Keywords: Capsicum annuum; Molecular markers; Next generation sequencing; Simple sequence repeats; Transcriptome profiling
Year: 2013 PMID: 28510893 PMCID: PMC5430321 DOI: 10.1186/1999-3110-54-58
Source DB: PubMed Journal: Bot Stud ISSN: 1817-406X Impact factor: 2.787
454 Pyrosequencing results for the pepper ( ) varieties, Saengryeg 211 and Saengryeg 213
| 454 pyrosequencing terms | Saengryeg 211 | Saengryeg 213 |
|---|---|---|
| Reads | ||
| The Number of raw sequencing reads (n) | 361,671 | 274,269 |
| Bases of raw sequencing reads (bp) | 164,494,414 | 124,608,071 |
| The average of read length (bp) | 454,818 | 454,328 |
| Assembleda | 282,705 | 217,905 |
| Partialb | 50,330 | 33,427 |
| Singletonsc | 18,147 | 15,129 |
| Repeatd | 111 | 242 |
| Outliere | 5,075 | 3,699 |
| Too shortf | 5,273 | 3,831 |
| The number of bases, Q20 ≤ (bp)g | 93,272,495 | 70,812,087 |
| The number of reads, Q20 ≤ (n) | 269,198 | 205,166 |
|
| ||
| Number of isotigs (n) | 23,821 | 17,813 |
| Average count of contigs in the isotigs (n) | 1.401 | 1.317 |
| Largest count of contigs in the isotigs (n) | 12 | 16 |
| The number of isotigs assembled by one contig (n) | 18,363 | 14,556 |
| The total number of bases in the isotig (bp) | 20,787,054 | 14,965,217 |
| The average isotig size (n) | 872.636 | 840.129 |
| N50 isotig size (bp)h | 1,028 | 977 |
| The size of the largest isotig (bp) | 17,462 | 7,300 |
|
| ||
| Number of singletons after SeqClean (minimum length, 100) | 18,147 | 15,129 |
| Number of singletons after Lucy (minimum length, 100) | 17,054 | 14,310 |
| Number of valid singletons after Lucy | 16,958 | 14,244 |
aThe number of read’s bases used in the assembly computation.
bOnly part of the read was included in the assembly.
cThe read did not overlap with any other reads in the input.
dThe read deemed to be from repeat regions.
eThe read was identified by the GS De Novo Assembler as problematic.
fThe read was too short to be used in the computation.
gThe quality of read was higher than or equal to 20.
hHalf of all bases reside in contigs of this size or longer.
Figure 1A comparison of functional annotation result of (a) Saengryeg 211 (b) Saengryeg 213 pepper varieties, using BLAST.
Figure 2Classification of variety Saengryeg 211 transcripts into functional categories according to Gene ontology. (a) Biological Process (b) Cellular Component (c) Molecular Function.
Figure 3Classification of variety Saengryeg 213 transcripts into functional categories according to Gene ontology. (a) Biological Process (b) Cellular Component (c) Molecular Function.
Figure 4Frequency distribution of di- and tri-nucleotide repeats in motif sequences of Saengryeg 211 and Saengryeg 213 pepper varieties.