| Literature DB >> 21310031 |
José Blanca1, Joaquín Cañizares, Cristina Roig, Pello Ziarsolo, Fernando Nuez, Belén Picó.
Abstract
BACKGROUND: Cucurbita pepo belongs to the Cucurbitaceae family. The "Zucchini" types rank among the highest-valued vegetables worldwide, and other C. pepo and related Cucurbita spp., are food staples and rich sources of fat and vitamins. A broad range of genomic tools are today available for other cucurbits that have become models for the study of different metabolic processes. However, these tools are still lacking in the Cucurbita genus, thus limiting gene discovery and the process of breeding.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21310031 PMCID: PMC3049757 DOI: 10.1186/1471-2164-12-104
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Sequence statistics of Cucurbita 454 ESTs
| Library | Raw reads | Total | Sequence | Processed reads | Total | Sequence |
|---|---|---|---|---|---|---|
| Zucchini MU16 | 407,723/252 | 103 Mbp | 31 | 261,962/319 | 84 Mbp | 32 |
| Scallop UPV196 | 392,370/254 | 100 Mbp | 31 | 250,789/323 | 81 Mbp | 32 |
| TOTAL | 800,093/253 | 203 Mbp | 31 | 512,751/321 | 165 Mbp | 32 |
Summary of the Cucurbita pepo expressed sequences generated with two half runs of GS-FLX Titanium pyrosequencing. Statistics of raw reads and reads after processing are indicated.
Figure 1Length distribution of the . Data obtained after sequencing, with a half run of 454 GS FLX Titanium each one of the two Cucurbita cDNA libraries (Zucchini, Mu-16; Scallop, UPV-196), and processing the 454 raw reads, are presented.
Figure 2Distribution of number of ESTs in each .
Figure 3Length distribution of the .
Figure 4Number of GO terms (A) and GO level distribution (B) in the annotated Cucurbita unigenes. A. Distribution of GO terms in the annotated Cucurbita unigenes. B. GO level distribution in each category for the annotated Cucurbita unigenes.
Figure 5Number of . Cucurbita unigenes were classified into different functional groups based on a set of GO slims in the A) Biological Process category and B) Molecular Function category.
Functional annotation statistics
| Database | Number of unigenes | % | Number of |
|---|---|---|---|
| 11022 | 22,2% | 14880 | |
| Melon ICUGI | 12461 | 25,12% | 12976 |
Databases searched were: Arabidopsis and melon icugi [12,61]
Simple Sequence repeats (SSRs) statistics
| di-nucleotide repeat | Number of di-SSRs | % |
|---|---|---|
| AG | 225 | 76 |
| AT | 60 | 20 |
| AC | 11 | 4 |
| Total | 296 | 100 |
| tri-nucleotide repeat | Number of tri-SSRs | |
| AAG | 699 | 50 |
| AGC | 135 | 10 |
| ATC | 116 | 8 |
| AGG | 99 | 7 |
| AAT | 89 | 6 |
| Other tri-nucleotide repeats | ||
| (% ≤ 6 each one) | ||
| AAC, ACC, ACG,CCG, ACT | 249 | 19 |
| Total | 1387 | 100 |
| Tetra-nucleotide repeat | Number of tetra-SSRs | % |
| AAAT | 33 | 13 |
| AAAG | 31 | 12 |
| AATG | 24 | 10 |
| AATC | 21 | 8 |
| ATCC | 18 | 7 |
| AAAC | 17 | 7 |
| ACAT | 16 | 6 |
| Other tetranucleotide repeats | ||
| (% ≤ 6 each one) | ||
| ACTC,AACC,AAGG,ACAG,AGGC,AACG,AACT, | 92 | 37 |
| AATT,AGCC,AGCG,AAGC,AGGG,AGAT,ACGG, AGCT,AAGT,ACCC, ACCT | ||
| Total | 252 | 100 |
The number of di-, tri- and tetra-nucleotide repeats identified in the Cucurbita unigene dataset is shown for the complete set of putative SSRs.
Localization of SSRS with respect to putative initiation and termination codons in the Cucurbita unigene dataset
| N° | % | N° | % | N° | % | N° | % | |
| 5'-UTR | 86 | 29% | 172 | 12% | 102 | 41% | 360 | 19% |
| ORF | 72 | 24% | 903 | 65% | 89 | 35% | 1064 | 55% |
| 3'-UTR | 105 | 36% | 194 | 14% | 30 | 12% | 329 | 17% |
| Other | 33 | 11% | 118 | 9% | 31 | 12% | 182 | 9% |
| Total | 296 | 100% | 1387 | 100% | 252 | 100% | 1935 | 100% |
Unigenes were checked for the presence of the start and stop codons. "Other" means imprecise localization of the SSRs with respect to putative initiation or termination codons.
Single nucleotide polymorphism (SNPs) statistics
| SNPs | Number | SNPs | Number |
|---|---|---|---|
| Transitions | Transversions | ||
| A<->G | 6,694 | A<->T | 1,793 |
| C<->T | 6,902 | G<->T | 1,547 |
| C<->G | 1,548 | ||
| A<->C | 1,496 | ||
| Total | 13,596 (68%) | Total | 6,384(32%) |
Type and number of transition and transversions are shown for putative high quality single nucleotide polymorphism (SNPs) identified in the Cucurbita database.