Massimo Iorizzo, Douglas A Senalik, Dariusz Grzebelus, Megan Bowman, Pablo F Cavagnaro, Marta Matvienko, Hamid Ashrafi, Allen Van Deynze, Philipp W Simon.
Abstract
BACKGROUND: Among next-generation sequencing technologies, platforms such as Illumina and SOLiD produce short reads but with higher coverage and lower cost per sequenced nucleotide than 454 or Sanger. A challenge now is to develop efficient strategies to use short-read platforms for de novo assembly and marker development. The scope of this study was to develop a de novo assembly of carrot ESTs from multiple genotypes using the Illumina platform, and to identify polymorphisms.
Year: 2011 PMID: 21810238 PMCID: PMC3224100 DOI: 10.1186/1471-2164-12-389
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1. Assembly strategy. B493 Sanger sequences were assembled using CAP3. Two assemblies were generated for Illumina sequences for each genotype (B493xQAL, B6274 and B7262). After left and right Velvet trimming, sequences were assembled using CAP3. A second assembly was carried out using the ABySS short read assembler. A final CAP3 assembly allowed comparison to be made among sequences from all 4 genotypes to generate a combined de novo assembly.
Figure 2. Contig distribution among genotypes.
Figure 3. Gene ontology distribution. Gene Ontology distribution of the carrot ESTs derived from Blast2GO. The results are summarized as follows: (A) molecular functions; (B) biological processes; (C) cellular components.
Results of BLASTN of anthocyanin reference sequences against EST consensus sequences in this study
| Gene family* | Reference sequence source | GenBank ID | Contigs with hits to reference |
|---|---|---|---|
| | | | Contig21709 |
| | | | Contig3962 |
| | | | Contig937 |
| | | | Contig28113 |
| | | | Contig21482 |
| | | | Contig524 |
| | | | Contig52163 |
| | | | Contig8294 |
| | | | Contig16173 |
| | | | Contig45955 |
| | | | Contig16173 |
| | | | - |
| | | | Contig52090 |
| | | | - |
| | | | - |
| | | | - |
| | | | Contig47011 |
| | | | Contig3218 |
| | | | Contig22451 |
| | | | Contig22451 |
| | | | - |
* Based on information from Hummer and Schreier and Boss et al. [4,23].
Previously unreported carrot anthocyanin genes
| Gene | Reference sequence source | Carrot EST contig | Contig length (nt) | e-Value | BLASTN identities |
|---|---|---|---|---|---|
| PAL2 | | Contig21482 | 2,553 | <1.00E-180 | 1,923/2,127 (90%) |
| PAL3 | | Contig28113 | 2,560 | <1.00E-180 | 1,940/2,122 (91%) |
| CA4H | | Contig524 | 1,798 | <1.00E-180 | 1,408/1,567 (89%) |
| 4CL1 | | Contig8294 | 1,814 | <1.00E-180 | 1,549/1,796 (87%) |
| 4CL2 | | Contig52163 | 1,937 | <1.00E-180 | 1,601/1,858 (87%) |
Figure 4. Intra- and inter-sample SNP distribution. Intra- and inter-sample polymorphism distributions of computationally detected SNPs within carrot genotypes (B493xQAL, B6274 and B7262) at a depth of sequence coverage of 20 or more. The two graphs report: (A) the distribution of intra- and inter-sample polymorphic SNPs within genotypes; (B) the distribution of intra- and inter-sample polymorphic SNPs within genotypes and of contigs containing SNPs. * M = intra-sample monomorphic, inter-sample polymorphic; P = intra- and inter-sample polymorphic. ** M+P = contigs containing both P and M SNP categories.
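The M and P categories in the Figure 4 legend can be illustrated with a minimal Python sketch. It is not the authors' SNP-calling pipeline: the function name `classify_snp`, the input dictionaries, and the rule that "inter-sample polymorphic" means the observed allele sets differ between samples are all assumptions made for illustration. Only the coverage threshold of 20 and the M/P definitions come from the caption.

```python
from typing import Dict, Set

MIN_DEPTH = 20  # coverage threshold stated in the Figure 4 caption


def classify_snp(alleles: Dict[str, Set[str]], depth: Dict[str, int]) -> str:
    """Return 'M', 'P', or 'unclassified' for one SNP position.

    alleles maps a sample name to the set of bases observed in that sample;
    depth maps a sample name to its read depth at the position.
    Both input structures are hypothetical.
    """
    # Keep only samples that reach the coverage threshold.
    covered = {s: a for s, a in alleles.items() if depth.get(s, 0) >= MIN_DEPTH}
    if len(covered) < 2:
        return "unclassified"  # an inter-sample comparison needs two samples

    # Intra-sample polymorphic: at least one sample shows more than one base.
    intra_polymorphic = any(len(a) > 1 for a in covered.values())
    # Inter-sample polymorphic (assumed rule): allele sets differ among samples.
    inter_polymorphic = len({frozenset(a) for a in covered.values()}) > 1

    if inter_polymorphic and not intra_polymorphic:
        return "M"  # monomorphic within samples, polymorphic between them
    if inter_polymorphic and intra_polymorphic:
        return "P"  # polymorphic both within and between samples
    return "unclassified"
```

For example, a position where B6274 shows only A and B7262 shows only G, both at depth 20 or more, would fall in category M under these assumptions.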
Evaluation of SSR primer pairs in four carrot genetic stocks used to develop the EST library
| PCR results | Number of primers | Percentage of all primers tested | Percentage of amplified primers |
|---|---|---|---|
| Tested | 114 | | |
| Amplified | 102 | 89.5 | |
| Single product | 99 | 86.8 | 97 |
| Single expected product | 75 | 65.8 | 74 |
| Single larger product | 24 | 21.1 | 24 |
| Multiple product | 3 | 2.6 | 3 |
| Polymorphic* | 26 | | |
* Out of 31 tested.
Evaluation of SNP primer pairs in four carrot genetic stocks used to develop the EST library
| PCR results | Number of primers | Percentage of all primers tested | Percentage of amplified primers | Percentage of amplified and sequenced primers |
|---|---|---|---|---|
| Tested | 354 | - | - | - |
| Amplified | 311 | 88 | - | - |
| Single product | 272 | 77 | 87 | - |
| Single expected product | 162 | 46 | 52 | - |
| Single larger product | 110 | 31 | 35 | - |
| Multiple product | 39 | 11 | 13 | - |
| Sequenced* | 258 | 73 | 83 | - |
| Polymorphic | 212 | 60 | 68 | 82 |
* Considering 162 single expected products and 96 single larger products with amplicon size <500 nt.
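Each percentage column in the SNP primer table uses a different denominator: all primers tested (354), amplified primers (311), or amplified and sequenced primers (258). The short Python sketch below recomputes the "Polymorphic" row from the counts in the table. The function name `polymorphic_rates` and the rounding to whole percentages are assumptions made for illustration.

```python
from typing import Tuple


def pct(part: int, whole: int) -> int:
    """Percentage of `part` in `whole`, rounded to the nearest integer."""
    return round(100 * part / whole)


def polymorphic_rates(tested: int, amplified: int, sequenced: int,
                      polymorphic: int) -> Tuple[int, int, int]:
    """Return the polymorphic rate against each of the three denominators."""
    return (
        pct(polymorphic, tested),     # share of all primers tested
        pct(polymorphic, amplified),  # share of amplified primers
        pct(polymorphic, sequenced),  # share of amplified and sequenced primers
    )


# Counts taken from the SNP primer table above.
rates = polymorphic_rates(tested=354, amplified=311, sequenced=258, polymorphic=212)
print(rates)  # (60, 68, 82)
```

The three values match the 60, 68 and 82 reported in the "Polymorphic" row.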