| Literature DB >> 23713999 |
Nunzio D'Agostino1, Tomek Golas, Henri van de Geest, Aureliano Bombarely, Thikra Dawood, Jan Zethof, Nicky Driedonks, Erik Wijnker, Joachim Bargsten, Jan-Peter Nap, Celestina Mariani, Ivo Rieu.
Abstract
BACKGROUND: Solanum dulcamara (bittersweet, climbing nightshade) is one of the few species of the Solanaceae family native to Europe. As a common weed it is adapted to a wide range of ecological niches and it has long been recognized as one of the alternative hosts for pathogens and pests responsible for many important diseases in potato, such as Phytophthora. At the same time, it may represent an alternative source of resistance genes against these diseases. Despite its unique ecology and potential as a genetic resource, genomic research tools are lacking for S. dulcamara. We have taken advantage of next-generation sequencing to speed up research on and use of this non-model species.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23713999 PMCID: PMC3680029 DOI: 10.1186/1471-2164-14-356
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1phenotypes in different habitats.S. dulcamara growing as a creeping plant in a dry and open dune environment (A) and as a climbing plant in a moist and shaded forest undergrowth environment (B) and growing with its roots permanently flooded in a wet environment (C).
Summary of cDNA RNAseq data sets used for assembly
| Mixed1 | GS-FLX titanium, single end, random primed, normalised | 799,937 | 786,113 | 297 |
| Mixed1 | GS-FLX titanium, single end, 3’ primed, normalised | 568,762 | 558,729 | 312 |
| Leaves1 | HiSeq2000, single end, random primed, 100bp, normalised | 127,318,285 | 107,557,660 | 96 |
| Stem & primoridia1 | HiSeq2000, single end, random primed, 50bp | 516,292,741 | 471,046,908 | 50 |
1 See Additional file 1: Table S1 for details.
Statistics of the transcriptome assembly
| Total # of contigs | 32,157 |
| Total # of unigenes | 24,193 |
| # consisting of single sequence | 20,308 |
| # consisting of multiple variants | 3,885 |
| Total sequence length (nt) | 43,277,997 |
| Average contig length (nt) | 1346 |
| Minimum contig length (nt) | 501 |
| Maximum contig length (nt) | 15047 |
| Median contig length (nt) | 1043 |
Figure 2Description of unigene cluster sizes. The distribution of all contigs over categories of unigenes with single or multiple variant is plotted.
Results of the ORF prediction analysis
| complete ORF | 11,760 |
| 5'truncated ORF1 | 8,608 |
| 3' truncated ORF2 | 3,405 |
| 5'and 3' truncated ORF3 | 2,923 |
| no good ORF4 | 5,461 |
| Total | 32,157 |
1 ORFs lacking the initiating ATG codon, but including a termination triplet.
2 ORFs including the ATG start codon, but lacking the stop codon.
3 ORFs having neither start nor stop codon.
4 Transcripts showing interspersed stop codons.
Figure 3Length distribution of proteins from (bittersweet), (tomato) and (potato).
Figure 4Venn diagram describing the distribution of orthologous gene groups and singletons (s) identified by clustering 164,689 protein sequences from five different plant species using OrthoMCL. For each species the total number of proteins sequences that were used as input (between brackets) and the total number of ortholog groups plus singletons are indicated.
Enrichment analysis of GO molecular function terms in selected OrthoMCL subgroups when compared to the whole dataset
| GO:0016301 | kinase activity | 1004 | 159 | 126.25 | 0.00086 |
| GO:0016772 | transferase activity. transferring phosphorus-containing groups | 1004 | 159 | 126.25 | 0.00086 |
| GO:0005215 | transporter activity | 534 | 89 | 67.15 | 0.00287 |
| GO:0016740 | transferase activity | 1737 | 243 | 218.42 | 0.02998 |
| GO:0003677 | DNA binding | 1063 | 159 | 73.97 | 5.2E-22 |
| GO:0001071 | nucleic acid binding transcription factor activity | 363 | 79 | 25.26 | 1.5E-20 |
| GO:0003700 | sequence-specific DNA binding transcription factor activity | 363 | 79 | 25.26 | 1.5E-20 |
| GO:0003676 | nucleic acid binding | 2017 | 209 | 140.36 | 1.6E-10 |
| GO:0005515 | protein binding | 2827 | 242 | 196.72 | 8.5E-5 |
| GO:0030234 | enzyme regulator activity | 165 | 21 | 11.48 | 0.0052 |
| GO:0005488 | Binding | 8086 | 591 | 562.68 | 0.0091 |
| GO:0003676 | nucleic acid binding | 2017 | 19 | 13.64 | 0.076 |
| GO:0030234 | enzyme regulator activity | 165 | 7 | 1.19 | 0.00018 |
| GO:0004518 | nuclease activity | 103 | 3 | 0.74 | 0.03818 |
| GO:0016788 | hydrolase activity, acting on ester bonds | 103 | 3 | 0.74 | 0.03818 |
| GO:0003676 | nucleic acid binding | 2017 | 20 | 14.54 | 0.07853 |
Summary of SSRs detected in the transcriptome
| Total number of identified SSRs: | 6,029 |
| Number of SSR containing sequences: | 5,156 |
| Number of sequences containing more than 1 SSR (c) | 742 |
| Mono-nucleotide (p1) | 3,060 |
| Di-nucleotide (p2) | 1,163 |
| Tri-nucleotide (p3) | 1,733 |
| Tetra-nucleotide (p4) | 42 |
| Penta-nucleotide (p5) | 9 |
| Hexa-nucleotide (p6) | 22 |
Figure 5A stacked bar chart showing for each class of SSR the percentage located in the CDS, 5’ UTR and 3’UTR.
Allele sizes of selected SSRs in various accessions
| A54750069-1 | UK | n.a. | 219 | 209+211 | 151 | 269+277 | 269 | 244 | 173 | 272+275 | 226+232 | 196 |
| 944750001-2 | UK | n.a. | 219 | 211+213 | 148 | 269+277 | 269 | 248+250 | 165+173 | 275+281 | 232+235 | 190 |
| A94750045 | NL | 231 | 213+219 | 207+209 | 148+151 | 269+277 | 260+269 | 244+248 | 165 | 275+281 | 226+232 | 193 |
| A94750066 | NL | 237+240 | 213 | 207+209 | 151 | 269+277 | 260+269 | 244 | 173 | 275 | 226+232 | n.a. |
| A94750135 | NL | 237 | 207+219 | 213 | 148+151 | 269 | 269+272 | 248 | 173 | 275+281 | 226+232 | 190+193 |
| A94750168 | NL | 240 | 213+219 | 211+213 | 151 | 269 | 260 | 244 | 173 | 272+275 | 232 | 193+196 |
| A6/005-1 | A | n.a. | 201+207 | 213 | 145+154 | 269 | 269+275 | 246+248 | 165+173 | 272+281 | 226+232 | 190+193 |
| | | | | | | | | | | | | |
| n. alleles | | 3 | 4 | 4 | 4 | 2 | 4 | 4 | 2 | 3 | 3 | 3 |
| n. homozygous | | 3 | 3 | 2 | 4 | 3 | 3 | 4 | 5 | 1 | 1 | 3 |
| n. heterozygous | 1 | 4 | 5 | 3 | 4 | 4 | 3 | 2 | 6 | 6 | 3 |
1 SSR type (see Table 5).
A: Austria; NL: The Netherlands; UK: United Kingdom; n.a.: not available.
Figure 6A genetic map of . The map is based on 225 markers: 12 linkage groups (chromosomes) were identified. The genetic position (in cM) with respect to the first marker in the linkage group is indicated. Marker codes SD1 to SD96 refer to SNP markers analysed with KASPar assays, marker codes SD103 to SD122 refer to SNP markers analysed with CAPS assays, marker codes starting with E/S##M## refer to AFLP markers (see http://wheat.pw.usda.gov/ggpages/keygeneAFLPs.html for standard nomenclature). All other marker codes refer to published markers from potato.
Figure 7Comparative analysis of the and tomato chromosomes.