| Literature DB >> 27610280 |
Oliver W White1, Bethany Doo2, Mark A Carine3, Mark A Chapman2.
Abstract
PREMISE OF THE STUDY: Oceanic islands offer unparalleled opportunities to investigate evolutionary processes such as adaptation and speciation. However, few genomic resources are available for oceanic island endemics. In this study, we publish transcriptome sequences from three Macaronesian endemic plant species (Argyranthemum broussonetii [Asteraceae], Descurainia bourgaeana [Brassicaceae], and Echium wildpretii [Boraginaceae]) that are representative of lineages that have radiated in the region. In addition, the utility of transcriptome data for marker development is demonstrated. METHODS ANDEntities:
Keywords: Argyranthemum broussonetii; Descurainia bourgaeana; Echium wildpretii; Macaronesia; marker development; simple sequence repeats (SSRs); transcriptomics
Year: 2016 PMID: 27610280 PMCID: PMC5001862 DOI: 10.3732/apps.1600050
Source DB: PubMed Journal: Appl Plant Sci ISSN: 2168-0450 Impact factor: 1.936
Fig. 1.The Macaronesian archipelagos in the North Atlantic Ocean.
Fig. 2.Macaronesian endemics Argyranthemum sp. (A), Descurainia bourgaeana (B), and Echium wildpretii (C). Photos taken by O. White.
Summary statistics for the de novo assembled transcriptomes, BLASTX searches, and simple sequence repeat identification.
| Statistic | |||
| Raw reads | |||
| No. of raw reads | 3,950,166 | 3,958,226 | 3,550,703 |
| No. of trimmed reads | 3,842,373 | 3,774,119 | 3,375,554 |
| No. of normalized reads | 1,471,858 | 1,234,637 | 1,228,003 |
| Normalized reads, % | 38 | 33 | 36 |
| Assembly statistics | |||
| No. of genes | 80,620 | 44,287 | 58,526 |
| No. of transcripts | 94,522 | 54,221 | 69,509 |
| N50 all transcripts, bp | 921 | 1233 | 1041 |
| Median length all transcripts, bp | 411 | 549 | 446 |
| Average length all transcripts, bp | 654 | 833 | 707 |
| Total no. of assembled bases | 61,826,463 | 45,213,744 | 49,160,261 |
| BLASTX analyses | |||
| | 37,265 | 43,329 | 37,060 |
| | 39 | 80 | 53 |
| | 12,235 | 16,645 | 12,072 |
| | 36 | 50 | 36 |
| | 39,521 | 34,042 | 39,758 |
| | 42 | 63 | 57 |
| | 13,139 | 12,534 | 13,101 |
| | 38 | 36 | 38 |
| MISA statistics | |||
| Total no. of identified SSRs | 2282 | 1972 | 1284 |
| Total no. of SSR-containing transcripts | 2232 | 1919 | 1251 |
| No. of transcripts with >1 SSR | 50 | 51 | 31 |
| No. of SSRs in compound formation | 68 | 34 | 31 |
| No. of dinucleotide SSRs | 478 | 604 | 230 |
| No. of trinucleotide SSRs | 1097 | 1098 | 799 |
| No. of tetranucleotide SSRs | 589 | 183 | 191 |
| No. of SSR loci suitable for primer design | 1288 | 1219 | 737 |
BLAST parameters included an expectation value (E-value) of 1.0 × 10−20 or less.
Proportion of the A. thaliana (33,602) or S. lycopersicum (34,727) annotated coding sequences recovered from the target species.
SSR loci were excluded if they were located in the first or last 50 bp of the contig, if SSRs were in compound formation, or if there were multiple SSR loci per contig.
Fig. 3.Summary statistics for the three de novo transcriptome assemblies. (A) Number of genes and transcripts assembled for each species. (B) N50, mean, and median transcript length. (C) Transcript lengths for the three transcriptomes (note the change in bin size along the x axis).
Fig. 4.Principal coordinates analysis (PCoA) of 10 samples of Argyranthemum based on genotypic information from eight SSR loci.