| Literature DB >> 26960153 |
Hui Li1, Defang Li1, Anguo Chen1, Huijuan Tang1, Jianjun Li1, Siqi Huang1.
Abstract
Kenaf (Hibiscus cannabinus L.) is an economically important natural fiber crop grown worldwide. However, only 20 expressed tag sequences (ESTs) for kenaf are available in public databases. The aim of this study was to develop large-scale simple sequence repeat (SSR) markers to lay a solid foundation for the construction of genetic linkage maps and marker-assisted breeding in kenaf. We used Illumina paired-end sequencing technology to generate new EST-simple sequences and MISA software to mine SSR markers. We identified 71,318 unigenes with an average length of 1143 nt and annotated these unigenes using four different protein databases. Overall, 9324 complementary pairs were designated as EST-SSR markers, and their quality was validated using 100 randomly selected SSR markers. In total, 72 primer pairs reproducibly amplified target amplicons, and 61 of these primer pairs detected significant polymorphism among 28 kenaf accessions. Thus, in this study, we have developed large-scale SSR markers for kenaf, and this new resource will facilitate construction of genetic linkage maps, investigation of fiber growth and development in kenaf, and also be of value to novel gene discovery and functional genomic studies.Entities:
Mesh:
Year: 2016 PMID: 26960153 PMCID: PMC4784950 DOI: 10.1371/journal.pone.0150548
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Output statistics of sequencing.
| Sample | Total raw reads | Total clean reads | Total clean nucleotides | Q20 (%) | N (%) | GC (%) |
|---|---|---|---|---|---|---|
| 1 | 52,922,062 | 48,326,998 | 4,349,429,820 | 97.70 | 0.00 | 46.05 |
| 2 | 51,859,144 | 47,656,960 | 4,289,126,400 | 97.77 | 0.00 | 45.42 |
| 3 | 51,881,472 | 47,594,544 | 4,283,508,960 | 97.7 | 0.00 | 45.51 |
Results of annotation of unigenes to protein databases.
| Sequence file | NR | NT | Swiss-Prot | KEGG | COG | GO | ALL |
|---|---|---|---|---|---|---|---|
| All-unigene.fa | 56,147 | 51,851 | 38,065 | 33,807 | 22,049 | 45,855 | 58,095 |
Fig 1Characteristics of similarity of All-unigene against NR database.
(A) E-value distribution of All-unigene. (B) Similarity distribution of All-unigene. (C) Species distribution of All-unigene.
Fig 2Gene Ontology (GO)classifications of All-unigenen.
The results are summarized in three main categories: Biological process, Cellular component and Molecular function.
Fig 3Clusters of orthologous groups(COG) classification of All-unigene.
Results of microsatellite search.
| Total number of sequences examined | 71318 |
| Total length of examined sequences (bp) | 81509256 |
| Total number of identified SSRs | 12886 |
| Number of SSR-containing sequences | 10892 |
| Number of sequences containing more than 1 SSR | 1673 |
| Number of SSRs present in compound formation | 545 |
Fig 4Dendrogram of 28 Kenaf varieties based on cluster analysis of 72 ploymorphic EST-SSR markers.