Zhigao Liu1,2, Weili Shao2, Yamei Shen2, Mengcheng Ji2, Wenchao Chen2, Ying Ye2, Yongbao Shen1.
Abstract
BACKGROUND: Clematis is the largest genus in the family Ranunculaceae, with about 300 species. It is also a commercially important group of ornamental flowers worldwide, especially in the United States and European countries. The varied colors and shapes of its petals have earned the genus the name "Queen of the Vines". However, the genomic information and phylogenetic resolution of Clematis available from existing molecular studies are limited. In this paper, new microsatellite (SSR) markers were identified from transcriptome data of C. finetiana obtained using Illumina paired-end sequencing technology.
Keywords: Clematis finetiana; Marker development; Transcriptome sequencing; SSRs
Year: 2018 PMID: 29785177 PMCID: PMC5952850 DOI: 10.1186/s41065-018-0060-x
Source DB: PubMed Journal: Hereditas ISSN: 0018-0661 Impact factor: 3.271
Summary of transcriptome data for C. finetiana
| Item | Number |
|---|---|
| 1. Raw sequences and assembly statistics | |
| Total amount of clean reads (Mb) | 111.18 |
| Total amount of clean bases (Gb) | 11.12 |
| GC content percentage (%) | 42.35 |
| Clean reads proportion (%) | 99.91 |
| Total number of unigenes | 71,900 |
| Mean length of unigenes (bp), N50 (bp), GC content (%) | 865, 1469, 42.35 |
| 2. Statistics of unigene annotation | |
| Gene annotation against Nr (%) | 36,015 (50.09%) |
| Gene annotation against Swiss-Prot (%) | 23,982 (33.35%) |
| Gene annotation against KEGG (%) | 21,494 (29.89%) |
| Gene annotation against COG (%) | 14,022 (19.50%) |
| Gene annotation against GO (%) | 6,192 (8.61%) |
| Gene annotation against Interpro (%) | 27,004 (37.56%) |
| All annotated genes (%) | 38,814 (53.98%) |
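The percentages in the annotation rows appear to be each count divided by the total number of unigenes (71,900). A minimal Python sketch that recomputes them from the counts in the table follows; the variable and function names are illustrative and not taken from the paper.

```python
# Counts copied from the table above; names are illustrative only.
TOTAL_UNIGENES = 71900

annotated_counts = {
    "Nr": 36015,
    "Swiss-Prot": 23982,
    "KEGG": 21494,
    "COG": 14022,
    "GO": 6192,
    "Interpro": 27004,
    "All annotated": 38814,
}


def percent_of_unigenes(count, total=TOTAL_UNIGENES):
    """Return count as a percentage of all unigenes, rounded to 2 decimals."""
    return round(100.0 * count / total, 2)


annotation_percent = {
    db: percent_of_unigenes(n) for db, n in annotated_counts.items()
}

for db, pct in annotation_percent.items():
    print(f"{db}: {annotated_counts[db]} ({pct:.2f}%)")
```

Under this assumption, the recomputed values match the percentages printed in the table to two decimal places.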
Fig. 1 Length distribution of all unigenes obtained from the C. finetiana transcriptome. The x-axis indicates the sequence size range, and the y-axis indicates the number of unigenes in that size range
Fig. 2 Characteristics of the homology search for C. finetiana unigenes. a E-value distribution of the BLASTx hits against the Nr database. b Top-hit species in the similarity search of unigenes
Fig. 3 Annotation of the C. finetiana transcriptome. a GO classification of unigenes. The x-axis indicates the categories, and the y-axis indicates the number of unigenes. b COG classification of unigenes. The x-axis indicates the categories, and the y-axis indicates the number of unigenes
Fig. 4 Validation of expression levels of selected genes
Summary of microsatellite data of C. finetiana transcriptome
| Item | Number |
|---|---|
| Total number of sequences examined | 71,900 |
| Total size of examined sequences (bp) | 62,250,256 |
| Total number of identified SSRs | 7,532 |
| Number of SSR-containing sequences | 6,337 |
| Number of sequences containing more than 1 SSR | 961 |
| Number of SSRs present in compound formation^a | 568 |

^a An SSR locus containing at least 2 repeat motifs
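A few derived quantities follow directly from the counts in this table, such as the share of unigenes that carry an SSR and the average spacing between SSRs. The short Python sketch below works these out; the derived values are plain arithmetic on the table and are not quoted from the paper, and all names are illustrative.

```python
# Values copied from the microsatellite summary table; names are illustrative.
n_sequences = 71900          # sequences (unigenes) examined
total_bp = 62250256          # total size of examined sequences, in bp
n_ssrs = 7532                # identified SSRs
n_ssr_sequences = 6337       # sequences containing at least one SSR
n_multi_ssr_sequences = 961  # sequences containing more than one SSR

# Sequences that carry exactly one SSR.
n_single_ssr_sequences = n_ssr_sequences - n_multi_ssr_sequences

# Share of examined sequences that contain an SSR, in percent.
pct_sequences_with_ssr = 100.0 * n_ssr_sequences / n_sequences

# Average number of base pairs per SSR, and SSRs per megabase.
bp_per_ssr = total_bp / n_ssrs
ssrs_per_mb = n_ssrs / (total_bp / 1e6)

print(f"Sequences with exactly one SSR: {n_single_ssr_sequences}")
print(f"Sequences with an SSR: {pct_sequences_with_ssr:.2f}%")
print(f"Average spacing: one SSR per {bp_per_ssr / 1000:.2f} kb")
print(f"Density: {ssrs_per_mb:.1f} SSRs per Mb")
```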
Fig. 5 Frequency distribution of SSR repeat types. Motif types of di-nucleotides and tri-nucleotides are represented. The x-axis indicates the categories, and the y-axis indicates the number of the unigenes
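Counting di- and tri-nucleotide repeat types presupposes a scan of each unigene for tandem repeats. The sketch below shows one simple way to do such a scan with a regular expression in Python. It is not the tool or the parameters used in the study: the minimum repeat counts (6 for di-nucleotides, 5 for tri-nucleotides) and the function name `find_ssrs` are assumptions made purely for illustration.

```python
import re

# Hypothetical thresholds: motif length -> minimum number of tandem repeats.
# The source does not state which thresholds were actually used.
MIN_REPEATS = {2: 6, 3: 5}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(min_repeats.items()):
        # A motif of motif_len bases followed by at least min_rep - 1 copies.
        pattern = re.compile(
            r"(([ACGT]{%d})\2{%d,})" % (motif_len, min_rep - 1)
        )
        for match in pattern.finditer(seq):
            motif = match.group(2)
            # Skip homopolymer runs such as AAAAAA, which would otherwise
            # be reported as (AA)n or (AAA)n.
            if len(set(motif)) == 1:
                continue
            repeat_count = len(match.group(1)) // motif_len
            hits.append((match.start(), motif, repeat_count))
    return sorted(hits)


# Toy sequence with one (AG)7 and one (GAT)5 repeat.
example = "TTT" + "AG" * 7 + "CCGA" + "GAT" * 5 + "C"
ssr_hits = find_ssrs(example)
for start, motif, count in ssr_hits:
    print(f"({motif}){count} at position {start}")
```

Tallying the `motif` field of the hits across all unigenes would give a motif frequency distribution of the kind summarized in Fig. 5. Imperfect and compound repeats are not handled in this sketch.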
Fig. 6 Phylogenetic tree of 13 populations from 6 Clematis species based on 19 SSR loci and utilizing Nei’s genetic distances. The population abbreviations are the same as those in Additional file 1
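The caption does not say which of Nei's distance measures was used. As an illustration only, the Python sketch below computes Nei's standard genetic distance (1972), D = -ln(I), where I = J_xy / sqrt(J_x * J_y) and the J terms are sums over loci of allele-frequency products. The allele frequencies in the example are made up, and the function name is hypothetical.

```python
import math


def nei_standard_distance(pop_x, pop_y):
    """Nei's (1972) standard genetic distance between two populations.

    pop_x and pop_y are lists with one dict per locus, each mapping an
    allele label to its frequency in that population.
    """
    if not pop_x or len(pop_x) != len(pop_y):
        raise ValueError("populations must share the same non-empty loci")
    j_x = j_y = j_xy = 0.0
    for freqs_x, freqs_y in zip(pop_x, pop_y):
        j_x += sum(p * p for p in freqs_x.values())
        j_y += sum(q * q for q in freqs_y.values())
        j_xy += sum(p * freqs_y.get(a, 0.0) for a, p in freqs_x.items())
    if j_xy == 0.0:
        return math.inf  # no shared alleles at any locus
    identity = j_xy / math.sqrt(j_x * j_y)
    return -math.log(identity)


# Made-up allele frequencies at two SSR loci (allele sizes in bp as labels).
pop_a = [{"150": 0.6, "152": 0.4}, {"200": 1.0}]
pop_b = [{"150": 0.2, "152": 0.8}, {"200": 0.5, "204": 0.5}]

distance_ab = nei_standard_distance(pop_a, pop_b)
print(f"Nei's standard distance: {distance_ab:.4f}")
```

A matrix of such pairwise distances between populations is the usual input for distance-based tree building (for example UPGMA or neighbor joining). Which clustering method produced Fig. 6 is not stated in this section.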