| Literature DB >> 24303059 |
Li-Qiang Tan1, Li-Yuan Wang, Kang Wei, Cheng-Cai Zhang, Li-Yun Wu, Gui-Nian Qi, Hao Cheng, Qiang Zhang, Qing-Mei Cui, Jin-Bo Liang.
Abstract
Despite the worldwide consumption and high economic importance of tea, the plant (Camellia sinensis) is not well studied in molecular biology. Under the few circumstances in which the plant is studied, C. sinensis flowers, which are important for reproduction and cross-breeding, receive less emphasis than investigation of its leaves or roots. Using high-throughput Illumina RNA sequencing, we analyzed a C. sinensis floral transcriptome, and 26.9 million clean reads were assembled into 75,531 unigenes averaging 402 bp. Among them, 50,792 (67.2%) unigenes were annotated with a BLAST search against the NCBI Non-Redundant (NR) database and 10,290 (16.67%) were detected that contained one or more simple sequence repeats (SSRs). From these SSR-containing sequences, 2,439 candidate SSR markers were developed and 720 were experimentally tested, validating 431 (59.9%) novel polymorphic SSR markers for C. sinensis. Then, a consensus SSR-based linkage map was constructed that covered 1,156.9 cM with 237 SSR markers distributed in 15 linkage groups. Both transcriptome information and the genetic map of C. sinensis presented here offer a valuable foundation for molecular biology investigations such as functional gene isolation, quantitative trait loci mapping, and marker-assisted selection breeding in this important species.Entities:
Mesh:
Year: 2013 PMID: 24303059 PMCID: PMC3841144 DOI: 10.1371/journal.pone.0081611
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Statistics for output of RNA-seq and reads assembly of the C. sinensis floral transcriptome.
| Sequences | Total Nucleotides(nt) | N50(bp) | Mean length (bp) | |
|---|---|---|---|---|
| Clean reads | 26,874,116 | 2,418,670,440 | / | 90 |
| Contigs | 453,071 | 62,697,738 | 113 | 138 |
| Scaffold | 103,918 | 33,852,032 | 424 | 326 |
| Unigene | 75,531 | 30,325,953 | 480 | 402 |
Figure 1Overview of the C. sinensis floral transcriptome assembly.
Summary of annotations for the C. sinensis floral transcriptome.
| Sequences (n) | Total annotations (n) | Unique annotations (n) | Functional classification | |
|---|---|---|---|---|
| All assembled unigenes | 75,531 | — | — | — |
| Gene annotations against NR Gene | 50,792 | 628,678 | 27,223 | — |
| Gene annotations against Swissprot | 31,730 | 383,703 | 23,749 | — |
| Gene annotations against KEGG | 21,929 | 21,928 | 21,929 | 119 pathways |
| Gene annotations against COG | 13,153 | 22,505 | 13,153 | 25 categories |
| GO annotations for NR protein hits | 22,568 | 117,338 | — | 3 main categories 44 sub-categories |
| All annotated Unigenes | 50,792 | — | — | — |
| Unigenes matching all four databases | 9,472 | — | — | — |
Figure 2Characteristics of Blast homology search of unigenes against the NR database.
(A) E-value distribution of the best matches for unigenes (E-value ≤ 1.0e-5). (B) Similarity distribution of the top BLAST matches for unigenes. (C) Species distribution is shown as the percentage of the total homologous sequences. We blasted total unigene sequences against all plant proteins in NR database and selected the best matches for analysis.
Figure 3Linkage map based on SSR markers for C. sinensis (LG1-4).
The map in the middle (LG1–LG15) is the consensus map constructed from the combined dataset. The map on the left (L1–L15) and right (B1–B15) are the maternal (Longjin43) and parental (Baihaozao) maps, respectively. Homology between these is depicted by lines with dots on both ends. Markers to describe both parents are bold (ab×cd or ef×eg), markers for Longjin43 only are blue (lm×ll), markers for Baihaozao only are green (nn×np). Italic markers are hk×hk segregation types. Grey bars in the consensus map indicate gaps longer than 20 cM. Distorted markers are indicated with *, **, ***, ****, ***** and ******, where P = 0.05, 0.01, 0.005, 0.001, 0.0005 and 0.0001, respectively.
Characteristics of each linkage group of the consensus map.
| Linkage groups | Map length (cM) | No. of markers | Average distance (cM) | No. of distorted markers |
|---|---|---|---|---|
| LG1 | 112.2 | 17 | 7.01 | 1 |
| LG2 | 105.7 | 22 | 5.03 | 3 |
| LG3 | 105.6 | 27 | 4.06 | 20 |
| LG4 | 103.1 | 17 | 6.44 | 1 |
| LG5 | 88.4 | 17 | 5.53 | 0 |
| LG6 | 85.1 | 21 | 4.26 | 0 |
| LG7 | 76.1 | 14 | 5.85 | 1 |
| LG8 | 70.4 | 10 | 7.82 | 3 |
| LG9 | 66.9 | 17 | 4.18 | 4 |
| LG10 | 64.7 | 10 | 7.19 | 0 |
| LG11 | 61.9 | 18 | 3.64 | 2 |
| LG12 | 59.9 | 8 | 8.56 | 0 |
| LG13 | 56.7 | 12 | 5.15 | 2 |
| LG14 | 53.2 | 16 | 3.55 | 1 |
| LG15 | 47.0 | 11 | 4.70 | 0 |
| Total/average | 1156.9 | 237 | 5.21 | 38 |