| Literature DB >> 28630801 |
Ying Wang1,2, Kun Liu1,2, Shoubiao Zhou1,3, Jianwen Shao1,2.
Abstract
BACKGROUND: Desiccation-tolerant (DT) plants can recover full metabolic competence upon rehydration after losing most of their cellular water (>95%) for extended periods of time. Functional genomic approaches such as transcriptome sequencing can help us understand how DT plants survive and respond to dehydration, which has great significance for plant biology and improving the drought tolerance of crops. Boea clarkeana Hemsl. (Gesneriaceae) is a DT dicotyledonous herb. Its genomic sequences characteristics remain unknown. Based on transcriptomic analyses, polymorphic EST-SSR (simple sequence repeats in expressed sequence tags) molecular primers can be designed, which will greatly facilitate further investigations of the population genetics and demographic histories of DT plants.Entities:
Keywords: Boea clarkeana; Desiccation tolerance; EST-SSRs; Gene function annotation; Transcriptome
Year: 2017 PMID: 28630801 PMCID: PMC5474092 DOI: 10.7717/peerj.3422
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Summary of sequence assembly using Illumina sequencing.
| Sequence | Items | Value |
|---|---|---|
| Total raw reads | 110,834,050 | |
| Total clean reads | 104,021,494 | |
| Total clean nucleotides (nt) | 9,361,934,460 | |
| Q20 percentage (%) | 97.55 | |
| N percentage (%) | 0 | |
| GC percentage (%) | 45.43 | |
| Total number | 94,546 | |
| Total length (nt) | 46,012,409 | |
| Mean length (nt) | 487 | |
| Contig N50 (nt) | 1,075 | |
| Total number | 91,449 | |
| Total length (nt) | 148,176,175 | |
| Mean length (nt) | 1,620 | |
| Unigene N50 (nt) | 2,389 | |
| Distinct clusters | 55,888 | |
| Distinct singletons | 35,561 |
Figure 1The length distribution of unigenes.
Figure 2The species distribution of Nr annotations.
Figure 3Gene ontology classification of unigenes between B. hygrometrica and B. clarkeana.
GO functions are shown in the X-axis. The Y-axis shows the percentage of genes annotated with the GO function. RWC, relative water content; HD, hydrated. The data of B. hygrometrica came from Xiao et al. (2015).
Figure 4COG and KOG functional classification of unigenes.
The horizontal coordinates are functional classes of COG and KOG, and the vertical coordinates are numbers of unigenes in one class.
Figure 5The distribution of the most repeated nucleotide types.
Figure 6The distribution of SSRs of different lengths.
Characteristics of 17 polymorphic EST-SSR markers.
| Locus | Primer sequence 5’–3’ | Repeat motif | Size range (bp) | HWE | PIC | GenBank accession no. | |||
|---|---|---|---|---|---|---|---|---|---|
| BC1 | F:GCAGTTCTGTGCAGTACCATACAT | (TA)6 | 4 | 172–182 | 0.065 | 0.038 | 0.036* | 0.193 |
|
| R:GGCTTCTGATCAGGTTTCTGAAT | |||||||||
| BC2 | F:GAGATCCCAGATCCAGATCTTCT | (TC)6 | 3 | 160–164 | 0.038 | 0.023 | 0.192 n.s | 0.423 |
|
| R:AACATTAATGGAAACACGTCGTC | |||||||||
| BC3 | F:ATTCGCTCTCTTGGTATGACTGT | (TA)6 | 5 | 170–184 | 0.054 | 0.045 | 0.380n.s | 0.664 |
|
| R:CCCAATTTGAAGTGTTGCTTTAC | |||||||||
| BC4 | F:TATCAGCGTGTGTGAATAGTTGC | (TA)7 | 4 | 157–163 | 0.097 | 0.045 | 0.004** | 0.491 |
|
| R:TAACCTAAATTCGAATCCATCCA | |||||||||
| BC5 | F:CAAACTTGGCTTAATACCATTCG | (TG)9 | 3 | 119–125 | 0.079 | 0.083 | 0.713 n.s | 0.469 |
|
| R:CCATGATCATCTCTATTTCAGGC | |||||||||
| BC6 | F:CCTTAAGGAGATGCATTGTGAAT | (TC)9 | 3 | 159–169 | 0.000 | 0.000 | - n.c. | 0.299 |
|
| R:GTATGAAGGGCATCAACAATAGG | |||||||||
| BC7 | F:GCTGAAAGTTGGTGATTGCTAGT | (AT)9 | 4 | 166–178 | 0.120 | 0.125 | 0.087 n.s | 0.526 |
|
| R:AGTTATGTCTTCGCTTGCTTCAG | |||||||||
| BC8 | F:AACGTGAGAGTGCTAGTTCGGTA | (TGA)5 | 3 | 167–173 | 0.014 | 0.000 | 0.041* | 0.17 |
|
| R:TCTTCCTCACTTTATCATCCACG | |||||||||
| BC9 | F:AGAAGAGGTACGACAGTTTGCTG | (GCG)5 | 2 | 156–159 | 0.059 | 0.064 | 1.000 n.s | 0.195 |
|
| R:TTCACGTCCGAATTCTTAGTCTC | |||||||||
| BC10 | F:CACTGCACATAGAAGGAGGAGTT | (GCG)6 | 5 | 108–129 | 0.081 | 0.076 | 0.146 n.s | 0.581 |
|
| R:GTAATCGCCTACATGATTCATCC | |||||||||
| BC11 | F:CAGCAGTATGTCGGGATTATTTC | (TTTCT)4 | 2 | 123–133 | 0.000 | 0.000 | -n.c. | 0.155 |
|
| R:CCTCTGGTCATATTGCTGTTACC | |||||||||
| BC12 | F:AACAAGAGGGTCAGCTACAACAG | (CAGCAA)4 | 4 | 160–178 | 0.104 | 0.095 | 0.184 n.s | 0.549 |
|
| R:CAGCAATGGTATTAGCAGAGGAC | |||||||||
| BC13 | F:ACCTTGACGATCCTTCATCTTCT | (GGTGCG)4 | 6 | 132–174 | 0.124 | 0.095 | 0.161 n.s | 0.701 |
|
| R:TTATGTTCTCCATATCCGTCAGC | |||||||||
| BC14 | F:GGCAGCAATATAGCTCAAATACG | (GACAAG)4 | 4 | 170–188 | 0.196 | 0.083 | 0.000*** | 0.516 |
|
| R:ACCTGATCGTTCACAACTTCATC | |||||||||
| BC15 | F:TCTTATTCAACACAACAGCCTGA | (ATGATA)4 | 5 | 151–175 | 0.157 | 0.140 | 0.228 n.s | 0.528 |
|
| R:GCTGCAGTTGATAATGAGAAGGA | |||||||||
| BC16 | F:ACCAATGGTCTATATTTCAACGG | (ATTACT)4 | 6 | 149–179 | 0.132 | 0.125 | 0.174 n.s | 0.643 |
|
| R:GTGCCCCACATAGCTTCTATCTA | |||||||||
| BC17 | F:TGACGAGGCTTCTACAGAATGAG | (CATCCT)4 | 2 | 137–143 | 0.034 | 0.045 | 1.000 n.s | 0.186 |
|
| R:ACAAACAACAAGATGGGAATCAT |
Notes.
number of alleles per locus across all populations
expected heterozygosity (mean value)
observed heterozygosity (mean value)
polymorphic information content
Hardy-Weinberg equilibrium
After Bonferroni correction, significant departures from Hardy–Weinberg equilibrium are indicated by ∗P < 0.05, ∗∗P < 0.01, ∗∗∗P < 0.001. n.s., not significant; n.c., not calculated (Clarke & Gorley, 2001).
GO, COG and KEGG annotation of 17 SSR-containing ESTs.
| Unigene | Annotation | ||||
|---|---|---|---|---|---|
| GO | COG | KEGG | |||
| b. p. | c. c. | m. f. | |||
| BC1 | – | 6 | – | Function unknown | – |
| BC2 | 1 | – | – | – | Plant hormone signal transduction |
| BC3 | – | – | – | – | – |
| BC4 | 7 | – | 3 | General function prediction only | Plant hormone signal transduction |
| Signal transduction mechanisms | Plant-pathogen interaction | ||||
| Transcription | |||||
| Replication, recombination, and repair | |||||
| BC5 | – | – | – | – | – |
| BC6 | – | – | – | General function prediction only | – |
| BC7 | 2 | 2 | – | General function prediction only | Biosynthesis of secondary metabolites |
| Amino sugar and nucleotide sugar metabolism | |||||
| BC8 | 12 | 4 | 4 | General function prediction only | – |
| Signal transduction mechanisms | |||||
| Transcription | |||||
| Replication, recombination and repair | |||||
| BC9 | – | – | 2 | Secondary metabolites biosynthesis, transport and catabolism | Biosynthesis of secondary metabolites |
| BC10 | 2 | 4 | 1 | Cell cycle control, cell division, chromosome partitioning | Metabolic pathways |
| BC11 | – | – | – | – | – |
| BC12 | 10 | 3 | 2 | General function prediction only | Plant hormone signal transduction |
| Posttranslational modification, protein turnover, chaperones | |||||
| BC13 | – | – | – | Transcription | – |
| BC14 | – | – | – | – | – |
| BC15 | 9 | 5 | 2 | – | – |
| BC16 | – | – | – | – | – |
| BC17 | – | – | – | – | – |
Notes.
biological_process
cellular_component
molecular_function
The name of each unigene is replaced with the name of the EST-SSR maker it contains.
This table shows only the number of the GO terms in the ontology for unigenes. The details of the GO classification in three ontologies are shown in Table S1.
Figure 7The neutral test results for 17 primer pairs using F and H from 11 populations using LOSITAN.