| Literature DB >> 30740571 |
Yunhua Wang1, Nan Li1, Ting Chen1, Yiqing Gong1.
Abstract
A normalized full-length cDNA library was constructed from the coralloid roots of Cycas debaoensis by the DSN (duplex-specific nuclease) normalization method combined with the SMART (Switching Mechanism At 5' end of the RNA Transcript) technique. The titer of the original cDNA library was about 1.5 × 106 cfu·mL-1 and the average insertion size was about 1 kb with a high recombination rate (97%). The 5011 high-quality expressed sequence tags (ESTs) were obtained from 5393 randomly picked cDNA clones. Clustering and assembly of ESTs resulted in 2984 unique sequences, consisting of 618 contigs and 2366 singlets. EST sequence annotation revealed that 2333 and 1901 unigenes were functionally annotated in the NCBI non-redundant database and Swiss-Prot protein database, respectively. Functional analysis demonstrated that 1495 (50.1%) unigenes were associated with 4082 Gene Ontology (GO) terms. A total of 847 unigenes were grouped into 22 Cluster of Orthologous Groups (COG) functional categories. Based on the EST dataset, 22 ESTs that encoded putative receptor-like protein kinase (RLK) genes were screened. Furthermore, a total of 94 simple sequence repeats (SSRs) were discovered, of which 20 loci were successfully amplified in C. debaoensis. This study is the first EST analysis for the coralloid roots of C. debaoensis and provides a valuable genomic resource for novel gene discovery, gene expression and comparative genomics, conservation and management studies as well as applications in C. debaoensis and related cycad species.Entities:
Keywords: Coralloid root; Cycas debaoensis; Expressed sequence tags; SSRs; Symbiosis and defense; cDNA library
Year: 2018 PMID: 30740571 PMCID: PMC6224666 DOI: 10.1016/j.pld.2018.09.002
Source DB: PubMed Journal: Plant Divers ISSN: 2468-2659
Summary of EST sequencing and assembly results.
| EST sequences and contigs | Number |
|---|---|
| Number of EST sequences | 5011 |
| Number of Contigs | 618 |
| Number of singletons | 2366 |
| Average assembled EST length | 600.59 |
| Average number of sequences per contig | 4.23 |
| Number of contigs containing: | |
| 2 ESTs | 338 |
| 3 ESTs | 119 |
| 4∼5 ESTs | 89 |
| 6∼10 ESTs | 49 |
| 11∼20 ESTs | 17 |
| 21∼50 ESTs | 1 |
| 51∼100 ESTs | 3 |
| >100 ESTs | 2 |
Fig. 1Distribution of individual EST sequences among the clustered contigs.
Fig. 2Venn diagram of annotation results against Nr, Swiss-Prot, COG, and KEGG databases. The numbering each color block indicates the number of unigenes that is annotated by single or multiple databases.
Fig. 3GO analysis and functional classification of the C. debaoensis unigenes.
Fig. 4COG functional classification of the C. debaoensis unigenes.
Estimation of gene expression: unique EST sequences with >10 ESTs.
| Putative protein identification | Number of ESTs | Number of unique ESTs |
|---|---|---|
| Metallothionein-like protein EMB30 | 470 | 90 |
| Protein DJ-1 homolog D | 101 | 26 |
| Antifungal protein ginkbilobin-2 | 55 | 17 |
| Germin-like protein 9-3 | 50 | 19 |
| Ubiquitin-conjugating enzyme E2 28 | 42 | 10 |
| Glycine cleavage system H protein, mitochondrial | 38 | 6 |
| WAT1-related protein At5g07050 | 23 | 4 |
| Glucoamylase | 23 | 5 |
| Protein early responsive to dehydration 15 | 17 | 1 |
| Chitotriosidase-1 | 15 | 7 |
| High mobility group B protein 1 | 14 | 2 |
| Clavaminate synthase-like protein At3g21360 | 14 | 6 |
| Subtilisin-like protease SDD1 | 13 | 4 |
| Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 13 | 13 |
| Probable aquaporin PIP2-8 | 12 | 7 |
| Flavanone 3-dioxygenase | 11 | 3 |
| Small nuclear ribonucleoprotein E | 10 | 1 |
| Probable aquaporin PIP1-5 | 10 | 1 |
| Non-functional NADPH-dependent codeinone reductase 2 | 10 | 4 |
| Membrane steroid-binding protein 2 | 10 | 1 |
| EndochitinaseA2 | 10 | 2 |
| EC protein homolog 1 | 10 | 3 |
Fig. 5Conservation between PUT sequences of C. debaoensis and other gymnosperms.
Type and number of nucleotide repeats in SSRs.
| Repeats motif | Number of repeats | total | ||||||
|---|---|---|---|---|---|---|---|---|
| 5 | 6 | 7 | 8 | 9 | 10 | >10 | ||
| AC/GT | – | 2 | 3 | 1 | 1 | – | 7 | |
| AG/CT | – | 14 | 4 | – | 1 | 2 | 2 | 23 |
| AT/AT | – | 5 | 2 | – | 1 | – | – | 8 |
| AAC/GTT | 3 | 1 | – | – | – | – | 4 | |
| AAG/CTT | 6 | 2 | 1 | – | 1 | – | – | 10 |
| AAT/ATT | 4 | 3 | – | – | – | – | – | 7 |
| ACG/CGT | 2 | – | – | – | – | – | – | 2 |
| AGC/CTG | 4 | 2 | 1 | – | – | – | – | 7 |
| AGG/CCT | 3 | 3 | 1 | – | – | – | – | 7 |
| ATC/ATG | 5 | 1 | 1 | 1 | 1 | – | – | 9 |
| AAAT/ATTT | 6 | – | – | – | – | – | – | 6 |
| AATT/AATT | 1 | – | – | – | – | – | – | 1 |
| ACAT/ATGT | 1 | – | – | – | – | – | – | 1 |
| AAAAT/ATTTT | – | 1 | – | – | – | – | – | 1 |
| ACAGCC/CTGTGG | 1 | – | – | – | – | – | – | 1 |
| Total | 36 | 34 | 13 | 2 | 4 | 3 | 1 | 94 |
Characteristics of 20 SSR loci designed from an EST library of C. debaoensis.
| Locus | Primer sequence (5′–3′) | Repeat motif | product size (bp) | Ta (° C) | GenBank Accession No. |
|---|---|---|---|---|---|
| Cdb01 | F:CGCCCCATTTTAGATCTCTC | (TC)6 | 155 | 55 | |
| Cdb02 | F:CAATGCCAACGCTGTGTCTA | (CAT)9 | 222 | 57 | |
| Cdb04 | F:TTGCACCTGCCATTAGTCAA | (AATA)5 | 196 | 55 | |
| Cdb05 | F:TTGCACCTGCCATTAGTCAA | (AATA)5 | 196 | 55 | |
| Cdb07 | F:ATCCAAGCTAAAGGGTTCGG | (TGA)5 | 141 | 55 | |
| Cdb08 | F:CGACTGATCTCGTCCCAAAT | (GA)6 | 221 | 57 | |
| Cdb09 | F:AAATCCAAGCCAAAGGGTTC | (TGA)5 | 157 | 55 | |
| Cdb11 | F:TTGCACCTGCCATTAGTCAA | (AATA)5 | 196 | 55 | |
| Cdb12 | F:CCTGTACCAGGGACGAAGAA | (CAT)8 | 273 | 57 | |
| Cdb13 | F:CGGACCCTCAATGTGTCTTT | (CT)6 | 163 | 57 | |
| Cdb18 | F:ATTGTATATGCAGCAGCCCC | (GCA)6 | 265 | 57 | |
| Cdb19 | F:ATTGTATATGCAGCAGCCCC | (CCT)7 | 265 | 57 | |
| Cdb33 | F:AAGTTCCGTGCCAACCATAA | (ATA)5 | 164 | 55 | |
| Cdb45 | F:TGGATTCATGAGCATTGGAA | (CAT)5 | 148 | 53 | |
| Cdb48 | F:AAGCCAAAAAGGGCAAGATT | (CAA)5 | 186 | 53 | |
| Cdb50 | F:TACTTACAGCAGGGGGAAGG | (TATG)5 | 263 | 59 | |
| Cdb53 | F:TCTGTAGCGAGTTTGGGGTT | (TAT)6 | 255 | 55 | |
| Cdb54 | F:TACATCAGGCAATGGCAAAA | (AT)7 | 259 | 53 | |
| Cdb55 | F:CCTCCGAGGAACACAAACAT | (AAG)7 | 241 | 57 | |
| Cdb56 | F:ATCGGTCTCAACTTGGATGC | (TC)10 | 261 | 57 |
Identification of ESTs encoding putative RLK (LRR-RLKs,LysM-RLKs,LecRLK) in coralloid roots of C. debaoensis.
| GenBank_Accn | Annotated sequence identifier | Annotation description |
|---|---|---|
| sp|C0LGQ5|GSO1_ARATH | LRR receptor-like serine/threonine-protein kinase GSO1 | |
| sp|C0LGP4|Y3475_ARATH | Probable LRR receptor-like serine/threonine-protein kinase At3g47570 | |
| sp|Q9XID3|Y1343_ARATH | G-type lectin S-receptor-like serine/threonine-protein kinase At1g34300 | |
| sp|O64780|Y1614_ARATH | G-type lectin S-receptor-like serine/threonine-protein kinase At1g61400 | |
| sp|Q9LFG1|Y3359_ARATH | Putative leucine-rich repeat receptor-like serine/threonine-protein kinase At3g53590 | |
| sp|C0LGS2|Y4361_ARATH | Probable LRR receptor-like serine/threonine-protein kinase At4g36180 | |
| sp|C0LGP4|Y3475_ARATH | Probable LRR receptor-like serine/threonine-protein kinase At3g47570 | |
| sp|C0LGP4|Y3475_ARATH | Probable LRR receptor-like serine/threonine-protein kinase At3g47570 | |
| sp|C0LGH3|Y5614_ARATH | Probable LRR receptor-like serine/threonine-protein kinase At1g56140 | |
| sp|O64825|LYK4_ARATH | LysM domain receptor-like kinase 4 | |
| sp|Q9M2S4|LRKS4_ARATH | L-type lectin-domain containing receptor kinase S.4 | |
| sp|Q9LYX1|LRK82_ARATH | L-type lectin-domain containing receptor kinase VIII.2 | |
| sp|O49445|LRK72_ARATH | Probable L-type lectin-domain containing receptor kinase VII.2 | |
| sp|Q9LT96|Y5977_ARATH | Probable leucine-rich repeat receptor-like protein kinase At5g49770 | |
| sp|O22938|Y2182_ARATH | Leucine-rich repeat receptor-like tyrosine-protein kinase At2g41820 |