| Literature DB >> 22558534 |
Gaurav Sablok, Chun Luo, Wan Sin Lee, Farzana Rahman, Tatiana V Tatarinova, Jennifer Ann Harikrishna, Zhengrong Luo.
Abstract
We present here a systematic analysis of the Diospyros kaki expressed sequence tags (ESTs) generated from development stage-specific libraries. A total of 2,529 putative tentative unigenes were identified in the MF library whereas the OYF library displayed 3,775 tentative unigenes. Among the two cDNA libraries, 325 EST-Simple sequence repeats (SSRs) in 296 putative unigenes were detected in the MF library showing an occurrence of 11.7% with a frequency of 1 SSR/3.16 kb whereas the OYF library had an EST-SSRs occurrence of 10.8% with 407 EST-SSRs in the 352 putative unigenes with a frequency of 1 SSR/2.92 kb. We observed a higher frequency of SNPs and indels in the OYF library (20.94 SNPs/indels per 100 bp) in comparison to MF library showed a relatively lower frequency (0.74 SNPs/indels per 100 bp). A combined homology and secondary structure analysis approach identified a potential miRNA precursor, an ortholog of miR159, and potential miR159 targets, in the development-specific ESTs of D. kaki. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s13205-011-0005-9) contains supplementary material, which is available to authorized users.Entities:
Year: 2011 PMID: 22558534 PMCID: PMC3339603 DOI: 10.1007/s13205-011-0005-9
Source DB: PubMed Journal: 3 Biotech ISSN: 2190-5738 Impact factor: 2.406
Summary of in silico mining of Diospyroskaki cDNA libraries for assembly and repeat analysis
| Parameters | Values | |
|---|---|---|
| OYF Library | MF Library | |
| Total number of EST | 5,053 | 4,404 |
| Total number of contigs | 658 | 604 |
| Total Number of ESTs left to assemble | 3,117 | 1,925 |
| Redundancy index (%) | 38.31 | 56.28 |
| Total number of unigenes sequences searched | 3,775 | 2,529 |
| Total number of SSRs after removing poly A and poly T | 407 | 325 |
| Average UniGene length including poly A and poly T | 521.12 | 367.56 |
| Diversity index (%) | 75 | 57.4 |
| Repeat type | ||
| Mono-nucleotide | 3 (0.73)* | 4 (1.23) |
| Di-nucleotide | 229 (56.3) | 175 (53.84) |
| Tri-nucleotide | 101 (24.81) | 98 (30.2) |
| Tetra-nucleotide | 18 (4.42) | 20 (6.2) |
| Penta-nucleotide | 4 (1.0) | 6 (1.8) |
| Hexa-nucleotide | 52 (12.8) | 22 (6.8) |
* Data in parentheses is the percentage value of the repeat type
Fig. 1Frequency distribution of library specific repeat types identified in Diospyros kaki
Fig. 2Frequency distribution of library specific dinucleotide repeat types identified in Diospyros kaki
Fig. 3Frequency distribution of library specific trinucleotide repeat types identified in Diospyros kaki
Frequency of SSRs in the EST sets before and after assembly
| Source of | Before assembly | After assembly | ||||
|---|---|---|---|---|---|---|
| No. of sequence | No. of SSR-ESTs (1–6 bp) | SSR-ESTs (%) | No. of unigenes | No. of SSR-ESTs (1–6 bp) | SSR-ESTs (%) | |
| OYF library | 5,053 | 586 | 11.5 | 3,775 | 407 | 10.78 |
| MF library | 4,404 | 611 | 13.8 | 2,529 | 325 | 12.8 |
Fig. 4Relative distribution patterns of SSRs with primer pairs in ORF and Non-ORF among di-, tri-, tetra-, penta- and hexa-nucleotide repeat in OYF and MF library
SNP analysis
| S. No. | Parameter | OYF Library | MF library | Total |
|---|---|---|---|---|
| 1 | Total sequences analyzed | 1,936 | 2,479 | 4,415 |
| 2 | Total number of TC sequences | 658 | 604 | 1,262 |
| 3 | Total SNP and indels detected | 70,008 | 2,332 | 72,340 |
| 4 | Total consensus size (bp) | 334,271 | 314,897 | 649,168 |
| 5 | Total transitions | 27,016 | 1,216 | 28,232 |
| 6 | Total transversions | 38,966 | 869 | 39,835 |
| 7 | Total indel | 4,026 | 247 | 4,273 |
| 8 | 0.69 | 1.39 | 0.7 | |
| 9 | SNP frequency | 1SNP/5 bp | 1SNP/151 bp | 1SNP/10 bp |
| 10 | Indel frequency | 1Indel/83 bp | 1Indel/1,274 bp | 1Indel/152 bp |
ESTs from Libraries for ovary and young fruit and for mature fruit less than four mismatches to mature miRNA from rice or from Arabidopsis
| S. No. | miR Familya | Length of mature miRNA | Match (BLAST) | EST |
|---|---|---|---|---|
| 1 | osa-miR414 | 21 | 18/21 | DC588681.1b |
| 2 | ath-miR414 | 21 | 18/18 | DC591906.1b |
| 3 | osa-miR395o | 21 | 20/21 | DC591801.1b |
| 4 | peu-miR2914 | 23 | 22/22 | DC592202.1b |
| 5 | ppt-miR1038-3p | 21 | 18/20 | DC589073.1b |
| 6 | peu-miR2910 | 21 | 21/21 | DC589557.1b |
| 7 | osa-miR159a.1; osa-miR159b | 21 | 21/21 | DC584412.1 |
| 8 | sof-miR408e | 21 | 19/20 | DC585074.1 |
| 9 | osa-miR408 | 21 | 18/18 | DC584676.1 |
| 10 | osa-miR414 | 21 | 18/20 | DC588395.1 |
| 11 | ath-miR414 | 21 | 19/20 | DC588139.1; DC586156.1 |
aIdentifiers from miRBase Registry. Only the highest scoring match is shown
bIdentifiers for EST from the Mature Fruit library
Fig. 5The predicted secondary structure of candidate miR159 precursor from Diospyros kaki. The secondary structure for EST DC584412.1 (candidate miR159) predicted using MFOLD (Zuker 2003). The sequence encoding the predicted mature miR159 is indicated by the line below the strand in which the mature miRNA is located
Predicted miRNA159 targets and functions
* ESTs DC584412, DC585223 and DC586102 OYF library, whilst DC590194, DC590670 and DC592144 were from the MF library
Fig. 6GC3 distribution for selected representatives of the Ericales order
GC variations across the different libraries of Diospyroskaki
| Library type | Length | GC1 | GC2 | GC3 | GC | N |
|---|---|---|---|---|---|---|
| Ovary and young fruit library | 177 | 0.51 | 0.45 | 0.50 | 0.49 | 2,322 |
| Mature fruit library | 280 | 0.52 | 0.42 | 0.50 | 0.48 | 2,228 |
GO-term enrichment for high- and low-GC3 Diospyroskaki genes
| Ontology type | Description | High | Low | Ratio |
|---|---|---|---|---|
| Molecular function | Structural constituent of ribosome | 27 | 2 | 13.500 |
| Cellular localization | Ribosome | 21 | 2 | 10.500 |
| Cellular localization | Apoplast | 18 | 2 | 9.000 |
| Molecular function | Translation | 27 | 3 | 9.000 |
| Molecular function | Response to wounding | 17 | 3 | 5.667 |
| Cellular localization | Endomembrane system | 62 | 13 | 4.769 |
| Cellular localization | Cell wall | 39 | 9 | 4.333 |
| Cellular localization | Endoplasmic reticulum | 25 | 8 | 3.125 |
| Molecular function | Response to salt stress | 26 | 9 | 2.889 |
| Cellular localization | Membrane | 73 | 26 | 2.808 |
| Molecular function | Kinase activity | 25 | 9 | 2.778 |
| Molecular function | Transport | 16 | 6 | 2.667 |
| Molecular function | Protein serine/threonine kinase activity | 16 | 6 | 2.667 |
| Molecular function | Transcription factor activity | 30 | 13 | 2.308 |
| Molecular function | Molecular function | 134 | 59 | 2.271 |
| Cellular localization | Integral to membrane | 18 | 8 | 2.250 |
| Molecular function | Protein amino acid phosphorylation | 19 | 9 | 2.111 |
| Biological process | Biological process | 155 | 75 | 2.067 |
| Cellular localization | Nucleolus | 18 | 9 | 2.000 |
| Cellular localization | Plant-type cell wall | 17 | 9 | 1.889 |
| Cellular localization | Cellular component | 138 | 76 | 1.816 |
| Cellular localization | Vacuole | 41 | 23 | 1.783 |
| Molecular function | Defense response | 14 | 8 | 1.750 |
| Molecular function | Protein binding | 41 | 24 | 1.708 |
| Cellular localization | Chloroplast stroma | 17 | 10 | 1.700 |
| Molecular function | ATP binding | 22 | 13 | 1.692 |
| Cellular localization | Mitochondrion | 37 | 22 | 1.682 |
| Cellular localization | Chloroplast | 97 | 58 | 1.672 |
| Molecular function | DNA binding | 25 | 15 | 1.667 |
| Cellular localization | Chloroplast envelope | 16 | 10 | 1.600 |
| Molecular function | Metabolic process | 20 | 13 | 1.538 |
| Cellular localization | Plasma membrane | 90 | 61 | 1.475 |
| Cellular localization | Nucleus | 56 | 38 | 1.474 |
| Cellular localization | Cytoplasm | 19 | 13 | 1.462 |
| Molecular function | Transferase activity, transferring glycosyl groups | 12 | 9 | 1.333 |
| Molecular function | Response to cadmium ion | 16 | 14 | 1.143 |
| Molecular Function | Catalytic activity | 28 | 25 | 1.120 |
| Molecular function | Zinc ion binding | 13 | 13 | 1.000 |
| Cellular localization | Cytosol | 19 | 21 | 0.905 |
| Molecular function | ATP binding | 26 | 29 | 0.897 |
| Molecular Function | Embryonic development ending in seed dormancy | 13 | 15 | 0.867 |
| Molecular function | Nucleotide binding | 13 | 15 | 0.867 |
| Molecular function | Nucleic acid binding | 12 | 18 | 0.667 |
| Molecular function | Protein binding | 13 | 21 | 0.619 |
| Molecular function | Binding | 13 | 23 | 0.565 |