| Literature DB >> 19584933 |
Baozhu Guo1, Xiaoping Chen, Yanbin Hong, Xuanqiang Liang, Phat Dang, Tim Brenneman, Corley Holbrook, Albert Culbreath.
Abstract
Peanut is vulnerable to a range of foliar diseases such as spotted wilt caused by Tomato spotted wilt virus (TSWV), early (Cercospora arachidicola) and late (Cercosporidium personatum) leaf spots, southern stem rot (Sclerotium rolfsii), and sclerotinia blight (Sclerotinia minor). In this study, we report the generation of 17,376 peanut expressed sequence tags (ESTs) from leaf tissues of a peanut cultivar (Tifrunner, resistant to TSWV and leaf spots) and a breeding line (GT-C20, susceptible to TSWV and leaf spots). After trimming vector and discarding low quality sequences, a total of 14,432 high-quality ESTs were selected for further analysis and deposition to GenBank. Sequence clustering resulted in 6,888 unique ESTs composed of 1,703 tentative consensus (TCs) sequences and 5185 singletons. A large number of ESTs (5717) representing genes of unknown functions were also identified. Among the unique sequences, there were 856 EST-SSRs identified. A total of 290 new EST-based SSR markers were developed and examined for amplification and polymorphism in cultivated peanut and wild species. Resequencing information of selected amplified alleles revealed that allelic diversity could be attributed mainly to differences in repeat type and length in the SSR regions. In addition, a few additional INDEL mutations and substitutions were observed in the regions flanking the microsatellite regions. In addition, some defense-related transcripts were also identified, such as putative oxalate oxidase (EU024476) and NBS-LRR domains. EST data in this study have provided a new source of information for gene discovery and development of SSR markers in cultivated peanut. A total of 16931 ESTs have been deposited to the NCBI GenBank database with accession numbers ES751523 to ES768453.Entities:
Year: 2009 PMID: 19584933 PMCID: PMC2703745 DOI: 10.1155/2009/715605
Source DB: PubMed Journal: Int J Plant Genomics ISSN: 1687-5389
Similarity BLAST search against nr database for GT-C20 top 40 abundant contigs.
| Contig | No. of clones | Accession no. | Planta | Putative genes description |
|
|---|---|---|---|---|---|
| C20Lcontig37 | 455 | gb|AAG24882.1| | Glycine. max | Ribulose-1,5-bisphosphate carboxylase small subunit rbcS1 | 6 e−82 |
| C20Lcontig40 | 303 | ref|NP_068348.2| | Peanut mottle virus | Polyprotein | 0 |
| C20Lcontig77 | 184 | gb|AAB01025.1| | Bean common mosaic virus strain peanut stripe | Polyprotein | 0 |
| C20Lcontig20 | 143 | gb|AAG61120.1| | Gossypium hirsutum | Ribulose-1,5-bisphosphate carboxylase/oxygenase activase 1 | 0 |
| C20Lcontig83 | 81 | gb|AAL29886.1| | Glycine max | Chlorophyll a/b binding protein type II | 1e−148 |
| C20Lcontig76 | 47 | gb|ABC46708.1| | Arachis hypogaea | Chloroplast photosystem II 10 kDa protein | 9 e−68 |
| C20Lcontig86 | 46 | gb|AAD27876.2| | Vigna radiata | Carbonic anhydrase | 1e−153 |
| C20Lcontig4 | 45 | gb|ABE90224.1| | Medicago truncatula | Blue (type 1) copper domain; O-methyltransferase, family 2 | 1e−178 |
| C20Lcontig54 | 41 | pir||S04125| | Solanum lycopersicum | Chlorophyll a/b-binding protein type III precursor—tomato | 1e−137 |
| C20Lcontig8 | 40 | gb|AAA50172.1| | Photosystem II type I chlorophyll a/b-binding protein | 1e−146 | |
| C20Lcontig36 | 36 | gb|AAR10885.1| | Trifolium pratense | Plastidic aldolase | 0 |
| C20Lcontig127 | 34 | emb|CAD42908.1| | Prunus persica | Catalase | 1e−174 |
| C20Lcontig74 | 33 | sp|Q02060| | Spinacia oleracea | Photosystem II 22 kDa protein, chloroplast Precursor (CP22) | 1e−111 |
| C20Lcontig33 | 32 | gb|ABE83482.1| | Medicago truncatula | AAA ATPase, central region; Homeodomain-like | 0 |
| C20Lcontig70 | 31 | emb|CAA43590.1| | Solanum lycopersicum | Type I (26 kD) CP29 polypeptide | 1e−138 |
| C20Lcontig97 | 31 | prf||1803516A | Lens culinaris | Glycolate oxidase | 0 |
| C20Lcontig94 | 30 | sp|P16059| | Pisum sativum | Oxygen-evolving enhancer protein 2, chloroplast precursor (OEE2) | 1e−115 |
| C20Lcontig41 | 29 | gb|AAZ20283.1| | Arachis hypogaea | Nucleoside diphosphate kinase I | 6 e−54 |
| C20Lcontig71 | 29 | gb|AAF89206.1| | Vigna radiata | LHCII type I chlorophyll a/b-binding protein | 1e−109 |
| C20Lcontig99 | 29 | gb|AAW64931.1| | Nicotiana tabacum | Chloroplast ferredoxin I | 7 e-62 |
| C20Lcontig92 | 28 | gb|ABE80998.1| | Medicago truncatula | Phosphoglycerate kinase | 0 |
| C20Lcontig101 | 28 | gb|AAQ84170.1| | Pueraria montana var. lobata | Isoprene synthase | 0 |
| C20Lcontig108 | 25 | ref|NP_192427.1| | Arabidopsis thaliana | Calcium ion binding | 2 e−84 |
| C20Lcontig111 | 25 | sp|P26969| | decarboxylating | Glycine dehydrogenase [decarboxylating], mitochondrial precursor | 0 |
| C20Lcontig107 | 24 | gb|AAO33588.1| | Arachis hypogaea | Putative extensin/nodulin protein | 3 e−93 |
| C20Lcontig112 | 24 | gb|AAO33591.1| | Arachis hypogaea | Putative early light induced protein | 3 e−97 |
| C20Lcontig113 | 23 | gb|ABE82236.1| | Medicago truncatula | BURP | 1e−103 |
| C20Lcontig114 | 22 | sp|P10708| | Solanum lycopersicum | Chlorophyll a-b binding protein 7, chloroplast precursor (LHCI type II CAB-7) | 1e−145 |
| C20Lcontig16 | 21 | gb|AAQ84168.1| | Pueraria montana var. lobata | 1-deoxy-D-xylulose 5-phosphate reductoisomerase | 0 |
| C20Lcontig43 | 21 | gb|ABA86963.1| | Glycine max | Glyceraldehyde-3-phosphate dehydrogenase A subunit | 0 |
| C20Lcontig117 | 20 | gb|ABE80774.1| | Medicago truncatula | Chlorophyll A-B binding protein | 1e−145 |
| C20Lcontig12 | 19 | dbj|BAB82452.1| | Vigna radiata | CYP1 | 4 e−90 |
| C20Lcontig44 | 19 | ref|NP_181539.1| | Arabidopsis thaliana | LHCB4.3; chlorophyll binding | 1e−115 |
| C20Lcontig104 | 19 | sp|P27774| | Mesembryanthemum crystallinum | Phosphoribulokinase, chloroplast precursor (Phosphopentokinase) (PRKase) (PRK) | 0 |
| C20Lcontig32 | 18 | sp|P17340| | Solanum lycopersicum | Plastocyanin, chloroplast precursor | 7 e−68 |
| C20Lcontig85 | 18 | gb|AAD27877.1| | Vigna radiata | LHCII type III chlorophyll a/b binding protein | 1e−147 |
| C20Lcontig110 | 18 | gb|ABE77926.1| | Medicago truncatula | Flavoprotein pyridine nucleotide cytochrome reductase | 0 |
| C20Lcontig118 | 18 | emb|CAA45523.1| | Nicotiana tabacum | Photosystem I light-harvesting chlorophyll a/b-binding protein | 1e−117 |
| C20Lcontig120 | 18 | gb|AAL47679.1| | Cucumis melo | Aminotransferase 1 | 0 |
Similarity BLAST search against nr database for Tifrunner top 40 abundant contigs.
| Contig | No. of clones | Accession no. | Planta | Putative gene description |
|
|---|---|---|---|---|---|
| TFLcontig11 | 271 | ref|NP_068348.2| | Peanut mottle virus | Polyprotein | 0 |
| TFLcontig2 | 174 | gb|AAG24882.1| | Glycine max | Ribulose-1,5-bisphosphate carboxylase small subunit | 1e−81 |
| TFLcontig12 | 162 | gb|AAA50172.1| | Glycine max | Photosystem II type I chlorophyll a/b-binding protein | 1e−146 |
| TFLcontig14 | 140 | gb|AAB01025.1| | Bean common mosaic virus strain peanut stripe | Polyprotein | 0 |
| TFLcontig1 | 116 | gb|ABE90224.1| | Medicago truncatula | Blue (type 1) copper domain; O-methyltransferase, family | 1e−134 |
| TFLcontig5 | 66 | gb|AAZ20291.1| | Arachis hypogaea | Metallothionein-like protein | 3 e−46 |
| TFLcontig19 | 65 | sp|Q01516| | Pisum sativum | Fructose-bisphosphate aldolase 1, chloroplast precursor | 1e−151 |
| TFLcontig27 | 60 | gb|ABA86963.1| | Glycine max | Glyceraldehyde-3-phosphate dehydrogenase A subunit | 1e−107 |
| TFLcontig8 | 55 | sp|P24007| | Pyrus pyrifolia var. culta | Ribulose bisphosphate carboxylase small chain, chloroplast precursor (RuBisCO small subunit) | 8 e−75 |
| TFLcontig30 | 53 | gb|AAG61120.1| | Gossypium hirsutum | Ribulose-1,5-bisphosphate carboxylase/oxygenase activase 1 | 1e−170 |
| TFLcontig28 | 52 | gb|AAL29886.1| | Glycine max | Chlorophyll a/b binding protein type II | 1e−148 |
| TFLcontig7 | 31 | gb|AAQ84170.1| | Pueraria montana var. lobata | Isoprene synthase | 1e−166 |
| TFLcontig32 | 27 | gb|ABF38996.1| | Pachysandra terminalis | Ribulose-1,5-bisphosphate carboxylase/oxygenase activase | 1e−119 |
| TFLcontig57 | 27 | gb|AAS58469.1| | Gossypium hirsutum | Ultraviolet-B-repressible protein | 6 e−35 |
| TFLcontig45 | 26 | sp|P10708| | Solanum lycopersicum | Chlorophyll a-b binding protein 7, chloroplast precursor (LHCI type II CAB-7) | 1e−145 |
| TFLcontig25 | 24 | gb|AAD27877.1| | Vigna radiata | LHCII type III chlorophyll a/b binding protein | 1e−147 |
| TFLcontig43 | 24 | gb|AAO33591.1| | Arachis hypogaea | Putative early light induced protein | 1e−101 |
| TFLcontig46 | 21 | dbj|BAB82452.1| | Vigna radiata | CYP1 | 8 e−91 |
| TFLcontig15 | 20 | gb|AAO33588.1| | Arachis hypogaea | Putative extensin/nodulin protein | 1e−42 |
| TFLcontig40 | 20 | sp|P16059| | Pisum sativum | Oxygen-evolving enhancer protein 2, chloroplast precursor (OEE2) (23 kDa subunit of oxygen evolving system of photosystem II) | 1e−116 |
| TFLcontig23 | 18 | sp|P17340| | Solanum lycopersicum | Plastocyanin, chloroplast precursor | 1e−67 |
| TFLcontig56 | 17 | pir||S04125 | Solanum lycopersicum | Chlorophyll a/b-binding protein type III precursor—tomato | 1e−138 |
| TFLcontig55 | 16 | emb|CAA43590.1| | Solanum lycopersicum | Type I (26 kD) CP29 polypeptide | 1e−138 |
| TFLcontig63 | 16 | gb|AAP03873.1| | Nicotiana tabacum | Photosystem I reaction center subunit X psaK | 6 e−57 |
| TFLcontig64 | 15 | emb|CAA45523.1| | Nicotiana tabacum | Photosystem I light-harvesting chlorophyll a/b-binding protein | 1e−116 |
| TFLcontig67 | 15 | gb|AAH02118.1| | Mus musculus | Unknown (protein for MGC:6623) | 1e−85 |
| TFLcontig70 | 15 | gb|ABM45856.1| | Arachis hypogaea | Cytosolic ascorbate peroxidase | 1e−142 |
| TFLcontig68 | 13 | sp|P31336| | Gossypium hirsutum | Photosystem II 5 kDa protein, chloroplast precursor (PSII-T) (Light-regulated unknown 11 kDa protein) | 2 e−26 |
| TFLcontig76 | 13 | gb|ABE77926.1| | Medicago truncatula | Flavoprotein pyridine nucleotide cytochrome reductase | 1e−158 |
| TFLcontig34 | 12 | gb|ABD32352.1| | Medicago truncatula | Heat shock protein Hsp20 | 1e−73 |
| TFLcontig48 | 12 | sp|P14226| | Pisum sativum | Oxygen-evolving enhancer protein 1, chloroplast precursor (OEE1) | 1e−131 |
| TFLcontig58 | 12 | gb|ABD28376.1| | Medicago truncatula | Photosystem I reaction centre, subunit XI | 1e−100 |
| TFLcontig65 | 12 | gb|ABA08415.1| | Arachis hypogaea | Type 2 metallothionein | 2e−45 |
| TFLcontig78 | 12 | ref|NP_181539.1| | Arabidopsis thaliana | LHCB4.3; chlorophyll | 1e−107 |
| TFLcontig79 | 12 | gb|AAR12194.1| | Nicotiana benthamiana | Molecular chaperone Hsp90-2 | 1e−120 |
| TFLcontig82 | 12 | gb|AAW66657.1| | Picrorhiza kurrooa | Thiamine biosynthetic enzyme | 1e−117 |
| TFLcontig77 | 11 | ref|NP_175963.1| | Arabidopsis thaliana | Unknown protein | 8 e−53 |
| TFLcontig87 | 11 | emb|CAD42908.1| | Prunus persica | Catalase [Prunus persica] | 1e−102 |
| TFLcontig88 | 11 | gb|AAW31666.1| | Ammopiptanthus mongolicus | Putative late-embryogenesis protein-like protein | 7 e−24 |
Figure 1Functional classification of peanut unique EST sequences by referring to Arabidopsis Sequencing Project Functional Categories. The matched unique sequences are shown in figures. The unique sequences which have no significant homology to Arabidopsis genes or matched to those that are unknown and unclassified genes were described in text. (a) Functional categories of GT-C20 unique EST sequences. (b) Functional categories of Tifrunner unique EST sequences.
Statistics of GT-C20 and Tifrunner unique EST sequences assigned to GO functional categories.
| GO term | GO ID | GT-C20 | Tifrunner | |
|---|---|---|---|---|
| Biological_process | Rhythmic process | GO:0048511 | 9 | 3 |
| Response to stimulus | GO:0050896 | |||
| Response to stress | GO:0006950 | 16 | 11 | |
| Response to external stimulus | GO:0009605 | 12 | 4 | |
| Response to endogenous stimulus | GO:0009719 | 2 | 1 | |
| Response to chemical stimulus | GO:0042221 | 39 | 21 | |
| Response to biotic stimulus | GO:0009607 | 14 | 20 | |
| Response to abiotic stimulus | GO:0009628 | 62 | 82 | |
| Defense response | GO:0006952 | 11 | 9 | |
| Behavior | GO:0007610 | 34 | 23 | |
| Reproduction | GO:0000003 | 50 | 28 | |
| Multi-organism process | GO:0051704 | 4 | 3 | |
| Multicellular organismal process | GO:0032501 | 22 | 12 | |
| Metabolic process | GO:0008152 | 259 | 203 | |
| Locomotion | GO:0040011 | 3 | 4 | |
| Localization | GO:0051179 | 3 | 2 | |
| Immune system process | GO:0002376 | 7 | 10 | |
| Growth | GO:0040007 | 16 | 11 | |
| Establishment of localization | GO:0051234 | 117 | 72 | |
| Developmental process | GO:0032502 | 127 | 78 | |
| Cellular process | GO:0009987 | 901 | 550 | |
| Biological regulation | GO:0065007 | 214 | 128 | |
| Biological adhesion | GO:0022610 | 13 | 6 | |
|
| ||||
| Molecular_function | Transporter activity | GO:0005215 | 102 | 71 |
| Transcription regulator activity | GO:0030528 | 25 | 13 | |
| Structural molecule activity | GO:0005198 | 147 | 102 | |
| Motor activity | GO:0003774 | 9 | 3 | |
| Molecular transducer activity | GO:0060089 | 16 | 14 | |
| Enzyme regulator activity | GO:0030234 | 32 | 21 | |
| Catalytic activity | GO:0003824 | 876 | 466 | |
| Binding | GO:0005488 | 605 | 441 | |
| Antioxidant activity | GO:0016209 | 24 | 18 | |
|
| ||||
| Cellular_component | Macromolecular complex | GO:0032991 | 4 | 4 |
| Extracellular | GO:0005576 | 23 | 10 | |
| Cell | GO:0005623 | 966 | 705 | |
Figure 2Distribution of repeat units in dinucleotide motif and trinucleotide motif in ESTs from GT-C20 and Tifrunner. Y-axis represents the frequency of microsatellites of a specific motif repeat. X-axis represents the number of repeats for (a) dinucleotide motif and (b) trinucleotide motif.
Figure 3Distribution of peanut leaf EST-derived SSRs according to motif sequence type. X-axis is motif sequence types (considering sequence complementary), and Y-axis represents the frequency of SSRs of a given motif sequencer type.
Comparison of the peanut unique ESTs with those in soybean, Medicago, Arabidopsis, oilseed rape, rice, maize and wheat.
| Number of ESTs matched to TIGR gene indices | ||
|
| (Percentage in Parentheses) | |
| Identity ≥ 80% | Identity ≥ 90% | |
|
| ||
|
| 3,429 (49.78) | 259 (3.76) |
|
| 2,724 (39.55) | 138 (2.00) |
|
| 843 (12.24) | 129 (1.87) |
|
| 622 (9.03) | 37 (0.54) |
|
| 833 (12.09) | 209 (3.03) |
|
| 716 (10.39) | 180 (2.61) |
|
| 674 (9.79) | 136 (1.97) |
Figure 4Amplification patterns obtained with primer EM-31 in polyacrylamide gel electrophoresis of cultivated peanuts, including Chinese landrace and breeding lines, and US market types of runner, Spanish and Virginia, and wild species. Arrows indicate the bands excised for sequencing. M = molecular weight marker in base pair, 1 = Guangliu, 2 = Sanyuening, 3 = GT-C20, 4 = Spancross, 5 = Tennessee Red, 6 = Xiaoliuqiu, 7 = Yangjiangpudizan, 8 = Xihuagoudo, 9 = Padou, 10 = Bo-50, 11 = Yingdejidouzai, 12 = Heyuanbanman, 13 = Tosunxiaohuasheng, 14 = SunOleic 97R, 15 = Tifrunner, 16 = Georgia Green, 17 = NC94022, 18 = A. villosa, 19 = A. stenosperma, 20 = A. correntina, 21 = A. cardenasii, 22 = A. magna, 23 = A. duranensis, 24 = A. chacoensis, 25 = A. batizocoi, 26 = A. helodes, 27 = A. monticola, 28 = A. pintoi.
Figure 5Sequences comparison. SSR primer EM-31 amplificon sequences were obtained as indicated in Figure 4. The original EST sequence (ES719796) was included, where SSR primer EM-31 was designed. The pair of forward and reverse primer sequences was indicated by a pair of lines with arrows and bold-faced. The repetitive sequence regions were shaded in gray. The single nucleotide polymorphisms were indicated by dark-shaded and purple-colored star.