| Literature DB >> 21929789 |
Wenliang Wei1, Xiaoqiong Qi, Linhai Wang, Yanxin Zhang, Wei Hua, Donghua Li, Haixia Lv, Xiurong Zhang.
Abstract
BACKGROUND: Sesame is an important oil crop, but limited transcriptomic and genomic data are currently available. This information is essential to clarify the fatty acid and lignan biosynthesis molecular mechanism. In addition, a shortage of sesame molecular markers limits the efficiency and accuracy of genetic breeding. High-throughput transcriptomic sequencing is essential to generate a large transcriptome sequence dataset for gene discovery and molecular marker development.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21929789 PMCID: PMC3184296 DOI: 10.1186/1471-2164-12-451
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Size distribution of the assembled contigs, scaffolds, and unigenes in the three libraries.
Figure 2Comparison of unigene length with or without hits. Longer contigs were more likely to have BLASTx homologs in protein databases; 97.00% of the unigenes over 1,000 bp had BLASTx homologs, while only 25.65% of the unigenes shorter than 300 bp had homologs.
Distribution of unigenes on Arabidopsis chromosomes
| Chromosome | Chr 1 | Chr 2 | Chr 3 | Chr 4 | Chr 5 | Chr C | Chr M | Total |
|---|---|---|---|---|---|---|---|---|
| 3,966 | 2,302 | 3,032 | 2,336 | 3,734 | 55 | 35 | 15,460 |
Unigenes involved in fatty acid biosynthesis
| Query | Description | Score | E-value | |
|---|---|---|---|---|
| Unigene37331 | AT5G16390.1 | chloroplastic acetylcoenzyme A carboxylase 1 | 154 | 6.00E-38 |
| Unigene40848 | AT5G35360.3 | acetyl Co-enzyme a carboxylase biotin carboxylase subunit | 877 | 0 |
| Unigene3230 | AT2G38040.2 | acetyl Co-enzyme a carboxylase carboxyltransferase alpha subunit | 202 | 3.00E-53 |
| Unigene3676 | ATCG00500.1 | acetyl-CoA carboxylase carboxyl transferase subunit beta | 451 | 1.00E-127 |
| Unigene39950 | AT1G36180.1 | acetyl-CoA carboxylase 2 | 1559 | 0 |
| Unigene33384 | AT2G30200.1 | catalytics; transferases; [acyl-carrier-protein] S-malonyltransferases; binding | 543 | 1.00E-154 |
| Unigene10471 | AT1G62640.2 | 3-ketoacyl-acyl carrier protein synthase III | 550 | 1.00E-156 |
| Unigene6985 | AT1G74960.3 | fatty acid biosynthesis 1 | 35.4 | 9.00E-03 |
| Unigene23348 | AT5G46290.1 | 3-ketoacyl-acyl carrier protein synthase I | 494 | 1.00E-139 |
| Unigene7472 | AT2G04540.1 | beta-ketoacyl synthase | 370 | 1.00E-103 |
| Unigene10471 | AT1G62640.2 | 3-ketoacyl-acyl carrier protein synthase III | 550 | 1.00E-156 |
| Unigene5774 | AT1G24360.1 | NAD(P)-binding Rossmann-fold superfamily protein | 79.3 | 5.00E-16 |
| Unigene637 | AT2G22230.1 | thioesterase superfamily protein | 304 | 1.00E-82 |
| Unigene15731 | AT2G05990.2 | NAD(P)-binding Rossmann-fold superfamily protein | 545 | 1.00E-155 |
| Unigene42291 | AT3G25110.1 | fatA acyl-ACP thioesterase | 104 | 5.00E-23 |
| Unigene2384 | AT1G08510.1 | fatty acyl-ACP thioesterases B | 58.5 | 2.00E-09 |
| Unigene8278 | AT3G12120.2 | fatty acid desaturase 2 | 134 | 2.00E-32 |
Figure 3Gene ontology classification of assembled unigenes. The results are summarized into three main categories: biological processes, cellular components, and molecular functions. In total, 10,805 unigenes with BLASTx matches were assigned to gene ontologies.
Figure 4Clusters of orthologous groups (COG) classification. In total, 27,588 of the 46,479 sequences with Nr hits were grouped into 25 COG classifications.
Frequency of EST-SSRs in sesame
| Motif length | Repeat numbers | Total | % | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 4 | 5 | 6 | 7 | 8 | 9 | 10 | > 10 | |||
| - | - | 1,452 | 1,001 | 854 | 612 | 377 | 870 | 5,166 | 67.07 | |
| - | 1,130 | 443 | 194 | 90 | 51 | 4 | 5 | 1,917 | 24.89 | |
| 247 | 65 | 16 | 2 | 2 | 0 | 0 | 0 | 332 | 4.31 | |
| 73 | 11 | 1 | 0 | 0 | 0 | 0 | 0 | 85 | 1.10 | |
| 176 | 25 | 0 | 1 | 0 | 0 | 0 | 0 | 202 | 2.62 | |
| 496 | 1,231 | 1,912 | 1,198 | 946 | 663 | 381 | 875 | |||
| 6.44 | 15.98 | 24.82 | 15.55 | 12.28 | 8.61 | 4.95 | 11.36 | |||
Frequency of di- and trinucleotide EST-SSR repeat motifs in sesame
| Repeat motif | Repeat numbers | Total | % | ||||||
|---|---|---|---|---|---|---|---|---|---|
| 5 | 6 | 7 | 8 | 9 | 10 | > 10 | |||
| - | 349 | 247 | 210 | 148 | 51 | 133 | 1,138 | 16.07 | |
| - | 786 | 581 | 512 | 396 | 302 | 702 | 3,279 | 46.29 | |
| - | 316 | 172 | 131 | 68 | 24 | 35 | 746 | 10.53 | |
| - | 1 | 1 | 1 | 0 | 3 | 0.04 | |||
| 32 | 20 | 10 | 1 | 0 | 63 | 0.89 | |||
| 237 | 111 | 50 | 23 | 19 | 1 | 0 | 441 | 6.23 | |
| 77 | 26 | 6 | 2 | 5 | 0 | 116 | 1.64 | ||
| 137 | 55 | 23 | 13 | 4 | 1 | 233 | 3.29 | ||
| 102 | 35 | 14 | 8 | 3 | 0 | 162 | 2.29 | ||
| 104 | 46 | 20 | 13 | 6 | 1 | 2 | 192 | 2.71 | |
| 79 | 33 | 12 | 8 | 5 | 0 | 137 | 1.93 | ||
| 135 | 56 | 30 | 12 | 5 | 2 | 0 | 240 | 3.39 | |
| 146 | 40 | 21 | 11 | 3 | 2 | 223 | 3.15 | ||
| 81 | 21 | 8 | 0 | 110 | 1.55 | ||||
| 1,130 | 1,895 | 1,195 | 944 | 663 | 381 | 875 | |||
| 15.95 | 26.75 | 16.87 | 13.33 | 9.36 | 5.38 | 12.35 | |||
Characterization of 40 EST-SSRs among 24 sesame accessions
| Primer | SSRs | Forward primer (5'-3') | Reverse primer (5'-3') | No. of alleles | PIC | ||
|---|---|---|---|---|---|---|---|
| ZM_1 | (CA)6 | GTTTCTTGGTCTTATCACAGC | TACCAACGTCACTCTTCTTTC | 5 | 1.00 | 0.72 | 0.66 |
| ZM_2 | (AC)8 | CTTCTTGAAGTTCTGGTGTTG | ATTCTTGGAGAAAGAGTGAGG | 6 | 0.91 | 0.77 | 0.72 |
| ZM_3 | (GT)8 | ATCACCACACACTGACACAG | CGTGTCTGAGAATCCAATATC | 7 | 0.96 | 0.79 | 0.74 |
| ZM_4 | (AC)8 | TCCAGAGAGGAGACAATAAGA | GAGATAGATTGCGAGTTGTGT | 4 | 0.71 | 0.72 | 0.64 |
| ZM_5 | (CT)10 | GATAAAGAACTGCCAAGGAAC | CACAGCAGTGAAGAAAAGAGT | 6 | 0.71 | 0.74 | 0.68 |
| ZM_6 | (GT)9 | GGTGTGTTCTCTCTCTCACAC | GGGCTGCTCAATAAATGTAG | 6 | 0.75 | 0.78 | 0.73 |
| ZM_7 | (AGC)6 | ATCCTCTGCTCCTAACTTCAT | TCTGGTACTATCCTCAAGCAA | 6 | 0.96 | 0.79 | 0.74 |
| ZM_8 | (AG)6 | TCTCTCTCTCTCTCGTTCTTG | CCCACTGTACCTCTCCATATT | 5 | 0.70 | 0.73 | 0.66 |
| ZM_9 | (AAT)5 | CTCGCCGTTCTAACATTATC | AGTACAGTCTCCTCCGATTTT | 5 | 0.50 | 0.72 | 0.65 |
| ZM_10 | (CTTCCT)4 | ATGCCCATCTCCATATACTCT | AATTCTTGCCTGACTCTACG | 8 | 0.83 | 0.81 | 0.76 |
| ZM_11 | (CT)12 | GGATTCTCTAGACATGGCTTT | AACGCAGAATTCTCTCCTACT | 8 | 1.00 | 0.85 | 0.81 |
| ZM_12 | (CT)7 | ATTGCTGTGCAATCCTTATC | ATCTCTTTCTACCACCACGTT | 5 | 1.00 | 0.79 | 0.74 |
| ZM_13 | (GA)7 | GCAGAAGGCAATAAAGTCAT | GCGTCAGAAGAAAAATACTGG | 7 | 1.00 | 0.80 | 0.75 |
| ZM_14 | (ATC)5 | GGAAGGCGAGTTGATAGATAA | CATGGGATGTTCAAAGAACT | 6 | 0.96 | 0.79 | 0.73 |
| ZM_15 | (GA)7 | ACTCTATCACCGAGTGGAGAC | CTACCCTTTTCCTCGTAGC | 7 | 0.65 | 0.75 | 0.69 |
| ZM_16 | (GATC)4 | AGGTAGAATTACATGCTGTGC | GCTTCCTCCTTCATTCATATC | 5 | 0.60 | 0.71 | 0.65 |
| ZM_17 | (CT)6 | CTTGCTTCCTCTTTTCTCTCT | ACACTGTACTCAGCGGATTT | 5 | 1.00 | 0.79 | 0.72 |
| ZM_18 | (CT)6 | AATACCCTTCAGTATTCAGGTG | CAACAACACAAACACTGCTAC | 5 | 1.00 | 0.80 | 0.75 |
| ZM_20 | (GAG)5 | GGGATGTTGATAGAGATGTTG | TCTTTCACTCTCACACACACA | 10 | 1.00 | 0.86 | 0.82 |
| ZM_21 | (AC)7 | CTCTCTCTCTCTGCTGTTTCA | GCCATACGATCTCAAAATCAC | 8 | 0.95 | 0.86 | 0.82 |
| ZM_22 | (AT)9(GA)6 | ACCACCGATCTACTCACTTTT | CCACTGCACACTACAGTTTTT | 9 | 0.68 | 0.86 | 0.82 |
| ZM_23 | (AT)7(GA)8 | CGTATGTCAAGATGAAGCAGT | ATCAACAATTCCACTCAACC | 6 | 0.95 | 0.75 | 0.69 |
| ZM_24 | (TC)6(CCATTT)4 | CCACACTCAAAACCAAGAAA | GCGAAGAGATTATATACACACG | 5 | 0.75 | 0.72 | 0.66 |
| ZM_25 | (AT)6 | CCTGAACCTTCTCTCTCTCTC | ACTGACAGTACGAATTCACCA | 4 | 0.58 | 0.71 | 0.64 |
| ZM_26 | (GA)8 | ACTTCAACTTCAACCTCAACC | TGTGCATAAAAACCCTCTCT | 5 | 1.00 | 0.65 | 0.57 |
| ZM_29 | (AG)9 | CATTACAATAGCCCGAAAAG | TACTGTTCCTCCTCCTCTCTT | 6 | 1.00 | 0.70 | 0.62 |
| ZM_30 | (TAT)5 | CACTCCACTCATTATCCAAAG | CAAGACACAACTGACACGTAA | 6 | 1.00 | 0.77 | 0.72 |
| ZM_31 | (GA)7 | GAGCACTCTCTTCTCTCTTCC | AAAAGAGGATGGCAACTGTA | 5 | 0.78 | 0.72 | 0.65 |
| ZM_32 | (GCC)5 | CACGAAGAGTGAGAGAGAGAG | CTACCAAAAGTCCCTGAATCT | 5 | 0.77 | 0.70 | 0.62 |
| ZM_33 | (TCA)5 | GAGACAGTACACTTGGGACAA | CTCTTCTTGGGCATTAACTCT | 6 | 0.74 | 0.73 | 0.67 |
| ZM_34 | (AG)14 | AAGTCCCTTTTCAAGCAATC | GAGAGAGGAAAATGCAGAGAG | 10 | 0.79 | 0.83 | 0.79 |
| ZM_35 | (TTCC)4 | AATGCATAGTGCATAGGGTAG | TGGAAAGTAGAGATCGCATAG | 6 | 0.96 | 0.75 | 0.69 |
| ZM_38 | (GA)8 | CAGCTTCCTGATTTGATTTG | AGATTGCAAGAATCGCTTAG | 5 | 0.85 | 0.69 | 0.62 |
| ZM_39 | (TCA)5 | AGAGGCAGAGGAGTTGATAAT | CTTAACTGTAACTCCCTTTTCG | 6 | 0.81 | 0.78 | 0.72 |
| ZM_40 | (ACTCCA)5 | CGAAAAGGGAGTTACAGTTAAG | CTTCCTCTCCTATCATCCTGT | 7 | 0.82 | 0.82 | 0.77 |
| ZM_41 | (AC)8 | GATATGATTCAAACCCCTCAG | CTTCTGCACTACCATCAATTC | 5 | 0.63 | 0.55 | 0.46 |
| ZM_43 | (ACAT)4 | CTTGGATATCAGTTTCCTGTG | GTTCTCCACAGTCAAAACACT | 7 | 0.89 | 0.72 | 0.65 |
| ZM_44 | (TC)9 | GTCTTAAGCCCTCTTAGTTCC | GAAAACCTTCAATGTCAGGA | 7 | 0.77 | 0.77 | 0.73 |
| ZM_45 | (TA)6 | GCAAAATCTCTGTTGTCTCAG | GTGTTCCTACCACTCAACACA | 18 | 0.83 | 0.83 | 0.80 |
| ZM_47 | (TC)8 | GTTTCCAGGTCTATTCCTTTG | AGGTAGAGCTAATCCTTACCG | 10 | 0.71 | 0.83 | 0.79 |
| Mean | 6.55 | 0.84 | 0.76 | 0.70 |