| Literature DB >> 21565938 |
Anuja Dubey1, Andrew Farmer, Jessica Schlueter, Steven B Cannon, Brian Abernathy, Reetu Tuteja, Jimmy Woodward, Trushar Shah, Benjamin Mulasmanovic, Himabindu Kudapa, Nikku L Raju, Ragini Gothalwal, Suresh Pande, Yongli Xiao, Chris D Town, Nagendra K Singh, Gregory D May, Scott Jackson, Rajeev K Varshney.
Abstract
This study reports generation of large-scale genomic resources for pigeonpea, a so-called 'orphan crop species' of the semi-arid tropic regions. FLX/454 sequencing carried out on a normalized cDNA pool prepared from 31 tissues produced 494 353 short transcript reads (STRs). Cluster analysis of these STRs, together with 10 817 Sanger ESTs, resulted in a pigeonpea trancriptome assembly (CcTA) comprising of 127 754 tentative unique sequences (TUSs). Functional analysis of these TUSs highlights several active pathways and processes in the sampled tissues. Comparison of the CcTA with the soybean genome showed similarity to 10 857 and 16 367 soybean gene models (depending on alignment methods). Additionally, Illumina 1G sequencing was performed on Fusarium wilt (FW)- and sterility mosaic disease (SMD)-challenged root tissues of 10 resistant and susceptible genotypes. More than 160 million sequence tags were used to identify FW- and SMD-responsive genes. Sequence analysis of CcTA and the Illumina tags identified a large new set of markers for use in genetics and breeding, including 8137 simple sequence repeats, 12 141 single-nucleotide polymorphisms and 5845 intron-spanning regions. Genomic resources developed in this study should be useful for basic and applied research, not only for pigeonpea improvement but also for other related, agronomically important legumes.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21565938 PMCID: PMC3111231 DOI: 10.1093/dnares/dsr007
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Sequence length distribution before and after assembly of FLX/454 STRs and Sanger ESTs
| Range of nucleotide length (bp) | Raw 454 STRs | Raw Sanger ESTs | Assembled 454 STRs | Assembled Sanger ESTs | Assembled 454 STRs + Sanger ESTs |
|---|---|---|---|---|---|
| 50 | 31 876 (6.4%) | 44 (0.4%) | 0 | 0 | 0 |
| 51–100 | 61 172 (12.3%) | 180 (1.6%) | 2282 (4.7%) | 5 (0.6%) | 2253 (4.6%) |
| 101–150 | 84 878 (17.1%) | 420 (3.8%) | 4854 (10.0%) | 4 (0.5%) | 4829 (9.9%) |
| 151–200 | 88 806 (17.9%) | 449 (4.1%) | 5934 (12.2%) | 17 (2.2%) | 5874 (12.0%) |
| 201–250 | 185 863 (37.5%) | 658 (6.0%) | 12 780 (26.3%) | 24 (3.2%) | 12 561 (25.7%) |
| 251–300 | 41 758 (8.4%) | 630 (5.8%) | 9224 (19.0%) | 20 (2.6%) | 9015 (18.5%) |
| 301–350 | 401 (3.7%) | 4960(10.2%) | 21 (2.8%) | 4821 (9.8%) | |
| 351–400 | 603 (5.5%) | 3415 (7.0%) | 42 (5.6%) | 3349 (6.8%) | |
| 401–450 | 666 (6.1%) | 1901 (3.9%) | 37 (4.9%) | 1879 (3.8%) | |
| 451–500 | 573 (5.2%) | 3169 (6.5%) | 57 (7.6%) | 1124 (2.3%) | |
| 501–550 | 740 (6.8%) | 58 (7.7%) | 3021 (6.1%) | ||
| 551–600 | 575 (5.3%) | 65 (8.7%) | |||
| 601–650 | 621 (5.7%) | 51 (6.8%) | |||
| 651–700 | 887 (8.2%) | 45 (6.0%) | |||
| 701–750 | 1590 (14.6%) | 79 (10.5%) | |||
| 751–800 | 682 (6.3%) | 42 (5.6%) | |||
| 801–850 | 1098 (10.1%) | 74 (9.9%) | |||
| 851–900 | 24 (3.2%) | ||||
| 901–950 | 18 (2.4%) | ||||
| 951–1000 | 12 (1.6%) | ||||
| 1001–1050 | 11 (1.4%) | ||||
| 1051–1100 | 40 (5.3%) | ||||
| Total reads | 494 353 | 10 817 | 48 519 | 746 | 48 726 |
Figure 1.Distribution and read depth of sequence tags in the contigs. Number of 454 STRs aligning to form a contig ranged from 2 to >1000. A total of 36 152 contigs showed a read depth ranging from 2 to 5 tags, followed by 6755 contigs with read depth 6–10 tags, 5517 contigs with read depth 11–100 tags and 290 contigs with read depth ranging from 100 to 1000 tags. A maximum of 12 contigs showed a read depth of >1000 tags.
Figure 2.Histogram plot of pair-wise synonymous distances of pigeonpea duplicated genes when compared with soybean. Histogram plot of percentage pair-wise distance to the synonymous distance value (Ks) showing a peak in pigeonpea at 0.06 which gives a divergence estimate of ∼4.9 Mya.
Figure 3.Distribution of differentially expressed genes in SMD- and FW-responsive genotypes. Differential expression was calculated based on Log 2 value, with a threshold of less than −2 to greater than +2 number of differentially expressed gene was calculated for three SMD- and two FW-parental combinations. Comparison of expression values from susceptible parent to the resistant gave an estimate of up- and down-regulated genes in each cross.
Figure 4.Hierarchical clustering of differentially expressed TAs within SMD- and FW-responsive genotypes. Hierarchial clustering of the gene involved in SMD- and FW-stress responses was done using HCE version 2.0 beta web tool. These two dendrograms illustrate the co-regulation of genes in resistant and susceptible genotypes under stress, (A) clustering of 5106 genes showing expression variation across six SMD-responsive genotypes (three resistant, ICPL 20096, ICPL 7035, BSMR 736, and three susceptible, ICPL 332, TTB 7, TAT 10). (B) Clustering of 3384 genes showing expression variation across four FW-responsive genotypes (two resistant, ICPL 87119, ICPL 99050, and two susceptible, ICPB 2049, ICPL 87091). Colour scale (from green to red) represents the range of expression level.
Summary of differentially expressed TUSs across five parental combinations
| Fold difference | ICPL 20096 × ICPL 332 | BSMR 736 × TAT 10 | TTB 7 × ICPL 7035 | ICPL 87119 × ICPL 87091 | ICPB 2049 × ICPL 99050 |
|---|---|---|---|---|---|
| Exclusively expressed in resistant | 9149 | 8867 | 9455 | 243 | 10 675 |
| Exclusively expressed in susceptible | 1964 | 269 | 200 | 9784 | 2098 |
| 2 to <3 fold | 3385 | 2663 | 3024 | 3715 | 2638 |
| 3 to <4 fold | 2782 | 2111 | 3082 | 3859 | 1779 |
| 4 to <5 fold | 2108 | 1499 | 2180 | 2618 | 1167 |
| 5 to <6 fold | 1544 | 807 | 856 | 1053 | 674 |
| ≥6 fold | 998 | 385 | 260 | 273 | 415 |
Illumina sequencing based SNP discovery in five parental combinations
| Genotypes | ICPL 87119 | ICPL 87091 | BSMR 736 | TAT 10 | TTB 7 | ICPL 7035 | ICPL 20096 | ICPL 332 | ICPB 2049 | ICPL 99050 |
|---|---|---|---|---|---|---|---|---|---|---|
| Number of reads (in millions) | 18.4 | 16.8 | 16.9 | 14.5 | 15.7 | 17.2 | 15.5 | 14.8 | 15.7 | 17.3 |
| Number of SNPs in parental combination | ||||||||||
| Substitution | 5965 | 573 | 955 | 1580 | 1878 | |||||
| Insertion | 176 | 87 | 28 | 31 | 61 | |||||
| Deletion | 122 | 44 | 44 | 43 | 38 | |||||
| Total SNPs | 6263 | 704 | 1027 | 1654 | 1977 | |||||