| Literature DB >> 22241453 |
Himabindu Kudapa1, Arvind K Bharti, Steven B Cannon, Andrew D Farmer, Benjamin Mulaosmanovic, Robin Kramer, Abhishek Bohra, Nathan T Weeks, John A Crow, Reetu Tuteja, Trushar Shah, Sutapa Dutta, Deepak K Gupta, Archana Singh, Kishor Gaikwad, Tilak R Sharma, Gregory D May, Nagendra K Singh, Rajeev K Varshney.
Abstract
A comprehensive transcriptome assembly for pigeonpea has been developed by analyzing 128.9 million short Illumina GA IIx single end reads, 2.19 million single end FLX/454 reads, and 18 353 Sanger expressed sequenced tags from more than 16 genotypes. The resultant transcriptome assembly, referred to as CcTA v2, comprised 21 434 transcript assembly contigs (TACs) with an N50 of 1510 bp, the largest one being ~8 kb. Of the 21 434 TACs, 16 622 (77.5%) could be mapped on to the soybean genome build 1.0.9 under fairly stringent alignment parameters. Based on knowledge of intron junctions, 10 009 primer pairs were designed from 5033 TACs for amplifying intron spanning regions (ISRs). By using in silico mapping of BAC-end-derived SSR loci of pigeonpea on the soybean genome as a reference, putative mapping positions at the chromosome level were predicted for 6284 ISR markers, covering all 11 pigeonpea chromosomes. A subset of 128 ISR markers were analyzed on a set of eight genotypes. While 116 markers were validated, 70 markers showed one to three alleles, with an average of 0.16 polymorphism information content (PIC) value. In summary, the CcTA v2 transcript assembly and ISR markers will serve as a useful resource to accelerate genetic research and breeding applications in pigeonpea.Entities:
Mesh:
Year: 2012 PMID: 22241453 PMCID: PMC3440007 DOI: 10.1093/mp/ssr111
Source DB: PubMed Journal: Mol Plant ISSN: 1674-2052 Impact factor: 13.164
Details on Datasets Used for Developing Comprehensive Transcriptome Assembly (CcTA v2).
| Dataset/sequencing platform | Genotype | Tissues | Source | Number of transcript reads |
|
| ||||
| Illumina GA IIx | ICPL 332 | SMD-challenged leaves | ICRISAT/NCGR | 16 361 115 |
| Illumina GA IIx | ICPW 94 | FW-challenged roots | ICRISAT/NCGR | 15 828 791 |
| Illumina GA IIx | ICPL 99050 | FW-challenged roots | ICRISAT/NCGR | 13 498 156 |
| Illumina GA IIx | ICP 7035 | SMD-challenged leaves | ICRISAT/NCGR | 13 223 516 |
| Illumina GA IIx | ICPB 2049 | FW-challenged roots | ICRISAT/NCGR | 11 494 670 |
| Illumina GA IIx | BSMR 736 | SMD-challenged leaves | ICRISAT/NCGR | 11 065 219 |
| Illumina GA IIx | ICP 28 | FW-challenged roots | ICRISAT/NCGR | 9 721 562 |
| Illumina GA IIx | ICPL 20096 | SMD-challenged leaves | ICRISAT/NCGR | 9 507 797 |
| Illumina GA IIx | ICPL 87091 | FW-challenged roots | ICRISAT/NCGR | 8 977 567 |
| Illumina GA IIx | TAT 10 | SMD-challenged leaves | ICRISAT/NCGR | 7 932 691 |
| Illumina GA IIx | TTB 7 | SMD-challenged leaves | ICRISAT/NCGR | 4 122 216 |
| Illumina GA IIx | Asha (ICPL 87119) | FW-challenged roots | ICRISAT/NCGR | 7 182 619 |
|
| ||||
| FLX/454 | Asha (ICPL 87119) | Pooled RNA from 4 tissues | NRCPB | 906 300 |
| FLX/454 | UPAS 120 | Pooled RNA from 4 tissues | NRCPB | 790 424 |
|
| ||||
| FLX/454 | PusaAgeti | Pooled RNA from 31 tissues | ICRISAT | 494 353 |
|
| ||||
| Sanger | ICPL 20102 | FW-challenged roots | ICRISAT | 3168 |
| Sanger | ICP 2376 | FW-challenged roots | ICRISAT | 2880 |
| Sanger | TTB 7 | SMD-challenged leaves | ICRISAT | 1920 |
| Sanger | ICP 7035 | SMD-challenged leaves | ICRISAT | 1920 |
| Sanger | Asha (ICPL 87119) and UPAS 120 | Root and shoot | NCBI dbEST | 8465 |
SMD, sterility mosaic disease; FW, Fusarium wilt, ICRISAT, International Crops Research Institute for the Semi-Arid Tropics; NCGR, National Center for Genome Resources; NRCPB, National Research Center on Plant Biotechnology; NCBI, National Centre for Biotechnology Information.
Comparative Analysis of Four Assemblies.
| CcTA v2 (this study) | CcTA v1 ( |
|
| |
| Sequence data used | 128.9 million Illumina short reads + 2.19 million FLX/454 reads + 18,353 Sanger ESTs | 494 353 FLX/454 reads and 10 817 Sanger ESTs | 1.696 million FLX/454 reads | 9888 Sanger ESTs |
| Number of genotypes providing sequence data | >16 | 5 | 2 | 4 |
| Program(s) used for assembly | ABySS and miraEST | CAP3 | Lasergene SeqMan Pro™ v8.0.12 | CAP3 |
| Total number of transcript assembly contigs | 21 434 | 48 726 | 43 324 | 4557 |
| N50 (bp) | 1510 | 287 | 1222 | 701 |
| Largest contig (bp) | 7909 | 2067 | 7783 | 3430 |
| Shortest contig (bp) | 85 | 52 | 32 | 46 |
Mapping of Pigeonpea Transcriptome Assembly TACs onto Soybean Genome Build 1.0.9.
| Soybean ( | Total unique CcTA v2 TACs hits | Genes covered on each | Total number of genes on each | Percent of genes covered on each |
| Gm01 | 1558 | 1200 | 2023 | 59.30 |
| Gm02 | 1985 | 1582 | 2652 | 59.70 |
| Gm03 | 1524 | 1244 | 2196 | 56.60 |
| Gm04 | 1656 | 1293 | 2088 | 61.90 |
| Gm05 | 1738 | 1556 | 2147 | 72.50 |
| Gm06 | 1960 | 1324 | 2686 | 49.30 |
| Gm07 | 1791 | 1378 | 2298 | 60.00 |
| Gm08 | 2392 | 1930 | 3130 | 61.70 |
| Gm09 | 1693 | 1324 | 2305 | 57.40 |
| Gm10 | 1861 | 1497 | 2479 | 60.40 |
| Gm11 | 1839 | 1455 | 2304 | 63.20 |
| Gm12 | 1453 | 1212 | 2016 | 60.10 |
| Gm13 | 2418 | 1893 | 3119 | 60.70 |
| Gm14 | 1359 | 1061 | 1825 | 58.10 |
| Gm15 | 1690 | 1288 | 2250 | 57.20 |
| Gm16 | 1172 | 999 | 1803 | 55.40 |
| Gm17 | 1757 | 1351 | 2252 | 60.00 |
| Gm18 | 1738 | 1374 | 2452 | 56.00 |
| Gm19 | 1608 | 1300 | 2177 | 59.70 |
| Gm20 | 1504 | 1229 | 1975 | 62.20 |
| Total | 27 490 | 46 177 | 59.50 |
Figure 1.A Sample View of Pigeonpea TACs, Markers, and Candidate ISR Markers onto Soybean Genome Sequence.
This image is from the SoyBase GBrowse viewer for soybean, at http://soybase.org/gb2/gbrowse, shows 200 kb from Gm10, starting from position 43 820 000. At a zoom level >2 Mbp, a heat map showing transcript density will be displayed. Assemblies for all 20 Gm chromosomes are displayed alongside regions of sequence homology in Cc. Red: there was at least one additional reported CcTA v2 alignment. Green: there were no other reported alignments.
Figure 2.Amplification Pattern of ISR Markers on MDE Gel.
From left to right lanes 1–8 for each marker: ICPL 332, ICPW 94, ICPL 99050, ICPB 2049, ICP 28, ICPL 20096, ICPL 87091, and ICPL 87119. Arrows indicate ladder bp fragments.
Distribution of ISR Markers on Pigeonpea Linkage Groups and Their Polymorphism Status.
| Pigeonpea LG | Total ISR markers showing mapping positions | Markers selected for analysis | Markers amplified | Markers showing polymorphism |
| CcLG01 | 333 | 10 | 8 | 5 |
| CcLG02 | 1778 | 20 | 17 | 12 |
| CcLG03 | 699 | 17 | 16 | 9 |
| CcLG04 | 763 | 10 | 7 | 6 |
| CcLG05 | 27 | – | – | – |
| CcLG06 | 257 | 8 | 8 | 6 |
| CcLG07 | 478 | 15 | 15 | 11 |
| CcLG08 | 394 | 5 | 5 | 4 |
| CcLG09 | 281 | 15 | 15 | 6 |
| CcLG10 | 1011 | 16 | 14 | 5 |
| CcLG11 | 263 | 12 | 11 | 6 |
| Total | 6284 | 128 | 116 | 70 |