| Literature DB >> 24431589 |
Nagendra K Singh1, Deepak K Gupta1, Pawan K Jayaswal1, Ajay K Mahato1, Sutapa Dutta1, Sangeeta Singh1, Shefali Bhutani1, Vivek Dogra1, Bikram P Singh1, Giriraj Kumawat1, Jitendra K Pal1, Awadhesh Pandit1, Archana Singh1, Hukum Rawal1, Akhilesh Kumar1, G Rama Prashat1, Ambika Khare2, Rekha Yadav2, Ranjit S Raje2, Mahendra N Singh3, Subhojit Datta4, Bashasab Fakrudin5, Keshav B Wanjari6, Rekha Kansal1, Prasanta K Dash1, Pradeep K Jain1, Ramcharan Bhattacharya1, Kishor Gaikwad1, Trilochan Mohapatra1, R Srinivasan1, Tilak R Sharma1.
Abstract
Pigeonpea (Cajanus cajan) is an important grain legume of the Indian subcontinent, South-East Asia and East Africa. More than eighty five percent of the world pigeonpea is produced and consumed in India where it is a key crop for food and nutritional security of the people. Here we present the first draft of the genome sequence of a popular pigeonpea variety 'Asha'. The genome was assembled using long sequence reads of 454 GS-FLX sequencing chemistry with mean read lengths of >550 bp and >10-fold genome coverage, resulting in 510,809,477 bp of high quality sequence. Total 47,004 protein coding genes and 12,511 transposable elements related genes were predicted. We identified 1,213 disease resistance/defense response genes and 152 abiotic stress tolerance genes in the pigeonpea genome that make it a hardy crop. In comparison to soybean, pigeonpea has relatively fewer number of genes for lipid biosynthesis and larger number of genes for cellulose synthesis. The sequence contigs were arranged in to 59,681 scaffolds, which were anchored to eleven chromosomes of pigeonpea with 347 genic-SNP markers of an intra-species reference genetic map. Eleven pigeonpea chromosomes showed low but significant synteny with the twenty chromosomes of soybean. The genome sequence was used to identify large number of hypervariable 'Arhar' simple sequence repeat (HASSR) markers, 437 of which were experimentally validated for PCR amplification and high rate of polymorphism among pigeonpea varieties. These markers will be useful for fingerprinting and diversity analysis of pigeonpea germplasm and molecular breeding applications. This is the first plant genome sequence completed entirely through a network of Indian institutions led by the Indian Council of Agricultural Research and provides a valuable resource for the pigeonpea variety improvement.Entities:
Keywords: Disease resistance; Genome sequence; Legumes; Pigeonpea; SSR markers
Year: 2011 PMID: 24431589 PMCID: PMC3886394 DOI: 10.1007/s13562-011-0088-8
Source DB: PubMed Journal: J Plant Biochem Biotechnol ISSN: 0971-7811 Impact factor: 1.175
Fig. 1The whole plant and different parts of the pigeonpea cultivar ‘Asha’ (ICPL 87119). a whole plant at fruiting stage; b a defoliated branch with pods; c a branch with heavy flowering; d mature seeds; e dehusked split seeds or ‘Dal’; f 22 chromosomes in a root tip cell
Fig. 2Flow diagram of the strategy used for the decoding of pigeonpea genome sequence
Summary of gene prediction statistics in the genome sequence of pigeonpea variety ‘Asha’
| Description | Size/number |
|---|---|
| Size of the assembled genome sequence (bp) | 510,809,477 |
| Number of large sequence contigs | 191,705 |
| Number of protein coding genes | 47,004 |
| Number of TE-related genes | 12,511 |
| Largest gene size (bp) | 11,523 |
| Smallest gene Size (bp) | 501 |
| Average gene Size (bp) | 1,170 |
| Total number of exons | 233,560 |
| Largest exon size (bp) | 6,555 |
| Average exon size (bp) | 268 |
| Maximum number of exons in a gene | 54 |
| Total number of introns | 180,000 |
| Largest intron size (bp) | 4,884 |
| Average intron size (bp) | 288 |
Fig. 3Frequency of different categories of genes in the 511 Mb of pigeonpea genome sequence. Unknown category includes genes unique to pigeonpea and those showing matches with hypothetical category genes of other species
Fig. 4Frequency distribution of five main categories of resistance-like (R-like) and defense response (DR) genes predicted in the pigeonpea genome
Frequency of some major categories of genes in the pigeonpea genome in comparison to soybean genome
| Gene category | No. of genes Pigeonpeaa | No. of genes Soybeanb | Detailed Supplementary material |
|---|---|---|---|
| Disease resistance and defense response | 1213 | 1174 | Table S |
| Abiotic stress tolerance | 152 | 220 | Table S |
| Lipid metabolism | 269 | 536 | Table S |
| Cellulose synthase | 43 | 37 | Table S |
| Sugars and starch synthesis | 108 | 284 | Table S |
| Transcription factors | 1470 | 2300 | Table S |
ain 511 Mb of 858 Mb genome; bin 950 Mb of 1,115 Mb genome
Different types of repeat elements in the 511 Mb of pigeonpea genome sequence
| Repeat category | Number of elements | Sequence length (bp) | Percent of repeats |
|---|---|---|---|
|
| |||
| 1.1 Class I (Retro transposons) | 127,602 | 77,096,057 | 23.59 |
| LINE-L1 | 5,239 | 2,270,477 | 0.69 |
| LINE-R1 | 1,277 | 784,129 | 0.24 |
| LINE-RTE-BovB | 2,087 | 333,215 | 0.10 |
| LTR | 186 | 39,775 | 0.01 |
| LTR-Caulimovirus | 2,508 | 1,376,233 | 0.42 |
| LTR-Copia | 40,373 | 19,937,308 | 6.10 |
| LTR-Gypsy | 75,932 | 52,354,920 | 16.02 |
| 1.2 Class II (DNA transposons) | 21,212 | 9,772,250 | 2.99 |
| En-spm | 5,166 | 2,339,643 | 0.71 |
| Harbinger | 1,348 | 467,230 | 0.14 |
| hat-AC | 6,572 | 4,059,651 | 1.24 |
| hat-Tag1 | 1,806 | 586,556 | 0.17 |
| hat-Tip100 | 934 | 337,903 | 0.10 |
| MuDR | 4,980 | 1,830,967 | 0.56 |
| TcMAR-Pogo | 153 | 43,654 | 0.01 |
| RC/Hiltron | 253 | 106,646 | 0.03 |
| 1.3 Unclassified | 623,425 | 216,262,607 | 66.20 |
|
| 72,522 | 8,405,304 | 2.57 |
|
| 282,968 | 15,134,850 | 4.63 |
| Total | 1,127,729 | 326,671,068 | 99.98 |
Major classes of repeat elements (RE) in the pigeonpea genome in comparison to ten other sequenced plant genomes
| Repeat Category | Pigeon pea | Soybean | Apple | Brachypodium | Cucumber | Grape | Papaya | Potato | Rice | Sorghum | Maize |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Genome sequence (Mb) | 511 | 955 | 742 | 271 | 227 | 477 | 271 | 727 | 370 | 740 | 2045 |
| RE in genome (%) | 63.95 | 61.47 | 67.00 | 28.10 | 25.03 | 23.90 | 53.17 | 64.13 | 34.79 | 62.00 | 84.20 |
| 1. Interspersed repeats | |||||||||||
| 1.1 Class I (Retro transposons) | 23.6 | 68.7 | 55.72 | 83.01 | 48.55 | 85.8 | 82.72 | 50.44 | 55.60 | 87.9 | 89.77 |
| Line | 1.03 | 0.4 | 9.6 | 6.91 | 6.94 | – | 2.08 | 3.50 | 3.22 | 0.10 | 1.16 |
| Copia | 6.1 | 20.28 | 8.1 | 17.28 | – | 21.08 | 10.61 | – | 11.05 | 8.40 | 28.1 |
| Gypsy | 16.02 | 48.01 | 37.36 | 57.12 | – | 61.93 | 53.64 | – | 31.28 | 30.7 | 55.05 |
| 1.2 Class II (DNA transposons) | 2.99 | 26.83 | 1.31 | 16.98 | 4.94 | 6.26 | 0.39 | 6.14 | 37.25 | 12.00 | 10.22 |
| Hat super family | 1.52 | 0.06 | 0.41 | 0.84 | – | 3.58 | – | – | 1.08 | 0.02 | 1.35 |
| Harbinger | 0.14 | 0.47 | 0 | 1.49 | – | – | – | – | – | 0.02 | – |
| Helitron | 0.03 | 0.86 | 0 | 0.64 | – | – | – | – | – | 1.30 | 2.64 |
| 1.3 Unclassified | 66.2 | – | 35.35 | – | 46.49 | 2.99 | 16.44 | 43.40 | – | 1.17 | – |
| 2. Simple repeats | 2.57 | 0.75 | – | – | – | – | – | – | – | – | – |
| 3. Low complexity repeats | 4.63 | 1.77 | – | – | – | – | – | – | – | – | – |
Fig. 5Distribution of 100 copies of miRNA in 32 different family in peageonpea genome
Gene and repeat densities in the pigeonpea genome scaffolds anchored with 347 genetically mapped genic-SNP markers
| Chrom. No. | No. of Markers | Size of scaffolds (bp) | No. of genes | No. of genes per 50 kb | Size of repeats (bp) | Percent repeats in scaffolds |
|---|---|---|---|---|---|---|
| 1 | 40 | 797,775 | 122 | 7.65 | 182,690 | 22.90 |
| 2 | 40 | 763,938 | 103 | 6.74 | 1,74,751 | 22.88 |
| 3 | 64 | 1,078,018 | 136 | 6.31 | 264,330 | 24.42 |
| 4 | 49 | 1,404,117 | 194 | 6.91 | 274,870 | 19.58 |
| 5 | 27 | 431,074 | 60 | 6.96 | 108,700 | 25.22 |
| 6 | 19 | 442,848 | 58 | 6.55 | 108,148 | 24.42 |
| 7 | 27 | 768,188 | 101 | 6.57 | 167,903 | 21.86 |
| 8 | 18 | 338,877 | 51 | 7.52 | 85,621 | 25.27 |
| 9 | 28 | 640,548 | 99 | 7.73 | 150,894 | 23.56 |
| 10 | 23 | 583,427 | 82 | 7.03 | 152,508 | 26.14 |
| 11 | 12 | 175,561 | 35 | 9.97 | 27,072 | 15.42 |
| Total | 347 | 7,424,371 | 1,041 | 7.01 | 1,697,486 | 22.87 |
Fig. 6Density of genes and repeat elements (TE) in the 347 anchored scaffolds on the eleven chromosomes of pigeonpea. Blue bars on the left side of each chromosome represent RE percentage in the scaffold and orange bars on the right side represent gene density per 50 kb. Red segments at the end of orange bars represent number of TE-related genes in the scaffold. Discontinuous blue bars indicate RE density of >40% whereas discontinuous orange bars represent gene density in excess of >10 genes per 50 kb
Fig. 7Circular map of syntenic relationship between 11 pairs of pigeonpea chromosomes with 20 pairs of soybean chromosomes based on 512 single copy genes in the genetically anchored scaffolds of pigeonpea genome. The outer circles depict soybean chromosome bars showing proportion of gene matches with different chromosomes of pigeonpea and vice versa
Frequency of SSRs in the 511 Mb of pigeonpea genome sequence
| Type of SSR | Total no. of SSRs | Class I SSR (n ≥20 bp) | HASSRa (n ≥50 bp) |
|---|---|---|---|
| Mononucleotide | 1,00,373 | 987 | 0 |
| Dinucleotide | 49,325 | 18,000 | 203 |
| Trinucleotide | 18,505 | 5,822 | 515 |
| Tetranucleotide | 2,217 | 2,217 | 17 |
| Pentanucleotide | 512 | 512 | 15 |
| Hexanucleotide | 815 | 815 | 70 |
| Compound | 18,148 | 18,148 | 10,891 |
| Total | 189,895 | 46,501 | 11,711 |
aHypervariable “Arhar” SSR
Wet lab validation of the PCR amplification and polymorphism of 437 HASSR markers designed from pigeonpea genome sequence information
| SSR category | No. of loci | Poly-morphic | Mono-morphic | Unexpected size bands | Not amplified | % Poly morphism |
|---|---|---|---|---|---|---|
| Trinucleotide | 281 | 124 | 103 | 28 | 26 | 44.1 |
| Tetranucleotide | 10 | 5 | 3 | 2 | 0 | 50.0 |
| Pentanucleotide | 7 | 1 | 3 | 2 | 1 | 14.2 |
| Hexanucleotide | 16 | 8 | 6 | 2 | 0 | 50.0 |
| Complex | 123 | 8 | 97 | 6 | 12 | 6.5 |
| Total | 437 | 146 | 212 | 40 | 39 | 40.8 |
Fig. 8Agarose gel showing allelic variation in PCR product size with three different HASSR markers (HASSR27, HASSR28, HASSR27) in a set of eight pigeonpea varieties. 1 Asha, 2 UPAS 120, 3 HDM 04–1, 4 Pusa Dwarf, 5 H2004-1, 6 Bahar, 7 Maruti, 8 TTB7; M = 100 bp DNA size marker