| Literature DB >> 28018385 |
Tanvi Kaila1, Pavan K Chaduvla2, Swati Saxena2, Kaushlendra Bahadur2, Santosh J Gahukar3, Ashok Chaudhury4, T R Sharma2, N K Singh2, Kishor Gaikwad2.
Abstract
Pigeonpea (Cajanus cajan (L.) Millspaugh), a diploid (2n = 22) legume crop with a genome size of 852 Mbp, serves as an important source of human dietary protein especially in South East Asian and African regions. In this study, the draft chloroplast genomes of Cajanus cajan and Cajanus scarabaeoides (L.) Thouars were generated. Cajanus scarabaeoides is an important species of the Cajanus gene pool and has also been used for developing promising CMS system by different groups. A male sterile genotype harboring the C. scarabaeoides cytoplasm was used for sequencing the plastid genome. The cp genome of C. cajan is 152,242bp long, having a quadripartite structure with LSC of 83,455 bp and SSC of 17,871 bp separated by IRs of 25,398 bp. Similarly, the cp genome of C. scarabaeoides is 152,201bp long, having a quadripartite structure in which IRs of 25,402 bp length separates 83,423 bp of LSC and 17,854 bp of SSC. The pigeonpea cp genome contains 116 unique genes, including 30 tRNA, 4 rRNA, 78 predicted protein coding genes and 5 pseudogenes. A 50 kb inversion was observed in the LSC region of pigeonpea cp genome, consistent with other legumes. Comparison of cp genome with other legumes revealed the contraction of IR boundaries due to the absence of rps19 gene in the IR region. Chloroplast SSRs were mined and a total of 280 and 292 cpSSRs were identified in C. scarabaeoides and C. cajan respectively. RNA editing was observed at 37 sites in both C. scarabaeoides and C. cajan, with maximum occurrence in the ndh genes. The pigeonpea cp genome sequence would be beneficial in providing informative molecular markers which can be utilized for genetic diversity analysis and aid in understanding the plant systematics studies among major grain legumes.Entities:
Keywords: Cajanus cajan; Cajanus scarabaeoides; RNA editing; Roche 454 sequencing; chloroplast genome
Year: 2016 PMID: 28018385 PMCID: PMC5145887 DOI: 10.3389/fpls.2016.01847
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Map of . Genes shown on the outside of the map are transcribed clockwise while the genes that are shown on the inside are transcribed counterclockwise. The innermost darker gray corresponds to GC content, whereas the lighter gray corresponds to AT content. Different genes are color coded. IR, inverted repeat; LSC, large single copy region; SSC, small single copy region. Pseudogenes are marked with “*.”
Figure 2Map of . Genes shown on the outside of the map are transcribed clockwise while the genes that are shown on the inside are transcribed counterclockwise. The innermost darker gray corresponds to GC content, whereas the lighter gray corresponds to AT content. Different genes are color coded. IR, inverted repeat; LSC, large single copy region; SSC, small single copy region. Pseudogenes are marked with “*.”
List of genes present in the cp genome of .
| Photosystem I | |
| Photosystem II | |
| Cytochrome b6/f | |
| ATP Synthase | |
| Rubisco | |
| NADH Oxidoreductase | |
| Large subunit ribosomal proteins | |
| Small subunit ribosomal proteins | |
| RNAP | |
| Other Proteins | |
| Proteins of unknown Function | |
| Ribosomal RNAs | |
| Transfer RNAs |
Gene containing two introns.
Gene containing a single intron.
Two gene copies in the IRs.
Gene divided into two independent transcription units.
Pseudogenes.
Features of the chloroplast genome of .
| Genome | 33 | 33 | 17 | 17 | 33 | 33 | 18 | 18 | 152,201 | 152,242 | 66 | 66 |
| LSC | 34 | 34 | 16 | 16 | 34 | 34 | 17 | 17 | 83,423 | 83,455 | 68 | 68 |
| SSC | 36 | 36 | 13 | 13 | 36 | 36 | 15 | 15 | 17,854 | 17,871 | 72 | 72 |
| IR | 29 | 29 | 20 | 20 | 29 | 29 | 22 | 22 | 25,402 | 25,398 | 58 | 58 |
| Prt.Coding genes | 32 | 32 | 17 | 17 | 31 | 31 | 19 | 19 | 79,052 | 75,031 | 63 | 63 |
| tRNA | 26 | 25 | 23 | 23 | 23 | 23 | 28 | 29 | 2888 | 2888 | 48 | 48 |
| rRNA | 19 | 19 | 23 | 23 | 26 | 26 | 32 | 32 | 9054 | 9054 | 45 | 45 |
| First position | 32.1 | 32.1 | 17.5 | 17.5 | 31.8 | 31.8 | 18.3 | 18.3 | 50,747 | 50,747 | 64 | 64 |
| Second position | 33.02 | 33.02 | 17.23 | 17.23 | 32.77 | 32.77 | 16.96 | 16.96 | 50,747 | 50,747 | 65.79 | 65.79 |
| Third position | 32.3 | 32.3 | 16.6 | 16.6 | 32.9 | 32.9 | 18.1 | 18.1 | 50,747 | 50,747 | 64.5 | 64.5 |
Figure 3Gene order comparison of legume cp genomes, with Arabidopsis cp genome as reference, using MAUVE software. The boxes above the line represent the gene sequence in clockwise direction and the boxes below the line represent gene sequences in opposite orientation. The gene names at the bottom indicate the genes located at the boundaries of the boxes in cp genome of pigeonpea. AKPA1- C. scarabaeoides, AKPR375- C. cajan.
Figure 4Sequence alignment of legume cp genomes, with . Position and transcriptional direction of each gene is indicated by gray arrows. Intergenic and genic regions are indicated by red and blue areas respectively. Sequence identity between the cp genomes is shown on y-axis as a percentage between 50 and 100%. AKPA1- C. scarabaeoides, AKPR375- C. cajan.
Figure 5Comparison of the border positions of LSC, SSC, and IR regions among the legume genomes. Genes are denoted by boxes and the gaps between the genes and the boundaries are indicated by number of bases unless the gene coincides with the boundary. Extensions of the genes are also indicated above the boxes. AKPA1- C. scarabaeoides, AKPR375- C. cajan.
Figure 6Repeat distribution among three different regions: coding sequences, intronic sequences, and intergenic spacer regions (A) AKPA1 (C. scarabaeoides); (B) AKPR375 (C. cajan).
Figure 7SSR distribution on the basis of repeat type.
Figure 8SSR type distribution between coding and non-coding regions (A) AKPA1 (C. scarabaeoides); (B) AKPR375 (C. cajan).