| Literature DB >> 20007682 |
S Tangphatsornruang1, D Sangsrakru, J Chanprasert, P Uthaipaisanwong, T Yoocha, N Jomchai, S Tragoonrung.
Abstract
Mungbean is an economically important crop which is grown principally for its protein-rich dry seeds. However, genomic research of mungbean has lagged behind other species in the Fabaceae family. Here, we reported the complete chloroplast (cp) genome sequence of mungbean obtained by the 454 pyrosequencing technology. The mungbean cp genome is 151 271 bp in length which includes a pair of inverted repeats (IRs) of 26 474 bp separated by a small single-copy region of 17 427 bp and a large single-copy region of 80 896 bp. The genome contains 108 unique genes and 19 of these genes are duplicated in the IR. Of these, 75 are predicted protein-coding genes, 4 ribosomal RNA genes and 29 tRNA genes. Relative to other plant cp genomes, we observed two distinct rearrangements: a 50-kb inversion between accD/rps16 and rbcL/trnK-UUU, and a 78-kb rearrangement between trnH/rpl14 and rps19/rps8. We detected sequence length polymorphism in the cp homopolymeric regions at the intra- and inter-specific levels in the Vigna species. Phylogenetic analysis demonstrated a close relationship between Vigna and Phaseolus in the phaseolinae subtribe and provided a strong support for a monophyletic group of the eurosid I.Entities:
Mesh:
Substances:
Year: 2009 PMID: 20007682 PMCID: PMC2818187 DOI: 10.1093/dnares/dsp025
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1Map of the V. radiata cp genome. The thick lines indicate the extent of the IRs (IRa and IRb) which separate the genome into SSC and LSC regions. Genes outside the map are transcribed clockwise and those inside the map are transcribed counter clockwise. Genes containing introns and pseudogenes are marked with * and #, respectively.
Genes present in the V. radiata cp genome
| Gene products |
|---|
| 1. Photosystem I: psaA, B, C, I, J, ycf3,a ycf4 |
| 2. Photosystem II: psbA, B, C, D, E, F, H, I, J, K, L, M, N, T, Z |
| 3. Cytochrome b6/f: petA, B,b D,b G, L, N |
| 4. ATP synthase: atpA, B, E, F,b H, I |
| 5. Rubisco: rbcL |
| 6. NADH oxidoreductase: ndhA,b B,b,c C, D, E, F, G, H, I, J, K |
| 7. Large subunit ribosomal proteins: rpl2,b,c 14, 16,b 20, 23,c 32, 36 |
| 8. Small subunit ribosomal proteins: rps2, 3, 4, 7,c 8, 11, 12,b–d 14, 15, 18, 19c |
| 9. RNAP: rpoA, rpoB, C1,b C2 |
| 10. Other proteins: accD, ccsA, cemA, clpP,a matK |
| 11. Proteins of unknown function: ycf1, 2c |
| 12. Ribosomal RNAs: rrn16,c 23,c 4.5,c 5c |
| 13. Transfer RNAs: trnA(UGC),b,c C(GCA), D(GUC), E(UUC), F(GAA), G(UCC), H(GUG), I(CAU),c I(GAU),b,c K(UUU),b L(CAA),c L(UAA),b L(UAG), fM(CAU), M(CAU), N(GUU),c P(UGG), Q(UUG), R(ACG),c R(UCU), S(GCU), S(GGA), S(UGA), T(GGU), T(UGU), V(GAC),c V(UAC),b W(CCA), Y(GUA) |
aGene containing two introns.
bGene containing a single intron.
cTwo gene copies in the IRs.
dGene divided into two independent transcription units.
The codon–anticodon recognition pattern and codon usage for L. japonicus cp genome
| Phe | UUU | 1125 | Ser | UCU | 588 | Tyr | UAU | 843 | Cys | UGU | 220 | ||||
| Phe | UUC | 517 | Ser | UCC | 299 | Tyr | UAC | 158 | Cys | UGC | 82 | ||||
| Leu | UUA | 929 | Ser | UCA | 438 | stop | UAA | 53 | stop | UGA | 15 | ||||
| Leu | UUG | 548 | Ser | UCG | 174 | stop | UAG | 20 | Trp | UGG | 455 | ||||
| Leu | CUU | 583 | Pro | CCU | 417 | His | CAU | 496 | Arg | CGU | 342 | ||||
| Leu | CUC | 159 | Pro | CCC | 181 | His | CAC | 122 | Arg | CGC | 79 | ||||
| Leu | CUA | 389 | Pro | CCA | 316 | Gln | CAA | 756 | Arg | CGA | 351 | ||||
| Leu | CUG | 149 | Pro | CCG | 128 | Gln | CAG | 201 | Arg | CGG | 88 | ||||
| Ile | AUU | 1175 | Thr | ACU | 572 | Asn | AAU | 1057 | Ser | AGU | 401 | ||||
| Ile | AUC | 401 | Thr | ACC | 199 | Asn | AAC | 280 | Ser | AGC | 110 | ||||
| Ile | AUA | 829 | Thr | ACA | 433 | Lys | AAA | 1243 | Arg | AGA | 471 | ||||
| Met | AUG | 585 | Thr | ACG | 126 | Lys | AAG | 332 | Arg | AGG | 150 | ||||
| Val | GUU | 528 | Ala | GCU | 619 | Asp | GAU | 828 | Gly | GGU | 594 | ||||
| Val | GUC | 127 | Ala | GCC | 189 | Asp | GAC | 196 | Gly | GGC | 157 | ||||
| Val | GUA | 518 | Ala | GCA | 381 | Glu | GAA | 1017 | Gly | GGA | 709 | ||||
| Val | GUG | 168 | Ala | GCG | 106 | Glu | GAG | 298 | Gly | GGG | 254 |
Numerals indicate the frequency of usage of each codon in 26 274 codons in 82 potential protein-coding genes.
Figure 2Comparison of legume cp genomes with Arabidopsis cp DNA as a reference using MAUVE. The boxes above the line represent DNA sequences in clockwise direction and those below the line represent DNA sequences in the anticlockwise direction. The gene names at the bottom indicate the boundaries of the boxes within the mungbean cp DNA.
Figure 3A representative microsatellite locus (Locus 113; forward primer 5′-GAA ACC CTT CCT GAA AAA TCC-3′ and reverse primer 5′-TCT TTG ACG AAT GCA AGT GG-3′) with polymorphism among Vigna species: V. radiata accessions KPS1, H262, H337, H412, H417, V. mungo accession Subsamotod, V. umbellata accession JP99485, V. unguiculata accessions VU210 and TVNU294. PCR products were separated on 5% polyacrylamide gel electrophoresis and visualized by silver staining.
A list of repeated sequences and their locations identified using REPuter with n≥ 30 bp, and a sequence identity ≥90% in the V. radiata cp genome
| Number | Size (bp) | Repeat | Location |
|---|---|---|---|
| 1 | 42 | D | rpl16 (intron):IGS (trnV-GAC and rps12_3end) |
| 2 | 42 | IR | rpl16 (intron): IGS rps12_3end and trnV-GAC |
| 3 | 50 | IR | rpl16 (intron): ndhA (intron) |
| 4 | 37 | D | rpl16 (intron):ycf3 (intron) |
| 5 | 40 | IR | IGS (trnK-UUU and rbcL) |
| 6 | 81 | D | IGS (trnK-UUU and rbcL) |
| 7 | 40 | D | IGS (ndhJ and trnF-GAA) |
| 8 | 30 | D | IGS (ndhJ and trnF-GAA), IGS (psbM and petN) |
| 9 | 31 | IR | trnT-UGU, IGS (psbD and trnT-GGU) |
| 10 | 31 | IR | trnS-GGA, trnS-GCU |
| 11 | 32 | IR | ycf3 (intron) |
| 12 | 37 | IR | ycf3 (intron): ndhA (intron) |
| 13 | 30 | IR | IGS (trnG-UCC and psbZ) |
| 14 | 31 | D | IGS (trnG-UCC and psbZ), IGS (petN and trnC-GCA) |
| 15 | 31 | D | trnS-UGA, trnS-GCU |
| 16 | 41 | IR | IGS (trnG-UCC and psbZ) |
| 17 | 36 | IR | IGS (psbD and trnT-GGU) |
| 18 | 50 | IR | IGS (psbD and trnT-GGU) |
| 19 | 34 | D | IGS (trnC-GCA and rpoB) |
| 20 | 30 | D | IGS (atpI and atpF), IGS (trnL-CAA and ndhB) |
| 21 | 30 | IR | IGS (atpI and atpH), IGS (ndhB and trnL-CAA) |
| 22 | 30 | IR | IGS (trnR-UCU and trnS-GCU) |
| 23 | 30 | IR | IGS (aacD and psaI) |
| 24 | 34 | D | IGS (trnW-CCA and trnP-UGG) |
| 25 | 41 | IR | IGS (psaJ and rpl33_pseudo) |
| 26 | 50 | IR | IGS (psaJ and rpl33_pseudo) |
| 27 | 80 | D | IGS (rps8 and rps19) |
| 28 | 80 | IR | IGS (rps8 and rps19), IGS (rps19 and rps3) |
| 29 | 287 | D | ycf2_pseudo, ycf2 |
| 30 | 287 | IR | ycf2_pseudo, ycf2 |
| 31 | 287 | IR | ycf2, ycf2_pseudo |
| 32 | 30 | D | ycf2 |
| 33 | 30 | IR | ycf2 |
| 34 | 30 | IR | ycf2 |
| 35 | 30 | D | IGS (trnL-CAA and ndhB) |
| 36 | 40 | D | IGS (rps12_3end and trnV-GAC), ndhA (intron) |
| 37 | 30 | D | IGS (rrn16 and trnI-GAU), IGS (trnI-GAU and rrn16) |
| 38 | 30 | IR | IGS (rrn16 and trnI-GAU) |
| 39 | 30 | D | IGS rrn16 and trnI-GAU: IGS trnI-GAU and rrn16 |
| 40 | 43 | D | IGS rrn5 and trnR-ACG: IGS trnR-ACG and rrn5 |
| 41 | 43 | IR | IGS (rrn5 and trnR-ACG) |
| 42 | 31 | IR | ndhF |
| 43 | 40 | IR | ndhA (intron): IGS (trnV-GAC and rps12_3end) |
| 44 | 50 | IR | IGS (rps15 and ycf1) |
| 45 | 43 | IR | rrn5 |
| 46 | 30 | IR | IGS (trnI-GAU and rrn16) |
| 47 | 30 | D | IGS ndhB and trnL-CAA |
| 48 | 52 | D | ycf2 |
| 49 | 287 | D | ycf2:ycf2_pseudo |
| 50 | 80 | D | IGS rps19 and rps3 |
D, direct repeat; IR, inverted repeat; IGS, intergenic space.
Figure 4The MP phylogenetic tree is based on 25 protein-coding genes from 34 plant taxa. The MP tree has a length of 29 081 with a consistency index of 0.4939 and a retention index of 0.6399. Numbers above node are bootstrap support values. Ordinal and higher level group names are also indicated. The ML tree has the same topology but is not shown.