| Literature DB >> 28326093 |
Sajjad Asaf1, Muhammad Waqas2, Abdul L Khan3, Muhammad A Khan1, Sang-Mo Kang1, Qari M Imran1, Raheem Shahzad1, Saqib Bilal1, Byung-Wook Yun1, In-Jung Lee1.
Abstract
Oryza minuta, a tetraploid wild relative of cultivated rice (family Poaceae), possesses a BBCC genome and contains genes that confer resistance to bacterial blight (BB) and white-backed (WBPH) and brown (BPH) plant hoppers. Based on the importance of this wild species, this study aimed to understand the phylogenetic relationships of O. minuta with other Oryza species through an in-depth analysis of the composition and diversity of the chloroplast (cp) genome. The analysis revealed a cp genome size of 135,094 bp with a typical quadripartite structure and consisting of a pair of inverted repeats separated by small and large single copies, 139 representative genes, and 419 randomly distributed microsatellites. The genomic organization, gene order, GC content and codon usage are similar to those of typical angiosperm cp genomes. Approximately 30 forward, 28 tandem and 20 palindromic repeats were detected in the O. minuta cp genome. Comparison of the complete O. minuta cp genome with another eleven Oryza species showed a high degree of sequence similarity and relatively high divergence of intergenic spacers. Phylogenetic analyses were conducted based on the complete genome sequence, 65 shared genes and matK gene showed same topologies and O. minuta forms a single clade with parental O. punctata. Thus, the complete O. minuta cp genome provides interesting insights and valuable information that can be used to identify related species and reconstruct its phylogeny.Entities:
Keywords: SSRs; codon usage; cp genome; phylogeny; repeat analysis; sequence divergence; wild rice (Oryza minuta)
Year: 2017 PMID: 28326093 PMCID: PMC5339285 DOI: 10.3389/fpls.2017.00304
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Summary of complete chloroplast genomes for twelve .
| Length (bp) | 81,074 | 80,974 | 80,544 | 80,594 | 80,512 | 80,594 | 80,952 | 80,684 | 80,621 | 80,604 | 80,595 | 80,612 |
| GC(%) | 37.07 | 37.1 | 37.12 | 37.11 | 37.09 | 37.1 | 37.1 | 37.1 | 37.05 | 37.1 | 37.1 | 37.1 |
| Length (%) | 59.95 | 59.9 | 59.8 | 59.9 | 59.8 | 59.9 | 60 | 59.9 | 59.8 | 59.9 | 59.8 | 59.8 |
| Length (bp) | 12,470 | 12,446 | 12,346 | 12,347 | 12,345 | 12,345 | 12,330 | 12,381 | 12,387 | 12,347 | 12,357 | 12,356 |
| GC(%) | 33.18 | 33.3 | 33.33 | 33.33 | 33.3 | 33.34 | 33.33 | 33.33 | 33.34 | 33.33 | 33.33 | 33.33 |
| Length (%) | 9.22 | 9.2 | 9.1 | 9.1 | 9.1 | 9.1 | 9.1 | 9.1 | 9.1 | 9.17 | 9.1 | 9.1 |
| Length (bp) | 20,840 | 20,836 | 20,802 | 20,802 | 20,795 | 20,795 | 20,813 | 20,804 | 20,797 | 20,803 | 20,807 | 20,807 |
| GC(%) | 44.33 | 44.3 | 44.35 | 44.35 | 44.3 | 44.3 | 44.3 | 44.3 | 44.4 | 44.4 | 44.33 | 44.33 |
| Length (%) | 15.4 | 15 | 15.4 | 15.4 | 15.4 | 15.4 | 15.4 | 15.4 | 15.4 | 15.4 | 15.4 | 15.4 |
| GC(%) | 38.95 | 39 | 39.1 | 39 | 39 | 39 | 39 | 39 | 39 | 39 | 39 | 39 |
| Length (%) | 135,224 | 135,094 | 134,494 | 134,544 | 134,448 | 134,525 | 134,911 | 134,674 | 134,604 | 134,558 | 134,567 | 134,583 |
O. aust, O. australiensis; O. min, O. minuta; O. niv, O. nivara; O. rufi, O. rufipogon; O. sat. ind, O. sativa indica; O.s. jap, O. sativa japonica; O. offi, O. officinalis; O. barth, O. barthii; O. punc, O. punctate; O. meri, O. meridionalis; O. long, O. longistaminata; O. glum, O. glumipatula.
Figure 1Gene map of the . Genes drawn inside the circle are transcribed clockwise, and those outside are transcribed counterclockwise. Genes belonging to different functional groups are color coded. The darker gray color in the inner circle corresponds to the GC content, and the lighter gray color corresponds to the AT content.
Genes in the sequenced .
| Self-replication | Large subunit of ribosomal proteins | |
| Small subunit of ribosomal proteins | ||
| DNA dependent RNA polymerase | ||
| rRNA genes | ||
| tRNA genes | ||
| Photosynthesis | Photosystem I | |
| Photosystem II | ||
| NadH oxidoreductase | ||
| Cytochrome b6/f complex | ||
| ATP synthase | ||
| Rubisco | ||
| Other genes | Translational initiation factor | |
| Maturase | ||
| Protease | ||
| Envelop membrane protein | ||
| Subunit Acetyl- CoA-Carboxylate | ||
| c-type cytochrome synthesis gene | ||
| Unknown | Conserved Open reading frames |
Comparison of coding and non-coding region sizes among twelve .
| Length (bp) | 59,700 | 61,062 | 68,598 | 56,133 | 61,464 | 66,444 | 59,433 | 59,385 | 62,964 | 55,329 | 59,499 | 59,496 |
| GC(%) | 39.3 | 39.5 | 39.7 | 39.3 | 39.5 | 39.6 | 39.4 | 39.4 | 39.3 | 39.1 | 39.3 | 39.3 |
| Length (%) | 44.1 | 45.1 | 51 | 41.7 | 45.7 | 49.3 | 44 | 44 | 59.8 | 41.1 | 44.2 | 44.2 |
| Length (bp) | 2,866 | 3,031 | 2,865 | 2,772 | 2,795 | 2,784 | 2,474 | 2,474 | 3,043 | 3,049 | 2,474 | 2,474 |
| GC(%) | 53.2 | 52.1 | 53 | 52.3 | 53 | 52.9 | 52.7 | 52.7 | 51.7 | 52.6 | 52.7 | 52.7 |
| Length (%) | 2.1 | 2.2 | 2.1 | 2 | 2 | 2 | 1.83 | 1.83 | 2.2 | 2.2 | 1.83 | 1.83 |
| Length (bp) | 9,190 | 9,190 | 9,190 | 9,190 | 9,190 | 9,182 | 9,190 | 9,190 | 9,190 | 9,190 | 9,190 | 9,190 |
| GC(%) | 54.8 | 54.8 | 54.8 | 54.8 | 54.8 | 54.7 | 54.8 | 54.8 | 54.8 | 54.8 | 54.8 | 54.8 |
| Length (%) | 6.7 | 6.8 | 6.8 | 6.8 | 6.8 | 6.8 | 6.8 | 6.8 | 6.8 | 6.8 | 6.8 | 6.8 |
| 63,468 | 61,811 | 53,841 | 66,449 | 60,999 | 56,115 | 63,814 | 63,625 | 59,407 | 66,990 | 63,404 | 63,423 | |
| GC(%) | 36 | 36 | 37 | 37 | 36 | 36 | 35 | 35 | 36 | 36 | 37 | 35 |
| Length (%) | 47 | 45.8 | 41 | 50 | 45.4 | 41.8 | 47.4 | 47.3 | 44.2 | 49.8 | 47.2 | 47.2 |
O. aust, O. australiensis; O. min, O. minuta; O. niv, O. nivara; O. rufi, O. rufipogon; O. sat. ind, O. sativa indica; O.s. jap, O. sativa japonica; O. offi, O. officinalis; O. barth, O. barthii; O. punc, O. punctata; O. meri, O. meridionalis; O. long, O. longistaminata; O. glum, O. glumipatula.
Base compositions in the .
| Genome | 30.4 | 19.4 | 30.7 | 19.6 | 135,094 |
| LSC | 31.6 | 18.3 | 31.3 | 18.8 | 80,974 |
| SSC | 30.8 | 17.3 | 35.9 | 16.0 | 12,446 |
| IR | 27.7 | 23.1 | 28 | 21.3 | 20,836 |
| tRNA | 23.5 | 26.1 | 24.3 | 26 | 3,031 |
| rRNA | 22.6 | 27.4 | 22.6 | 27.4 | 9,190 |
| Protein-coding genes | 29.9 | 19.5 | 30.5 | 20.0 | 60,948 |
| 1st position | 23.27 | 19.0 | 29.3 | 28.2 | 20,354 |
| 2nd position | 32.72 | 21.1 | 27.3 | 18.82 | 20,354 |
| 3rd position | 37.04 | 14.9 | 31.66 | 16.5 | 20,354 |
The codon–anticodon recognition pattern and codon usage for the .
| Phe | UUU | 733 | 1.28 | Ala | GCA | 378 | 1.18 | ||
| Phe | UUC | 407 | 0.7 | Ala | GCG | 160 | 0.5 | ||
| Leu | UUA | 710 | 1.9 | Tyr | UAU | 567 | 1.5 | ||
| Leu | UUG | 402 | 1.1 | Tyr | UAC | 176 | 0.47 | ||
| Leu | CUU | 473 | 1.29 | Stop | UAG | 22 | 0.74 | ||
| Leu | CUC | 165 | 0.4 | Stop | UGA | 24 | 0.80 | ||
| Leu | CUA | 319 | 0.87 | Stop | UAA | 43 | 1.44 | ||
| Leu | CUG | 120 | 0.32 | His | CAU | 351 | 1.49 | ||
| Ile | AUU | 820 | 1.51 | His | CAC | 119 | 0.50 | ||
| Ile | AUC | 323 | 0.5 | Gln | CAA | 521 | 1.53 | ||
| Ile | AUA | 485 | 0.89 | Gln | CAG | 167 | 0.49 | ||
| Met | AUG | 499 | 1 | Asn | AAU | 579 | 1.44 | ||
| Val | GUU | 450 | 1.50 | Asn | AAC | 222 | 0.55 | ||
| Val | GUC | 140 | 0.46 | Lys | AAA | 752 | 1.44 | ||
| Val | GUA | 442 | 1.47 | Lys | AAG | 291 | 0.55 | ||
| Val | GUG | 163 | 0.54 | Asp | GAU | 558 | 1.55 | ||
| Ser | UCU | 383 | 1.56 | Asp | GAC | 159 | 0.44 | ||
| Ser | UCC | 304 | 1.23 | Glu | GAA | 764 | 1.48 | ||
| Ser | UCA | 254 | 1.03 | Glu | GAG | 267 | 0.51 | ||
| Ser | UCG | 120 | 0.48 | Cys | UGU | 177 | 1.50 | ||
| Ser | AGU | 306 | 1.24 | Cys | UGC | 58 | 0.49 | ||
| Ser | AGC | 105 | 0.42 | Trp | UGG | 356 | 1 | ||
| Pro | CCU | 351 | 1.59 | Arg | CGU | 290 | 1.36 | ||
| Pro | CCC | 190 | 0.86 | Arg | CGC | 110 | 0.51 | ||
| Pro | CCA | 236 | 1.07 | Arg | CGA | 264 | 1.24 | ||
| Pro | CCG | 105 | 0.47 | Arg | CGG | 102 | 0.48 | ||
| Thr | ACU | 455 | 1.68 | Arg | AGA | 377 | 1.77 | ||
| Thr | ACC | 208 | 0.76 | Arg | AGG | 131 | 0.61 | ||
| Thr | ACA | 294 | 1.08 | Gly | GGU | 493 | 1.28 | ||
| Thr | ACG | 124 | 0.45 | Gly | GGC | 161 | 0.42 | ||
| Ala | GCU | 553 | 1.72 | Gly | GGA | 582 | 1.52 | ||
| Ala | GCC | 189 | 0.59 | Gly | GGG | 295 | 0.77 |
Figure 2Analysis of repeated sequences in twelve Total of three repeat types; (B) frequency of the palindromic repeat by length; (C) frequency of the tandem repeat by length; (D) frequency of forward repeat by length.
Figure 3Analysis of simple sequence repeats (SSRs) in twelve Number of different SSR types detected in twelve genomes; (B) frequency of identified SSR motifs in different repeat class types; (C) frequency of identified SSRs in coding regions; (D) frequency of identified SSRs in LSC, SSC and IR regions.
Figure 4Alignment of twelve chloroplast genome sequences. VISTA-based identity plot showing sequence identity among twelve Oryza species using O. minuta as a reference. The thick black line shows the inverted repeats (IRs) in the chloroplast genomes.
Figure 5Pairwise sequence distances of .
Figure 6Comparison of border distances between adjacent genes and junctions of LSC, SSC, and two IR regions among chloroplast genomes of twelve . Boxes above or below the main line indicate the adjacent border genes. The figure is not to scale with regard to sequence length and only shows relative changes at or near IR/SC borders.
Figure 7Phylogenetic trees were constructed for 14 species from the rice tribe using different methods, and two Bayesian trees are shown for data sets of the entire genome sequence and 65 shared genes. (A) The entire genome sequence data set (B). The data set of 65 shared genes. Each data set was used with four different methods, Bayesian inference (BI), maximum parsimony (MP), maximum likelihood (ML) and neighbor-joining (NJ). Numbers above the branches are the posterior probabilities of BI and bootstrap values of MP, ML, and NJ, respectively. Stars represent positions for O. minuta (KU179220) in the two trees.