| Literature DB >> 25485030 |
Marta Brozynska1, Ernnie Syafika Omar1, Agnelo Furtado1, Darren Crayn2, Bryan Simon1, Ryuji Ishikawa3, Robert James Henry1.
Abstract
Rice (Oryza sativa L.) was probably domesticated from O. rufipogon in Asia in the last 10,000 years. Relatives of cultivated rice (A genome species of Oryza) are found in South America, Africa, Australia and Asia. These A genome species are the close relatives of cultivated rice and represent the effective gene pool for rice improvement. Members of this group in Northern Australia include, an annual species, O. meridionalis, and two recently distinguished perennial taxa, to one of which the name O. rufipogon has been applied and the other a perennial form of O. meridionalis. Comparison of whole chloroplast genome sequences of these taxa has now been used to determine the relationships between the wild taxa and cultivated rice. The chloroplast genomes of the perennials were both found to be distinguished from O. rufipogon from Asia by 124 or 125 variations and were distinguished from each other by 53 variations. These populations have remained isolated from the overwhelming genetic impact of the large domesticated rice populations in Asia and may be unique descendants of the gene pool from which domesticated rice arose. The conservation of this wild genetic resource may be critical for global food security.Entities:
Keywords: Chloroplast sequence; Oryza; Phylogeny; Wild rice
Year: 2014 PMID: 25485030 PMCID: PMC4245483 DOI: 10.1007/s12042-014-9142-8
Source DB: PubMed Journal: Trop Plant Biol ISSN: 1935-9756 Impact factor: 1.512
Fig. 1Australian perennial wild rice (Taxon A) growing in its natural habitat in Northern Queensland, Australia. Open panicles and anther length (>3–7.4 mm) affirm its morphological resemblance to Oryza rufipogon
Summary of rice chloroplast sequences used in phylogenetic analysis and comparative study: their genome group, full chloroplast length, length without one inverted repeat (IR) sequence and GenBank accession number
| Rice species | Genome group | Full length chloroplast sequence (bp) | Sequence length without IR (bp) | Accession number |
|---|---|---|---|---|
| Taxon A | AA | 134,557 | 113,754 | KF428978 |
|
| AA | 134,551 | 113,749 | GU592207 |
|
| AA | 134,496 | 113,698 | AY522329 |
|
| AA | 134,558 | 113,755 | JN005831 |
| Taxon B | AA | 134,557 | 113,754 | JN005833 |
|
| AA | 134,544 | 113,743 | JN005832 |
|
| AA | 134,494 | 113,692 | AP006728 |
|
| EE | 134,549 | 113,749 | GU592209 |
IR, Inverted repeat
Fig. 2A chloroplast genome map of an Australian perennial wild rice (Species A). The inner circle indicates the boundaries between large single copy (LSC), inverted repeats (IRA and IRB) and small single copy (SSC). The dashed area shows the GC content of the genome and marks the 50 % threshold with additional grey line inside. Genes are transcribed either clockwise (those found outside the outer circle) or counter clockwise (inside the outer circle). Features which belong to diverse functional groups are marked with different colours
Fig. 4Analysis of coding and non-coding regions in chloroplast genomes. (a) Distribution of sequence types in Oryza Taxon A and (b) proportion of variants relative to Oryza Taxon A found in each of these regions
Fig. 3Phylogenetic relationships between chloroplast genomes of Oryza species. Nodal support is presented as precent bootstrap in MP/precent bootstrap in ML/Bayesian posterior probability. Scale bar is the number of substitutions per site
Distance matrix of rice species chloroplast genomes
The top panel represents number of variants between species (a variant corresponds to a single nucleotide- or a multi nucleotide-variant at any given position). The bottom panel represents number of nucleotide differences in the sequences (numbers of non-identical bases). The heatmap corresponds to individual values in a matrix as colours: red cells indicate close distance; orange, yellow and green cells show gradually greater distances
Total number and type of sequence variants in rice species chloroplast genomes in comparison with Taxon A
| Rice species | Variants number | Deletions | Insertions | MNV | SNPs (Ts/Tv) |
|---|---|---|---|---|---|
|
| 36 | 4 | 10 | 3 | 19 (8/11) |
| Taxon B | 53 | 6 | 9 | 5 | 33 (16/17) |
|
| 122 | 18 | 17 | 10 | 77 (39/38) |
|
| 125 | 18 | 14 | 7 | 86 (40/46) |
|
| 125 | 14 | 17 | 4 | 90 (47/43) |
|
| 143 | 21 | 20 | 10 | 92 (46/46) |
|
| 861 | 14 | 16 | 20 | 811 (518/293) |
MNV, Multi-nucleotide variant; Ts and Tv, Transitions and transversions, respectively
SNPs and frame-shift events in coding region of the chloroplast genomes of rice species in comparison with Taxon A
| Rice species | Synonymous SNPs | Non-synonymous SNPs | Frameshifts |
|---|---|---|---|
|
| 5 | 1 | 0 |
| Taxon B | 6 | 7 | 0 |
|
| 22 | 11 | 4 |
|
| 17 | 9 | 4 |
|
| 16 | 15 | 3 |
|
| 22 | 11 | 3 |
SNP, Single nucleotide polymorphism
Polymorphisms in chloroplast coding regions found to be specific to Australian or Asian clades or Taxon A
| Position | Australian clade | Asian clade | Taxon A | Gene | Protein effect |
|---|---|---|---|---|---|
|
| |||||
| 448 | G | A | G | psbA | * |
| 817 | G | A | G | psbA | * |
| 2,220 | G | T | G | matK | Ala → Glu |
| 3,067 | G | T | G | matK | Leu → Ile |
| 4,546 | A | T | A | rps16 | * |
| 9,321 | G | G |
| psbD | * |
| 24,164 | A | A |
| rpoC1 | Asn → Ser |
| 24,742 | T | G | T | rpoC2 | His → Gln |
| 25,000 | G | A | G | rpoC2 | * |
| 27,949 | C | A | C | rpoC2 | * |
| 29,885 | T | C | T | rps2 | * |
| 55,325 | G | A | G | rbcL | * |
|
| |||||
| 90,581 | G | T | G | ycf76 | * |
| 93,538 | C | del | C | ycf68 | frameshift |
|
| |||||
| 104,506 | T | T |
| rpl32 | * |
| 105,927 | C | A | C | ccsA | Ser → Tyr |
| 106,567 | G | T | G | ndhD | Ala → Glu |
| 106,710 | G | T | G | ndhD | * |
| 110,849 | G | A | G | ndhA | * |
| 113,255 | G | G |
| ndhH | * |
|
| |||||
| 121,627 | G | del | G | ycf68 | frameshift |
| 124,581 | C | A | C | ycf76 | * |
Positions refer to the positions in Taxon A chloroplast genome. Amino acid changes refer to a change with respect to Australian clade and Taxon A with the exception of position 24,164 (Ser is exclusive for Taxon A). Australian clade includes: O. meridionalis and Taxon B; Asian clade includes: O. sativa japonica and indica, O. rufipogon (Asian origin) and O. nivara. Nucleotides marked in bold and italics are found to be unique to Taxon A. LSC: long single copy; SSC: short single copy; IRA and IRB: inverted repeats; del: deletion, *: synonymous SNP
Predicted amino acid substitutions in chloroplast encoded proteins between Australian and Asian clades of Oryza
| Position | Australian clade | Asian clade | Taxon A | Gene | Uniqueness of substitution |
|---|---|---|---|---|---|
| 46 | Leu | Ile | Leu |
| Leu exclusive for Australian clade |
| 328 | Ala | Glu | Ala |
| known |
| 567 | Asn | Asn | Ser |
| Ser exclusive for Taxon A |
| 10 | His | Gln | His |
| Gln exclusive for Asian clade |
| 224 | Ser | Tyr | Ser |
| Ser exclusive for Australian clade |
| 452 | Ala | Glu | Ala |
| known |
Positions refer to the amino acids positions in given proteins from Taxon A; known – given substitution was found among other genes from chloroplast genomes of numerous plant species; exclusive for Australian/Asian clade – given substitution was found only in Australian or Asian clades, respectively; exclusive for Taxon A – given substitution was found only in Australian wild rice (Taxon A)