| Literature DB >> 27889940 |
Marta Brozynska1, Dario Copetti2,3, Agnelo Furtado1, Rod A Wing2,3, Darren Crayn4, Glen Fox5, Ryuji Ishikawa6, Robert J Henry1.
Abstract
The related A genome species of the Oryza genus are the effective gene pool for rice. Here, we report draft genomes for two Australian wild A genome taxa: O. rufipogon-like population, referred to as Taxon A, and O. meridionalis-like population, referred to as Taxon B. These two taxa were sequenced and assembled by integration of short- and long-read next-generation sequencing (NGS) data to create a genomic platform for a wider rice gene pool. Here, we report that, despite the distinct chloroplast genome, the nuclear genome of the Australian Taxon A has a sequence that is much closer to that of domesticated rice (O. sativa) than to the other Australian wild populations. Analysis of 4643 genes in the A genome clade showed that the Australian annual, O. meridionalis, and related perennial taxa have the most divergent (around 3 million years) genome sequences relative to domesticated rice. A test for admixture showed possible introgression into the Australian Taxon A (diverged around 1.6 million years ago) especially from the wild indica/O. nivara clade in Asia. These results demonstrate that northern Australia may be the centre of diversity of the A genome Oryza and suggest the possibility that this might also be the centre of origin of this group and represent an important resource for rice improvement.Entities:
Keywords: zzm321990Oryzazzm321990; assembly; molecular clock; phylogeny; sequencing; wild rice
Mesh:
Substances:
Year: 2017 PMID: 27889940 PMCID: PMC5425390 DOI: 10.1111/pbi.12674
Source DB: PubMed Journal: Plant Biotechnol J ISSN: 1467-7644 Impact factor: 9.803
Figure 1Australian perennial A genome taxa from northern Australia. Taxon A is characterized by open panicles, while Taxon B has closed panicles.
Figure 2Phylogenetic relationships between A genome rice species; (a) tree topology based upon analysis of supermatrix of 4643 nuclear genes; (b) tree topology based on whole chloroplast genome sequences. Figure adapted and modified from Wambugu et al. (2015). Taxa marked in green represent Asian rice species, in blue: African, in orange: South American and in red: Australian. L. perrieri and O. punctata were used as outgroups in nuclear and chloroplast studies, respectively.
Taxon A and Taxon B hybrid and PacBio assembly statistics. The metrics were calculated for scaffolds and contigs for hybrid assembly and for scaffolds only for PacBio assembly
| Taxon A | Taxon B | |||
|---|---|---|---|---|
| Assembly | Hybrid | PacBio‐only | Hybrid | PacBio‐only |
| Assembler | Sparse Assembler + DBG2OLC | Celera Assembler | Sparse Assembler + DBG2OLC | Celera Assembler |
| Scaffolds | ||||
| Number of scaffolds | 3359 | 2585 | 4718 | 3252 |
| Total size of scaffolds | 382 655 312 | 384 759 810 | 446 369 637 | 354 906 376 |
| Total scaffold length as percentage of known genome size | 98.1 | 98.7 | 120.6 | 95.9 |
| Longest scaffold | 1 305 248 | 1 692 155 | 2 079 733 | 3 232 522 |
| Shortest scaffold | 2297 | 9523 | 2425 | 12 563 |
| Mean scaffold size | 113 919 | 148 843 | 94 610 | 109 135 |
| Median scaffold size | 61 996 | 97 803 | 54 787 | 61 207 |
| N50 scaffold length | 217 336 | 219 409 | 163 003 | 159 640 |
| Contigs | ||||
| Number of contigs | 3425 | – | 4808 | – |
| Total size of contigs | 382 644 322 | – | 446 351 110 | – |
| Longest contig | 1 158 569 | – | 1 449 836 | – |
| Shortest contig | 1139 | – | 790 | – |
| Mean contig size | 111 721 | – | 92 835 | – |
| Median contig size | 61 459 | – | 54 495 | – |
| N50 contig length | 211 599 | – | 159 759 | – |
Sequence similarities between rice taxa in the supermatrix used for phylogenetic inference
Figure 3Molecular clock analyses for A genome rice evolution. The most frequent tree topology retrieved in analyses of alignments of separate chromosomes, inferred for 8 out of 12 chromosomes. Scale axis represents age in million years (mya). Node bars display 95% highest posterior density (HDP) interval.
Figure 4Results of four‐taxon test for Taxon A and selected Oryza species; (a) D‐statistics and Z‐scores calculated for two sets of selected species per chromosome: (((O. rufipogon, O. barthii), Taxon A), O. punctata) and (((O. rufipogon, O. nivara), Taxon A), O. punctata). Z‐scores marked with asterisk (*) indicate statistically significant values; (b) four‐taxon tree used in the first test. Bidirectional arrow shows inferred introgression for chromosomes with significant D‐statistics (listed below the tree); (c) four‐taxon tree used in the second test. Bidirectional arrows show inferred introgression for chromosomes with significant D‐statistics (listed below the tree).