| Literature DB >> 24578372 |
Hiroaki Sakai1, Hiroyuki Kanamori1, Yuko Arai-Kichise2, Mari Shibata-Hatta2, Kaworu Ebana1, Youko Oono1, Kanako Kurita1, Hiroko Fujisawa1, Satoshi Katagiri1, Yoshiyuki Mukai1, Masao Hamada1, Takeshi Itoh1, Takashi Matsumoto1, Yuichi Katayose1, Kyo Wakasa3, Masahiro Yano1, Jianzhong Wu4.
Abstract
Having a deep genetic structure evolved during its domestication and adaptation, the Asian cultivated rice (Oryza sativa) displays considerable physiological and morphological variations. Here, we describe deep whole-genome sequencing of the aus rice cultivar Kasalath by using the advanced next-generation sequencing (NGS) technologies to gain a better understanding of the sequence and structural changes among highly differentiated cultivars. The de novo assembled Kasalath sequences represented 91.1% (330.55 Mb) of the genome and contained 35 139 expressed loci annotated by RNA-Seq analysis. We detected 2 787 250 single-nucleotide polymorphisms (SNPs) and 7393 large insertion/deletion (indel) sites (>100 bp) between Kasalath and Nipponbare, and 2 216 251 SNPs and 3780 large indels between Kasalath and 93-11. Extensive comparison of the gene contents among these cultivars revealed similar rates of gene gain and loss. We detected at least 7.39 Mb of inserted sequences and 40.75 Mb of unmapped sequences in the Kasalath genome in comparison with the Nipponbare reference genome. Mapping of the publicly available NGS short reads from 50 rice accessions proved the necessity and the value of using the Kasalath whole-genome sequence as an additional reference to capture the sequence polymorphisms that cannot be discovered by using the Nipponbare sequence alone.Entities:
Keywords: Oryza sativa; SNPs and indels; comparative genomics; gain and loss of genes; genome re-sequencing
Mesh:
Year: 2014 PMID: 24578372 PMCID: PMC4131834 DOI: 10.1093/dnares/dsu006
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Statistics of de novo assembly and chromosomal mapping of Kasalath NGS reads to Nipponbare pseudomolecules
| No. of contigs | N50 (bp) | Maximum L (bp) | Mean L (bp) | Total L (bp) | |
|---|---|---|---|---|---|
| Mapped | 36 936 | 13 728 | 103 131 | 7847 | 289 796 664 |
| Unmapped | 14 822 | 3615 | 43 777 | 2749 | 40 748 813 |
L, length; N50, minimum length of contigs representing 50% of the assembly.
Statistics of SNPs and indels detected between Kasalath, Nipponbare, and 93-11 genomes
| Kasalath–Nipponbare | Kasalath–93-11 | |||
|---|---|---|---|---|
| SNPs | Large indels | SNPs | Large indels | |
| chr01 | 318 375 | 921 | 252 557 | 506 |
| chr02 | 253 524 | 651 | 184 485 | 383 |
| chr03 | 278 482 | 749 | 204 712 | 405 |
| chr04 | 233 426 | 617 | 212 291 | 329 |
| chr05 | 244 149 | 616 | 179 973 | 294 |
| chr06 | 241 274 | 648 | 210 155 | 373 |
| chr07 | 231 566 | 605 | 155 910 | 300 |
| chr08 | 214 710 | 535 | 166 666 | 255 |
| chr09 | 179 050 | 434 | 141 896 | 232 |
| chr10 | 193 491 | 531 | 149 212 | 241 |
| chr11 | 211 931 | 533 | 185 340 | 195 |
| chr12 | 187 272 | 553 | 173 054 | 267 |
| Total | 2 787 250 | 7 393 | 2 216 251 | 3780 |
Figure 1.Distribution pattern of SNPs detected between the genomes of Nipponbare and Kasalath cultivars.
Figure 2.Size distribution and sequence classification of large deletions in Kasalath in comparison with Nipponbare.
Figure 3.Nipponbare transcripts covered by Kasalath pseudomolecule sequences. The horizontal axis represents the sequence coverage (×100%) of each gene annotated on Nipponbare pseudomolecules.
Top 10 over-represented functional domains in the genes found in Kasalath but not in Nipponbare
| IPR000719 | Protein kinase, catalytic domain |
| IPR000767 | Disease resistance protein |
| IPR001245 | Serine-threonine/tyrosine-protein kinase catalytic domain |
| IPR001611 | Leucine-rich repeat |
| IPR002182 | NB-ARC |
| IPR008271 | Serine/threonine-protein kinase, active site |
| IPR011009 | Protein kinase-like domain |
| IPR013210 | Leucine-rich repeat-containing N-terminal, type 2 |
| IPR013320 | Concanavalin A-like lectin/glucanase, subgroup |
| IPR017441 | Protein kinase, ATP-binding site |
Figure 4.Rate of uniquely mapped NGS reads from 50 rice accessions by using Nipponbare, Kasalath, and 93-11 pseudomolecule sequences as references. Arabic numerals under the horizontal axis represent different accessions of cultivated and wild rice (see Supplementary Table S2 for details).