| Literature DB >> 28854638 |
Khuat Huu Trung1, Truong Khoa Nguyen1, Hoang Bao Truc Khuat1, Thuy Diep Nguyen1, Tran Dang Khanh1, Tran Dang Xuan2, Xuan-Hung Nguyen3.
Abstract
Elucidation of the rice genome will not only broaden our understanding of genetic characterization of the agronomic characteristics but also facilitate the rice genetic improvement through marker assisted breeding. However, the genome resources of aromatic rice varieties are largely unexploited. Therefore, the whole genome of two elite aromatic traditional japonica rice landraces in North Vietnam, Tam Xoan Bac Ninh (TXBN), and Tam Xoan Hai Hau (TXHH), was sequenced to identify their genome-wide polymorphisms. Overall, we identified over 40,000 novel polymorphisms in each aromatic rice landrace. Although a discontinuous 8-bp deletion and an A/T SNP just upstream the 5-bp deletion in exon 7 of BADH2 gene were present in both rice landraces, the number of SNP high resolution regions of TXBN was six times higher than that of TXHH. Furthermore, several hot spot regions of novel SNPs and indels were found in both genomes, providing their potential gene pools related to aroma formation. The genomic information of two aromatic rice landraces described in this study will facilitate the identification of fragrance-related genes and the genetic improvement of rice.Entities:
Keywords: Japonica; Tam Xoan Bac Ninh; Tam Xoan Hai Hau; Vietnamese aromatic rice; genome sequencing; polymorphic islands
Mesh:
Year: 2017 PMID: 28854638 PMCID: PMC5533115 DOI: 10.1093/gbe/evx135
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Genome Features and Resources of TXBN and TXHH
| Genome Features/Resources | ||
|---|---|---|
| TXBN | TXHH | |
| NCBI bioproject ID | PRJNA384811 | |
| NCBI SRA accession No. | SRP105436 | |
| NCBI biosample ID | SAMN06848499 | SAMN06848500 |
| Total raw reads | 133,528,362 | 154,565,174 |
| Read length (bp) | 100 | 100 |
| Total clean reads | 118,282,878 | 131,916,386 |
| Clean reads percentage (%) | 77.47 | 78.18 |
| Clean reads mapped percentage (%) | 78.29 | 76.89 |
| Mapping depth (X) | 21.67 | 24.85 |
| Reference coverage (%) | 85.81 | 87.76 |
| Estimated genome size (Mb) | 312 | 319 |
| SNP | 1,330,823 | 1,160,772 |
| Intergenic | 1,014,897 | 872,046 |
| Genic | 315,926 | 288,726 |
| Intron | 181,622 | 165,595 |
| UTRs | 68,946 | 61,757 |
| CDS | 65,358 | 61,374 |
| Synonymous | 30,517 | 28,574 |
| Nonsynonymous | 34,841 | 32,800 |
| Indel | 291,206 | 290,883 |
| Intergenic | 223,335 | 223,102 |
| Genic | 67,871 | 67,781 |
| Intron | 48,841 | 48,777 |
| UTRs | 17,683 | 17,639 |
| CDS | 1,347 | 1,365 |
. 1.—Polymorphism comparisons between TXBN and TXHH rice cultivars. Tracks from inner to outer circles indicate the distribution of total SNP number (blue) with SNP high resolution regions highlighted (orange), unique SNP number (red), indel number (green) and unique indel number (orange) on all 12 rice chromosomes in the 100 kb sliding windows. The chromosome structure in the scale of Mb is in gray. For each track, the outer and inner layers indicate TXBN and TXHH data, respectively.