Rice SNP-seek database update: new SNPs, indels, and queries.

Locedie Mansueto¹, Roven Rommel Fuentes¹, Frances Nikki Borja¹, Jeffery Detras¹, Juan Miguel Abriol-Santos¹, Dmytro Chebotarov¹, Millicent Sanciangco¹, Kevin Palis^1,2, Dario Copetti³, Alexandre Poliakov^4,5, Inna Dubchak^4,5, Victor Solovyev⁶, Rod A Wing^1,3, Ruaraidh Sackville Hamilton¹, Ramil Mauleon¹, Kenneth L McNally¹, Nickolai Alexandrov⁷.

Abstract

We describe updates to the Rice SNP-Seek Database since its first release. We ran a new SNP-calling pipeline followed by filtering that resulted in complete, base, filtered and core SNP datasets. Besides the Nipponbare reference genome, the pipeline was run on genome assemblies of IR 64, 93-11, DJ 123 and Kasalath. New genotype query and display features are added for reference assemblies, SNP datasets and indels. JBrowse now displays BAM, VCF and other annotation tracks, the additional genome assemblies and an embedded VISTA genome comparison viewer. Middleware is redesigned for improved performance by using a hybrid of HDF5 and RDMS for genotype storage. Query modules for genotypes, varieties and genes are improved to handle various constraints. An integrated list manager allows the user to pass query parameters for further analysis. The SNP Annotator adds traits, ontology terms, effects and interactions to markers in a list. Web-service calls were implemented to access most data. These features enable seamless querying of SNP-Seek across various biological entities, a step toward semi-automated gene-trait association discovery. URL: http://snp-seek.irri.org.

Year: 2016 PMID: 27899667 PMCID: PMC5210592 DOI: 10.1093/nar/gkw1135

Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971

INTRODUCTION

Genomic data play increasingly important roles in plant breeding by helping to discover new gene-trait associations and to understand how nucleotide variations are translated into phenotypic diversity of plants. While several other databases curate rice nucleotide variants, e.g. dbSNP at NCBI (1), Gramene (2), RiceVarMap (3), IC4R (4), RMBreeding (5), the SNP-Seek database features interactive real-time visualization of millions of SNPs in thousands of rice varieties, making SNP-Seek a unique tool for allele mining (6). Since the first release of Rice SNP-Seek Database (7), we have undertaken considerable development to incorporate new analysis results, datasets, viewers and query interfaces for multiple reference genomes and assemblies. We have also included features requested by users that will be useful to the broader rice research community.

NEW SNP SETS

We envision SNP-Seek not only hosting data from our own projects but also serving as a repository for rice variant data in the public domain. Consequently, we redesigned the software architecture to handle multiple datasets and reference genomes. Further enhancements allow the display and analysis of other variant types, such as indels and genetic markers, and add tools for analysis.

SNP and variety sets

The middleware of the SNP-Seek application was redesigned to handle multiple datasets and data formats, so the same user interface can display various SNP datasets and formats. Additional and updated analyses of the 3k varieties from the 3k Rice Genome Project (8) resulted in five SNP sets: All (32M positions), All biallelic (29M), Base (18M), Filtered (4.8M) and Core (404k). The details of these sets are described on the Download page (http://snp-seek.irri.org/_download.zul). The Filtered SNP set is the default dataset. We also imported the HDRA (9) data into SNP-Seek. With various SNP datasets available in a single interface, we can now efficiently query combined genotypes from these data. The current options are to query either or both of the 3k and HDRA varieties; if both are selected, the SNP positions may be the union or the intersection of the two datasets. Phenotype data from the International Rice Genebank Collection database (Genetic Resource Information Management System, GRIMS) (10) are also linked to SNP-Seek. When a new genotyping dataset is added to our database and the varieties can be traced to genetic stocks or source accessions in GRIMS, phenotype data for the genetic stocks or legacy data for the source accessions are immediately available, adding value to the dataset.
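The union and intersection options for combined 3k and HDRA queries can be illustrated with a minimal Python sketch. The function name, the `(chromosome, position)` tuple layout and the example positions are assumptions made for illustration only; they are not taken from the SNP-Seek code base.

```python
def combine_positions(positions_a, positions_b, mode="union"):
    """Combine SNP positions from two datasets.

    Each position is a (chromosome, position) tuple. The result is sorted
    by chromosome name and then by position.
    """
    a, b = set(positions_a), set(positions_b)
    if mode == "union":
        combined = a | b          # positions present in either dataset
    elif mode == "intersection":
        combined = a & b          # positions present in both datasets
    else:
        raise ValueError("mode must be 'union' or 'intersection'")
    return sorted(combined)


# Made-up example positions for the two datasets.
snps_3k = [("chr01", 1203), ("chr01", 1450), ("chr01", 2011)]
snps_hdra = [("chr01", 1450), ("chr01", 1988), ("chr01", 2011)]

shared = combine_positions(snps_3k, snps_hdra, mode="intersection")
either = combine_positions(snps_3k, snps_hdra, mode="union")
```

With the intersection, every returned position has a call in both datasets; with the union, varieties from one dataset would show missing calls at positions found only in the other.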

SNPs from multiple O. sativa assemblies

In addition to Nipponbare (japonica), the SNP-calling pipeline (11) was run on four sequenced rice genomes representing indica and aus, two of the major rice subpopulations: IR 64 (indica) and DJ 123 (aus) (12), Kasalath (aus) (13) and 93-11 (indica) (14). To avoid including redundant SNP positions, a custom pipeline was run on a path alignment of the five genomes computed from the pair-wise alignments between the five reference genomes. New SNPs for regions unique to each of the other genomes were sequentially added, with the union of SNP calls loaded into SNP-Seek. We used VISTA (15) for the genome comparison. Results of the pairwise genome alignments are viewable in the VISTA browser accessible through the SNP-Seek menu. Our comprehensive all-against-all pair-wise alignments showed that all five genomes are highly similar to one another. However, a significant fraction of each genome sequence was found to be unique, namely 8.59% of Nipponbare, 5.35% of 93-11, 4.11% of IR 64, 19.85% of Kasalath and 3.56% of DJ 123 (Supplementary Table S1). Reference genome-specific regions may be informative for the discovery of novel variants, since variants in these unique genomic regions (which represent an additional 12 Mb to 79 Mb, Supplementary Table S1) would not be detected in accessions aligned to reference genomes that are too distant. About 11 million additional SNPs and ∼0.5 million indels were discovered from the additional reference genomes (Supplementary Table S2). In a genotype query result, a position in the genotype table is based on the selected reference genome. The chromosome/contig and locus choices also depend on the selected reference. Selecting the ‘Show all reference alleles’ option displays the alleles for all reference genomes. A gap in the allele means the position was not found in that genome and has possibly been deleted (shown in Figure 1). The location of the queried region in the other references is reported in a message at the top of the results page.

Figure 1.

Genotype query options (A) and results table (B) with multiple reference genomes alleles. The selected reference genome (Nipponbare) is displayed at the top header row just below the SNP positions. The alleles for the other genomes (Kasalath, DJ 123, IR 64, 93-11) are shown below in the table header. The corresponding positions in the other genomes are displayed in the message box.

Indels

In addition to SNPs, the variant calling pipeline (11) yielded short indel data. Unlike most variant databases, where indels are presented as alleles, we optionally display them in the genotype matrix along with the SNPs, in a multiple sequence alignment-like format (shown in Figure 2). To accomplish this, we determine the length N of the longest indel at each anchor point across all 3024 varieties and insert N columns to the right of the anchor point column. For an insertion of length I less than N, the inserted nucleotides fill the first I columns, and columns I + 1 to N are padded with gaps. If the anchor point is at position P, the insertion region column positions are P.01, P.02, to P.N and the reference alleles are set to gap. For a deletion of length D less than N, gaps fill the first D columns, and the reference is copied for columns D + 1 to N. If the anchor point is at position P, the deletion region column positions are P + 1, P + 2, to P + N and the reference alleles are taken from the reference genome.

Figure 2.

Genotype matrix with short indels. The table displays deletions (positions in blue) at anchor positions (region): 27698 (27699), 27791 (27792–27794), 27836 (27837–27841). Insertion regions (positions in green) are at 27722.01–27722.04 and 27797.01. For deletion regions, the reference is copied from the reference genome, while for insertions the reference genome is set to gaps.
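The column layout described for indels can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the function names, the `-` gap symbol, the per-variety input dictionaries and the example bases are hypothetical; only the anchor positions (27722 and 27836) and the padding rules come from the text and Figure 2.

```python
GAP = "-"


def insertion_columns(anchor, insertions):
    """Lay out insertions at one anchor point.

    insertions maps variety name -> inserted bases ("" if the variety has
    no insertion). Returns (column labels, reference row, variety rows).
    """
    n = max(len(bases) for bases in insertions.values())  # longest insertion
    labels = [f"{anchor}.{i:02d}" for i in range(1, n + 1)]  # P.01 .. P.N
    reference = [GAP] * n  # the reference has no bases in an insertion region
    rows = {
        variety: list(bases) + [GAP] * (n - len(bases))  # pad columns I+1..N
        for variety, bases in insertions.items()
    }
    return labels, reference, rows


def deletion_columns(anchor, ref_bases, deletions):
    """Lay out deletions at one anchor point.

    ref_bases holds the reference bases that follow the anchor (at least as
    many as the longest deletion); deletions maps variety name -> number of
    deleted bases (0 if none).
    """
    n = max(deletions.values())  # longest deletion
    labels = [str(anchor + i) for i in range(1, n + 1)]  # P+1 .. P+N
    reference = list(ref_bases[:n])  # copied from the reference genome
    rows = {
        variety: [GAP] * d + reference[d:]  # gaps first, then reference bases
        for variety, d in deletions.items()
    }
    return labels, reference, rows


ins_labels, ins_ref, ins_rows = insertion_columns(
    27722, {"variety_A": "ACGT", "variety_B": "AC", "variety_C": ""}
)
del_labels, del_ref, del_rows = deletion_columns(
    27836, "TTGCA", {"variety_A": 5, "variety_B": 2, "variety_C": 0}
)
```

Because every variety row has exactly N cells per anchor, the indel columns align with the SNP columns in the same matrix.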

Genomic features

We also developed features to query genomic data. Although most of the genomic data were incorporated as provided by external data sources, we introduced some conventions to uniformly name and merge various gene models. First, we merged the Nipponbare MSUv7 (16) and RAP (17) gene models, plus in-house FGenesh++ (18) annotation, into a single set of gene models and named them using our convention, of the form OsNippo{YY}g{NNNNNN}. Details of the merging and naming procedures are described in Supplementary Information Sections I and II, respectively. The advantage of this approach is that we were able to map and interconvert all Nipponbare genes between gene models based on location and overlaps instead of names. We also introduced a naming convention for gene loci for the other reference genomes, described in Supplementary Information Section II and summarized in Table 1.

Table 1.

Gene locus names used for the rice reference genomes

Reference genome	Reference	Gene loci names
93-11	(14)	Os9311_{YY}g{NNNNNN}, Os9311_{XXXXX}g{NNNNNN}
IR 64	(12)	OsIR64_{XXXXX}g{NNNNNN}
DJ 123	(12)	OsDJ123{XXXXX}g{NNNNNN}
Kasalath	(13)	OsKasal{YY}g{NNNNNN}
Nipponbare	(16,17)	OsNippo{YY}g{NNNNNN}
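As an illustration of the chromosome-based patterns in Table 1, the following Python sketch formats and parses such names. It assumes that {YY} is a zero-padded two-digit chromosome number and {NNNNNN} a zero-padded six-digit number; the actual rules are given in Supplementary Information Section II and may differ. The contig-based {XXXXX} forms are not covered, and the function names are hypothetical.

```python
import re

# Prefixes of the chromosome-based name patterns listed in Table 1.
PREFIXES = {
    "Nipponbare": "OsNippo",
    "Kasalath": "OsKasal",
    "93-11": "Os9311_",
}

# Assumption: YY is two digits and NNNNNN is six digits.
LOCUS_RE = re.compile(r"^(OsNippo|OsKasal|Os9311_)(\d{2})g(\d{6})$")


def format_locus(genome, chromosome, number):
    """Build a locus name such as OsNippo01g000010."""
    return f"{PREFIXES[genome]}{chromosome:02d}g{number:06d}"


def parse_locus(name):
    """Split a locus name into (prefix, chromosome, number)."""
    match = LOCUS_RE.match(name)
    if match is None:
        raise ValueError(f"not a chromosome-based locus name: {name!r}")
    prefix, chromosome, number = match.groups()
    return prefix, int(chromosome), int(number)


example = format_locus("Nipponbare", 1, 10)
parsed = parse_locus(example)
```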

Where data are available, genes for any of the five reference assemblies can be queried using any of these constraints: functional annotation, gene name/symbols, accession number, Gene Ontology terms, traits or Trait Ontology terms, sequence or lists of SNP positions. The data sources are listed in Table 2 and were imported into SNP-Seek using the CHADO schema (19). Storing them together with genotype and phenotype data allows complex queries to be performed for various analyses, with results displayed in the SNP-Seek interface.

Table 2.

Data sources for genomics data

Data	Source	URL	Reference
Gene model	MSU v7	http://rice.plantbiology.msu.edu	(16)
Gene model	RAP	http://rapdb.dna.affrc.go.jp	(17)
Gene names/symbols	Oryzabase	http://shigen.nig.ac.jp/rice/oryzabase	(20)
Gene ontology	MSU v7	http://rice.plantbiology.msu.edu	(16)
Trait genes	OGRO	http://qtaro.abr.affrc.go.jp/ogro/table	(21)
Trait ontology-genes	Oryzabase	http://shigen.nig.ac.jp/rice/oryzabase	(20)
Plant ontology-genes	Oryzabase	http://shigen.nig.ac.jp/rice/oryzabase	(20)
QTL	Q-TARO	http://qtaro.abr.affrc.go.jp	(26)
Sequence	MSU v7	http://rice.plantbiology.msu.edu	(16)

NEW QUERY METHODS AND TOOLS

Data alone will not have an impact unless they are made available to those who need them most and who best understand their biological significance. Large data sets require above-average data handling and programming skills that may not be available to the average biologist. With this motivation, we implemented several query, data management and visualization features that work on large variant and genomic data sets but remain relatively quick and intuitive for the general researcher.

Ontology-driven queries

Taking advantage of the CHADO data model, we use ontology terms to constrain gene queries, exploiting transitive closure. That is, selecting a term in the ontology returns entities related to the term and to its descendants as defined in the ontology. In the gene locus query, Gene Ontology terms can be used as constraints. Trait Ontology terms can also be used to query genes where gene–trait association data are available from Oryzabase (20) or OGRO (21). In variety queries, phenotypes are mapped to Trait Ontology (http://browser.planteome.org/amigo/term/TO:0000387) and Crop Ontology (Rice) terms (http://www.cropontology.org/terms/CO_320:ROOT/).
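The effect of transitive closure on a query can be shown with a small Python sketch. It walks a toy ontology in memory rather than querying a CHADO database, and the term identifiers, gene names and function names are all hypothetical.

```python
from collections import deque


def descendants(term, children):
    """Return the term plus every term reachable through child links."""
    seen = {term}
    queue = deque([term])
    while queue:
        current = queue.popleft()
        for child in children.get(current, ()):
            if child not in seen:  # guards against revisiting shared children
                seen.add(child)
                queue.append(child)
    return seen


def genes_for_term(term, children, annotations):
    """Genes annotated with the term or with any of its descendants."""
    terms = descendants(term, children)
    return sorted(
        gene for gene, gene_terms in annotations.items()
        if terms & set(gene_terms)
    )


# Toy ontology: parent term -> direct child terms.
children = {
    "T:stress": ["T:abiotic", "T:biotic"],
    "T:abiotic": ["T:drought", "T:salt"],
}
# Toy annotations: gene -> directly assigned terms.
annotations = {
    "geneA": ["T:drought"],
    "geneB": ["T:biotic"],
    "geneC": ["T:yield"],
}

abiotic_genes = genes_for_term("T:abiotic", children, annotations)
stress_genes = genes_for_term("T:stress", children, annotations)
```

Querying the broad term returns geneA even though it is annotated only with a more specific descendant term, which is the behaviour the closure provides.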

Allele frequency display

The allele frequency chart (shown in Figure 3) displays the frequency or count of major and minor alleles for the queried region or positions, for all varieties or by subpopulation. The chart can be useful for detecting haplotype blocks in the queried region, since adjacent SNPs in a block tend to have the same allele frequencies. If the major allele frequency of a subpopulation is 1 but lower in the other groups, the subpopulation has no variant at those positions, which may be an important discriminator for the group. The chart can also show genotype instead of allele statistics.

Figure 3.

Allele frequency chart with major/minor allele/genotype frequency/count at each SNP position in the queried region for all or each subpopulation.
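The statistics behind such a chart can be sketched in Python for a single SNP position. The sketch assumes one allele call per variety (homozygous calls) with `None` for missing data; the function names, subpopulation labels and calls are made up for illustration.

```python
from collections import Counter


def allele_frequencies(calls):
    """Major/minor allele and their frequencies for one SNP position."""
    counts = Counter(allele for allele in calls if allele is not None)
    total = sum(counts.values())
    if total == 0:
        return None  # no usable calls at this position
    ranked = counts.most_common()
    major, major_count = ranked[0]
    minor, minor_count = ranked[1] if len(ranked) > 1 else (None, 0)
    return {
        "major": major,
        "major_freq": major_count / total,
        "minor": minor,
        "minor_freq": minor_count / total,
    }


def frequencies_by_subpopulation(calls, subpopulations):
    """Group calls by subpopulation label, then summarise each group."""
    groups = {}
    for call, label in zip(calls, subpopulations):
        groups.setdefault(label, []).append(call)
    return {label: allele_frequencies(group) for label, group in groups.items()}


calls = ["A", "A", "A", "G", "A", "G", "G", None]
labels = ["indica"] * 3 + ["japonica"] * 4 + ["aus"]
summary = frequencies_by_subpopulation(calls, labels)
```

In this toy example the indica group has a major allele frequency of 1 while the japonica group does not, the pattern the text describes as a possible discriminator for a group.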

New JBrowse tracks and reference genomes

The JBrowse genome browser (22) has been significantly updated since our first release. First, new sets of tracks for Nipponbare are organized by category. We added tracks for trait-associated genes and for the BAM and VCF analysis results of each of the 3024 varieties, which are stored in Amazon S3. All tracks for Nipponbare are listed in Table 3.

Table 3.

JBrowse tracks for Nipponbare

Category	Track names (count)	Reference
Gene model	MSU7; RAP representative; RAP predicted; FGenesh++; Merged MSU7, RAP, FGenesh++	(16,17)
Trait Genes	28 OGRO trait tracks; OGRO all trait genes; Oryzabase all trait genes	(20,21)
QTL	28 QTARO QTL tracks; QTARO all QTL	(26)
BAM	3024 varieties	https://aws.amazon.com/public-data-sets/3000-rice-genome/
BAM Coverage	3024 varieties	https://aws.amazon.com/public-data-sets/3000-rice-genome/
VCF	3024 varieties	https://aws.amazon.com/public-data-sets/3000-rice-genome/
Alignment	Nipponbare versus 9311; Nipponbare versus IR64-21; Nipponbare versus DJ123; Nipponbare versus Kasalath	This project
Variants	SNPs v2; INDELs v2; SNPs v1	This project

The second update is a separate JBrowse instance for each of the other four reference assemblies. The tracks in these instances are loaded with the same types of data as for Nipponbare, using GFF format. Each instance can display the sequence and gene models as provided by the source with the original locus names, and another track using our locus naming convention described in Supplementary Information Section II. The alignments with the other four genomes can also be viewed as tracks. These instances are accessible through the main menu.

Large SNP queries

In the prior version of SNP-Seek, SNP queries for large regions or long position lists were blocked by server timeouts, because the client application had to wait for a query to finish before the user could proceed with other tasks. To accommodate large queries, we implemented an asynchronous query engine that uses the Spring (https://spring.io) @Async annotation to manage parallel processes. This allowed us to extend the genotype query limit to a 5Mb region, 500kb SNPs or 1000 gene loci; results of such queries are not displayed but are available for download. The user need not wait for the task to finish, since a dynamic link is given to monitor progress and download the results when ready.
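The submit, monitor and download pattern described above can be sketched as follows. This is a Python analog built on `concurrent.futures`, not the Spring @Async code used in SNP-Seek; the class name `QueryJobs`, the status strings and the stand-in query function are hypothetical.

```python
import uuid
from concurrent.futures import ThreadPoolExecutor


class QueryJobs:
    """Run long queries in the background and track them by job id."""

    def __init__(self, max_workers=2):
        self._pool = ThreadPoolExecutor(max_workers=max_workers)
        self._jobs = {}

    def submit(self, query_fn, *args):
        """Start a query and return immediately with a job id."""
        job_id = uuid.uuid4().hex
        self._jobs[job_id] = self._pool.submit(query_fn, *args)
        return job_id

    def status(self, job_id):
        """Report 'running', 'ready' or 'failed' without blocking."""
        future = self._jobs[job_id]
        if not future.done():
            return "running"
        return "failed" if future.exception() is not None else "ready"

    def result(self, job_id, timeout=None):
        """Fetch the finished result (blocks until ready or timeout)."""
        return self._jobs[job_id].result(timeout=timeout)


def region_query(chromosome, start, end):
    """Stand-in for a long-running genotype query."""
    return f"{chromosome}:{start}-{end}"


jobs = QueryJobs()
job_id = jobs.submit(region_query, "chr01", 1, 5_000_000)
```

The job id plays the role of the dynamic link: the caller can poll `status(job_id)` and fetch the result once it reports `ready`, instead of holding a request open until the query completes.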

Alternate sequence download

A common task is to reconstruct the alternate sequence for a list of regions and varieties by substituting the SNPs and integrating the insertions and deletions into the reference genome. This feature uses tabix (23) to query the VCF files from Amazon S3 and processes the VCF with the FastaAlternateReferenceMaker tool in GATK (11). Results are downloadable as compressed FASTA files in gzip format (*.fasta.gz).
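The idea of building an alternate sequence can be illustrated with a toy Python function. It is a simplified stand-in, not the GATK FastaAlternateReferenceMaker tool: it assumes VCF-style 1-based positions, non-overlapping variants and a single alternate allele per variant, and the reference string and variants are made up.

```python
def apply_variants(reference, variants):
    """Apply (position, ref_allele, alt_allele) variants to a sequence.

    Variants are applied from right to left so that insertions and
    deletions do not shift the coordinates of variants still to be applied.
    """
    sequence = reference
    for position, ref_allele, alt_allele in sorted(variants, reverse=True):
        start = position - 1  # convert 1-based position to 0-based index
        if sequence[start:start + len(ref_allele)] != ref_allele:
            raise ValueError(f"reference mismatch at position {position}")
        sequence = sequence[:start] + alt_allele + sequence[start + len(ref_allele):]
    return sequence


reference = "ACGTACGTAC"
variants = [
    (2, "C", "T"),     # SNP
    (5, "A", "AGG"),   # insertion of GG after position 5
    (8, "TA", "T"),    # deletion of the A at position 9
]
alternate = apply_variants(reference, variants)
```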

List management

We want SNP-Seek to be used by researchers with minimal data interface and conversion issues. The List Manager is designed for this purpose. Three types of lists are currently implemented: variety, SNP and locus lists. The user can create a list, use it to constrain a query, generate a list from the query results and then download it or submit it to other analysis tools or queries. The flow of data and queries available in SNP-Seek is illustrated in Figure 4. Arrows show the data that can be derived from an initial set of information. Along with the available set operations, the system can be used for gene-trait association discovery. We extended the SNP List functionality to perform SNP-Marker annotation and genotype matching:

Figure 4.

Query capabilities of SNP-Seek. The blocks at the left (rounded) are possible query constraints to the query modules (rectangles). The query results may be stored as a list (parallelograms), and used as constraints in further queries. Lists may also be created by the user as initial constraints. The marker annotator accepts a list of SNP positions, which may be the result from experiments or GWAS studies, to generate constraints for further queries or loop back to the initial constraints, increasing the confidence of the association.

SNP/Marker annotator

The user can create a list of SNP positions, and this feature annotates the markers with evidence collected from various other databases and analyses. The list may contain significant markers from gene expression or GWAS studies. The annotations can include gene models (RAP (17), MSUv7 (16) or FGenesh++ (18)) or promoter regions (FGenesh++, PlantPromDB (24)) if SNPs are located within these loci. The effects of SNP variants were also added using results from SNPEff (25). For SNPs within gene models, additional evidence about the gene is included using Gene Ontology terms, Plant and Trait Ontology terms and gene names collected from Oryzabase (20), trait genes from OGRO (21), QTL from Q-TARO (26), interacting genes from RiceNet v2 (27) and rice proteins from PRIN (28). The list of annotations and references is in Supplementary Table S3.
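The first step of such an annotation, finding which loci contain each SNP, can be sketched in Python. The linear scan, the tuple layouts and the locus names are assumptions for illustration; a production system would more likely use an indexed interval lookup.

```python
def annotate_snps(snps, loci):
    """Map each SNP to the loci whose interval contains it.

    snps: list of (chromosome, position).
    loci: list of (locus_name, chromosome, start, end), inclusive coordinates.
    """
    annotations = {}
    for chromosome, position in snps:
        annotations[(chromosome, position)] = [
            name
            for name, locus_chromosome, start, end in loci
            if locus_chromosome == chromosome and start <= position <= end
        ]
    return annotations


# Hypothetical loci; overlapping models are allowed.
loci = [
    ("geneA", "chr01", 1000, 2500),
    ("geneB", "chr01", 2400, 4000),
    ("geneC", "chr02", 1000, 2500),
]
snps = [("chr01", 2450), ("chr01", 5000), ("chr02", 1000)]
hits = annotate_snps(snps, loci)
```

Once a SNP is tied to a locus, the ontology terms, trait associations and interaction data attached to that locus can be joined in as additional annotation columns.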

Genotype match

A common use case is to find the most related genotypes among the 3k/HDRA varieties when given a particular genotype. We created a query where the constraint is a list of SNP positions with allele values, and the result is a genotype table where the varieties are sorted based on the number of matching alleles between the query and each variety in the selected dataset. The genotype table can also display values for a selected phenotype allowing quick evaluation of the effect of the queried genotype on the phenotype.
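The ranking step can be sketched in a few lines of Python. The dictionary layouts, the tie-breaking by variety name and the example data are assumptions for illustration, not the SNP-Seek implementation.

```python
def rank_by_match(query, genotypes):
    """Rank varieties by the number of alleles matching the query.

    query: dict mapping SNP position -> allele.
    genotypes: dict mapping variety -> dict of SNP position -> allele.
    Returns (variety, match_count) pairs, best match first; ties are
    ordered by variety name.
    """
    scores = []
    for variety, calls in genotypes.items():
        matches = sum(
            1 for position, allele in query.items()
            if calls.get(position) == allele  # missing calls never match
        )
        scores.append((variety, matches))
    scores.sort(key=lambda item: (-item[1], item[0]))
    return scores


query = {101: "A", 250: "G", 399: "T"}
genotypes = {
    "variety_A": {101: "A", 250: "G", 399: "C"},
    "variety_B": {101: "A", 250: "G", 399: "T"},
    "variety_C": {101: "T", 250: "G"},
}
ranking = rank_by_match(query, genotypes)
```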

SOFTWARE AND DATABASE UPDATES

To meet the data and query requirements described in the previous sections, we made several modifications to the underlying software architecture. Our major objectives for these updates were to improve query performance, to increase query capacity, to easily integrate new datasets and to serve data to other software systems. Query speed was improved by using hybrid storage, wherein genotype matrices are located on the web server as HDF5 (https://www.hdfgroup.org/HDF5) formatted files while other data and the indices for the HDF5 files are located in the relational database management system (RDBMS). The middleware was re-designed into Service and Data Access layers using Java interfaces, further allowing multiple genotypic data sets to be accommodated. The details of these updates are described in Supplementary Information III.
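The division of labour in the hybrid storage can be illustrated with a small Python sketch. To stay within the standard library, an in-memory list of byte strings stands in for the HDF5 genotype matrix and SQLite stands in for the relational database; the table layout, names and data are hypothetical.

```python
import sqlite3

# Stand-in for the HDF5 matrix: one row per variety, one column per SNP.
varieties = ["variety_A", "variety_B", "variety_C"]
matrix = [b"ACGT", b"ATGA", b"TCAT"]

# Relational index: maps a genomic position to its matrix column.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE snp (chrom TEXT, pos INTEGER, col INTEGER)")
db.executemany(
    "INSERT INTO snp VALUES (?, ?, ?)",
    [("chr01", 100, 0), ("chr01", 250, 1), ("chr01", 900, 2), ("chr02", 40, 3)],
)


def query_region(chrom, start, end):
    """Look up columns in the index, then slice the genotype matrix."""
    rows = db.execute(
        "SELECT pos, col FROM snp "
        "WHERE chrom = ? AND pos BETWEEN ? AND ? ORDER BY pos",
        (chrom, start, end),
    ).fetchall()
    positions = [pos for pos, _ in rows]
    table = {
        name: "".join(chr(row[col]) for _, col in rows)
        for name, row in zip(varieties, matrix)
    }
    return positions, table


positions, table = query_region("chr01", 200, 1000)
```

The relational side answers the question of which columns are needed, and the matrix side returns the genotype calls by direct offset, so a region query needs no per-genotype rows in the database.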

Web services

We defined a set of RESTful web-service calls for internal use and shared them with collaborators through our development site (http://snp-seek.irri.org/dev). The calls are focused on Germplasm, Phenotypes and Genotypes. Another set of calls was implemented for compliance with the Breeding API (BrAPI, http://docs.brapi.apiary.io) for Germplasm, Phenotypes, Maps, Studies and Genotypes. The web services documentation can be accessed from the Help menu. Most calls are open, but some require a login and password. Interested collaborators may contact the authors to use the protected calls.

CONCLUSION AND PERSPECTIVE

The 2016 release of SNP-Seek is designed to be adaptive and responsive to the deluge of genomic data from various sequencing and high-density genotyping projects. It can also accommodate phenotyping (trait) data from germplasm panels with curated genotype data and connect to legacy phenotypic data for germplasm from the International Rice Genebank Collection (GRIMS) database. SNP-Seek promises to continue to be an indispensable resource and tool for rice genomics and allele discovery. Our next efforts will focus on analysis and visualization tools for GWA and genomic selection studies and on integrating more of the public genotypic and phenotypic datasets.

26 in total

1. dbSNP: the NCBI database of genetic variation.

Authors: S T Sherry; M H Ward; M Kholodov; J Baker; L Phan; E M Smigielski; K Sirotkin
Journal: Nucleic Acids Res Date: 2001-01-01 Impact factor: 16.971

2. PlantProm: a database of plant promoter sequences.

Authors: Ilham A Shahmuradov; Alex J Gammerman; John M Hancock; Peter M Bramley; Victor V Solovyev
Journal: Nucleic Acids Res Date: 2003-01-01 Impact factor: 16.971

3. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors: Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal: Genome Res Date: 2010-07-19 Impact factor: 9.043

4. A draft sequence of the rice genome (Oryza sativa L. ssp. indica).

Authors: Jun Yu; Songnian Hu; Jun Wang; Gane Ka-Shu Wong; Songgang Li; Bin Liu; Yajun Deng; Li Dai; Yan Zhou; Xiuqing Zhang; Mengliang Cao; Jing Liu; Jiandong Sun; Jiabin Tang; Yanjiong Chen; Xiaobing Huang; Wei Lin; Chen Ye; Wei Tong; Lijuan Cong; Jianing Geng; Yujun Han; Lin Li; Wei Li; Guangqiang Hu; Xiangang Huang; Wenjie Li; Jian Li; Zhanwei Liu; Long Li; Jianping Liu; Qiuhui Qi; Jinsong Liu; Li Li; Tao Li; Xuegang Wang; Hong Lu; Tingting Wu; Miao Zhu; Peixiang Ni; Hua Han; Wei Dong; Xiaoyu Ren; Xiaoli Feng; Peng Cui; Xianran Li; Hao Wang; Xin Xu; Wenxue Zhai; Zhao Xu; Jinsong Zhang; Sijie He; Jianguo Zhang; Jichen Xu; Kunlin Zhang; Xianwu Zheng; Jianhai Dong; Wanyong Zeng; Lin Tao; Jia Ye; Jun Tan; Xide Ren; Xuewei Chen; Jun He; Daofeng Liu; Wei Tian; Chaoguang Tian; Hongai Xia; Qiyu Bao; Gang Li; Hui Gao; Ting Cao; Juan Wang; Wenming Zhao; Ping Li; Wei Chen; Xudong Wang; Yong Zhang; Jianfei Hu; Jing Wang; Song Liu; Jian Yang; Guangyu Zhang; Yuqing Xiong; Zhijie Li; Long Mao; Chengshu Zhou; Zhen Zhu; Runsheng Chen; Bailin Hao; Weimou Zheng; Shouyi Chen; Wei Guo; Guojie Li; Siqi Liu; Ming Tao; Jian Wang; Lihuang Zhu; Longping Yuan; Huanming Yang
Journal: Science Date: 2002-04-05 Impact factor: 47.728

5. SNP-Seek database of SNPs derived from 3000 rice genomes.

Authors: Nickolai Alexandrov; Shuaishuai Tai; Wensheng Wang; Locedie Mansueto; Kevin Palis; Roven Rommel Fuentes; Victor Jun Ulat; Dmytro Chebotarov; Gengyun Zhang; Zhikang Li; Ramil Mauleon; Ruaraidh Sackville Hamilton; Kenneth L McNally
Journal: Nucleic Acids Res Date: 2014-11-27 Impact factor: 16.971

6. Whole genome de novo assemblies of three divergent strains of rice, Oryza sativa, document novel gene space of aus and indica.

Authors: Michael C Schatz; Lyza G Maron; Joshua C Stein; Alejandro Hernandez Wences; James Gurtowski; Eric Biggers; Hayan Lee; Melissa Kramer; Eric Antoniou; Elena Ghiban; Mark H Wright; Jer-ming Chia; Doreen Ware; Susan R McCouch; W Richard McCombie
Journal: Genome Biol Date: 2014 Impact factor: 13.583

7. Construction of pseudomolecule sequences of the aus rice cultivar Kasalath for comparative genomics of Asian cultivated rice.

Authors: Hiroaki Sakai; Hiroyuki Kanamori; Yuko Arai-Kichise; Mari Shibata-Hatta; Kaworu Ebana; Youko Oono; Kanako Kurita; Hiroko Fujisawa; Satoshi Katagiri; Yoshiyuki Mukai; Masao Hamada; Takeshi Itoh; Takashi Matsumoto; Yuichi Katayose; Kyo Wakasa; Masahiro Yano; Jianzhong Wu
Journal: DNA Res Date: 2014-02-26 Impact factor: 4.458

8. Open access resources for genome-wide association mapping in rice.

Authors: Susan R McCouch; Mark H Wright; Chih-Wei Tung; Lyza G Maron; Kenneth L McNally; Melissa Fitzgerald; Namrata Singh; Genevieve DeClerck; Francisco Agosto-Perez; Pavel Korniliev; Anthony J Greenberg; Ma Elizabeth B Naredo; Sheila Mae Q Mercado; Sandra E Harrington; Yuxin Shi; Darcy A Branchini; Paula R Kuser-Falcão; Hei Leung; Kowaru Ebana; Masahiro Yano; Georgia Eizenga; Anna McClung; Jason Mezey
Journal: Nat Commun Date: 2016-02-04 Impact factor: 14.919

9. Allele mining and enhanced genetic recombination for rice breeding.

Authors: Hei Leung; Chitra Raghavan; Bo Zhou; Ricardo Oliva; Il Ryong Choi; Vanica Lacorte; Mona Liza Jubay; Casiana Vera Cruz; Glenn Gregorio; Rakesh Kumar Singh; Victor Jun Ulat; Frances Nikki Borja; Ramil Mauleon; Nickolai N Alexandrov; Kenneth L McNally; Ruaraidh Sackville Hamilton
Journal: Rice (N Y) Date: 2015-11-25 Impact factor: 4.783

10. Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data.

Authors: Yoshihiro Kawahara; Melissa de la Bastide; John P Hamilton; Hiroyuki Kanamori; W Richard McCombie; Shu Ouyang; David C Schwartz; Tsuyoshi Tanaka; Jianzhong Wu; Shiguo Zhou; Kevin L Childs; Rebecca M Davidson; Haining Lin; Lina Quesada-Ocampo; Brieanne Vaillancourt; Hiroaki Sakai; Sung Shin Lee; Jungsok Kim; Hisataka Numa; Takeshi Itoh; C Robin Buell; Takashi Matsumoto
Journal: Rice (N Y) Date: 2013-02-06 Impact factor: 4.783
