Literature DB >> 31624841

MBKbase for rice: an integrated omics knowledgebase for molecular breeding in rice.

Hua Peng^1,2, Kai Wang¹, Zhuo Chen^1,2, Yinghao Cao¹, Qiang Gao¹, Yan Li¹, Xiuxiu Li^1,2, Hongwei Lu^1,2, Huilong Du^1,2, Min Lu^1,2, Xin Yang¹, Chengzhi Liang^1,2.

Abstract

To date, large amounts of genomic and phenotypic data have been accumulated in the fields of crop genetics and genomic research, and the data are increasing very quickly. However, the bottleneck to using big data in breeding is integrating the data and developing tools for revealing the relationship between genotypes and phenotypes. Here, we report a rice sub-database of an integrated omics knowledgebase (MBKbase-rice, www.mbkbase.org/rice), which integrates rice germplasm information, multiple reference genomes with a united set of gene loci, population sequencing data, phenotypic data, known alleles and gene expression data. In addition to basic data search functions, MBKbase provides advanced web tools for genotype searches at the population level and for visually displaying the relationship between genotypes and phenotypes. Furthermore, the database also provides online tools for comparing two samples by their genotypes and finding target germplasms by genotype or phenotype information, as well as for analyzing the user submitted SNP or sequence data to find important alleles in the germplasm. A soybean sub-database is planned for release in 3 months and wheat and maize will be added in 1-2 years. The data and tools integrated in MBKbase will facilitate research in crop functional genomics and molecular breeding.

Entities: Chemical Disease Species

Mesh：

Year: 2020 PMID： 31624841 PMCID： PMC7145604 DOI： 10.1093/nar/gkz921

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 16.971

INTRODUCTION

Germplasm resources provide the genetic basis for crop improvement (1). In the past, many international organizations and research centers have been engaged in crop germplasm collection and preservation and have generated a large amount of germplasm phenotype data in field tests. These organizations include EURISCO-ECPGR (www.ecpgr.cgiar.org) (2), GRIN-NGRP (www.ars-grin.gov), NBRP (https://nbrp.jp/) (3), IRGCIS-IRRI (http://irgcis.irri.org:81/grc/IRGCISHome.html) (4), CIMMYT (www.cimmyt.org), CGRIS (www.cgris.net), Genesys (www.genesys-pgr.org) and others (5). Recently, high-throughput phenotyping technologies have enabled the collection of large-scale germplasm phenomic data under different experimental designs or environments (6). Over the last decade, next generation sequencing (NGS) technologies have deeply impacted crop breeding. NGS has been widely used for linkage mapping, genome wide association studies (GWAS), marker-assisted selection (MAS), genomic selection (GS), domestication and population structure analysis in crops (7–12). Benefitting from the application of these technologies, a large number of important genes/QTLs have been mapped and cloned (10,13–20). These genes provide abundant operational targets for molecular design breeding and gene modification (21). Along with reduced sequencing cost, high coverage whole‐genome sequencing (WGS) of natural crop populations have opened the door to exploring genetic diversity among individuals, which results in a large amount of genomic data. More recently, single-molecule DNA sequencing of long reads has been employed in high-quality assembly with genomes of various sizes and levels of complexity (22–29). These high-quality genomes are very important for expanding variant calling, which makes it possible to obtain all alleles of all genes in WGS populations using high-throughput methods. For efficient usage, those public raw datasets need to be integrated into a big data platform where functional associations between heterogeneous datasets can be explored further. More importantly, the central challenge of integration is to establish genotype to phenotype (G2P) links and develop a user-friendly and highly flexible web tool for data visualization and mining. Here, we describe an integrated omics knowledgebase for molecular breeding (MBKbase, www.mbkbase.org), specifically the rice sub-database, which integrates multiple reference genomes, population WGS data, germplasm information, phenotype, known genes and RNA-seq data. A soybean sub-database will be released in 3 months and wheat and maize will be added later. MBKbase aims to integrate multiple high-quality reference genomes, and reveal the relationship among germplasm, phenotype and genotype by providing genotypes for each gene locus and their associated phenotypes in a population. In this respect, MBKbase is very different from other breeding data management systems, such as GOBII (http://gobiiproject.org), NRSP10 (www.nrsp10.org), and AgBioData (www.agbiodata.org) (30) projects, as well as those mainly focusing on nucleotide variants such as GVM (31), RiceVarMap (32), and SNP-Seek (33), or the genomic or comparative genomic databases such as Ensembl (34) and Gramene (35). MBKbase is designed for multi-species data management and display. Each species will share the same functional framework but have an independent set of databases, web home page and a set of menu items to go to subcategories or modules (Figure 1). The basic architecture of MBKbase consists of an Apache web server, MongoDB NoSQL data management system, Pyramid web framework, Beaker web library and JQuery library. By using multiple application servers, distributed databases and efficient database structure for standardized data storage, as well as a data caching function in middleware, MBKbase ensures high-speed data access, integration analysis and visualization. We adopt a modular approach for front-end web development, which allows multiple types of data to be associated flexibly and displayed in various combinations on different web pages (Figure 1).

Figure 1.

Schematic of data, page and function modules in MKBbase. Each data module has many linked pages and functional units for displaying the information related to the module, as exemplified by the Germplasm module in the figure. The Asian cultivated rice (Oryza sativa L.) is one of the most important crops, and the accumulated data related to rice population genomics is also the most abundant among all crops. Because of this, we first developed the sub-database for rice (http://www.mbkbase.org/rice) and only describe the rice data in this publication. Asian rice consists of two subspecies, indica and japonica, with high-quality reference genomes of the japonica rice Nipponbare (Nip) and the indica rice Shuhui498 (R498) having been released (22,36). R498 is more useful than Nip as a reference for mapping sequences of indica rice, which consists of more varieties than japonica rice and has more genetic diversity. Therefore, besides Nip, we also provide genomic variation based on the R498 reference genome, which is not found in any other databases. In addition, we provide a unified set of gene loci as a pan-genome based on the annotations on both Nip and R498 genomes. We collected publicly available rice germplasm information, population WGS data and phenotype data (Table 1 and Supplementary Tables S1–3). In particular, the data include >1000 Chinese rice accessions with both WGS data and phenotype data not available in other databases, which have been deposited into BIG-BioProject for download. MBKbase will continue to integrate all published WGS projects, phenomic data, known genes and published data related to molecular breeding to develop advanced analytical and data mining tools, as well as add more crop species.

Table 1.

The summary of rice data in MBKbase

				Phenotype
	Total num	With pedigree	With source locations	Traits	Value Num
Germplasm	137 769	24 462	134 626	130	4 786 640
WGS sample	7010	1153	6351	122	207 158
	Named Gene num		Trait gene num		Verified alleles num
Known genes	13 219		4821		91
	Run num		Tissue		Stage or tissue
RNA-seq	175		20		45
		MBK Allele Num	SNP num (AF ≥ 0.01)		Indel num (AF > 0.005)
Reference	Nip	51 722	14 850 931		2 250 804
	R498	54 973	13 278 107		2 387 538

The summary of rice data in MBKbase

TOP-LEVEL DATABASE MODULES

MBKbase is divided into several different top-level modules for organizing various data types and applications (Figure 1). For each data module, we provide a data description and basic search functions as described below. The Germplasm module (www.mbkbase.org/rice/germplasm) provides basic information for rice germplasm collections as well as links to other related information of each germplasm such as genotype or phenotype. Germplasm can be searched by name or ID, which supports regular expressions. A germplasm can either have an associated sequence or not. Because a cultivar or an accession sharing the same germplasm name may have different origins, we assign a different sample ID to each instance of the same germplasm which is associated with WGS data or other omics data. The rice data include 7010 samples with WGS data, and 137 769 germplasms for reference purposes. The Phenotype module (www.mbkbase.org/rice/phenotype) provides information on traits and phenotypes of a germplasm or a sample. Phenotype can be searched by trait symbol or name or browsed by trait class. We used standard terms for trait description and phenotype classification when they were available. The standard terms come from the standard evaluation system for Rice (37), the Chinese Crop Germplasm Information System (CGRIS, www.cgris.net/cgris_english.html) and Plant Trait Ontology (TO) (38). Due to complexity and inconsistency of trait terms in real data, we roughly classified the rice phenotypes into five top-level classes. The trait classes were further divided into subclasses manually based on the TO terms. We will update the trait definitions, classifications and phenotypes continually in MBKbase and connect the phenotype data explicitly to TO accessions in the future. The phenotype data in MBKbase come from two different sources: (a) evaluation data from resource databases or books for general descriptions of germplasms or cultivars that are not necessarily sequenced or genotyped, and (b) population phenotypes collected for GWAS along with sequences or genotypes. The distribution of phenotype values can be found in the trait detail page (Supplementary Figure S1). The rice data include 122 traits with phenotype data for WGS samples and 130 traits with phenotype data for un-sequenced germplasms. The Genotype module (www.mbkbase.org/rice/genotype) provides genotype information of all sequenced germplasms or samples. The WGS data come from published projects that were collected from NCBI-SRA (www.ncbi.nlm.nih.gov/sra) and BIG-BioProject (https://bigd.big.ac.cn/bioproject/). Sequence variation including both single-nucleotide polymorphisms (SNPs) and insertions-deletions (indels) in MBKbase are called based on multiple reference genomes. The WGS data were mapped to reference genomes using the BWA-MEM (39) tool, multiple mapping reads filtered using samtools (v1.8) (40) and SNPs and indels are called using the GATK (v3.8) UnifiedGenotyper (41) tool. We define gene locus as a conceptual gene to hold multiple alleles in both reference genomes and WGS populations. Currently the rice data include 51 722 gene alleles, 14 850 931 SNPs and 2 250 804 short indels based on Nip; and 54 973 alleles, 13 278,107 SNPs and 2 387 538 short indels based on R498. Most of the loci in rice contain the alleles of both Nip and R498, which can be found with a locus search. Genotype can be searched at a locus or a set of user-selected positions. Many functions and tools are provided based on genotypes in MBKbase and they are further described below. The KnownGene module (www.mbkbase.org/rice/knownGene) provides information on functionally annotated genes and experimentally verified alleles. The module supports gene search by symbol, name or related traits, and provides gene browsing by trait categories. Currently there are 13 219 rice genes with assigned symbols and functional annotations downloaded from Q-TARO, GenBank, funRiceGenes and Oryzabase. Particularly, 63 experimental rice genes with verified variation sites were curated from the literature and associated with their corresponding phenotypes in populations. The Population module (www.mbkbase.org/rice/population) provides information on the collected WGS populations and their phylogenetic trees. Most of the rice populations were collected from external data sources and one was from an internally sequenced population. A germplasm sample can be searched to show its position in the tree with its closely related germplasms/samples. Clicking on a node in the tree will show a sub-tree structure of nodes, and the geographic distribution of the germplasm/samples is also displayed (Supplementary Figure S2).

UNITED GENE SET AS PAN-GENOME

A pan-genome consists of the genes of all individuals in a species. In MBKbase, we defined the concept of a virtual locus corresponding to a gene in a pan-genome based on reference genomes only. The genes from multiple reference genomes are united into a single gene set consisting of virtual loci. A locus can contain a single allele if the gene is present in only one reference genome, or multiple alleles if these alleles are in a collinear position in multiple reference genomes. From Nip and R498, there are a total of 95 325 loci at present. For example, under locus OsG00067204 (Figure 2A), there are alleles from both Nip (with multiple annotations based on different annotation methods) and R498. The different gene annotation sets are usually complementary to each other, providing more useful genes than a single annotation. As more reference genomes are added in MBKbase, the locus set will also be expanded with more loci as well as more alleles.

Figure 2.

An example of a locus page showing multiple types of information associated with the locus. Rice gene GS5 controlling grain size is on locus OsG00067204. (A) Two reference genomes contain different alleles at the locus. (B) Genotype table showing all genotypes in the rice population, with reference genome positions in the first row of the table and reference bases in the second row. Genotype IDs are enumerated from T1 as the most frequent genotype (allele or haplotype for homozygous genome). ‘0’ indicates that the genotype at this position is the same as in the reference genome. Capitalized bases indicate homozygous variants, lowercase bases indicate heterozygous variants and ‘-’ indicates missing information. In the REF row, the letter ‘N’ represents a short insertion in the reference genome compared with some other samples. In the GT row, the ‘N’ represents insertion of bases in the samples compared with the reference genome. (C) Distribution of each genotype (allele) in a rice population. For example, the majority of genotype (allele) T1 is found in temperate japonica lines (1251 out of 1769). (D) Boxplot showing the relationship between genotypes and phenotypes. The phenotype can be selected by users from all collected traits in the database. In this case, the locus OsG0067204 is known to be associated with grain width (GW) traits. (E) Expression profile at the locus in different reference genomes in different tissues.

GENOTYPES IN MBKBASE

Genotype is one of the key concepts in MBKbase on which many applications are built. We define a genotype to be a group of SNPs and short indels, which are either adjacent or non-adjacent on a chromosome. A genotype is equivalent to a haplotype when the identified SNP and indel are homozygous in a genome. We define four types of genotypes: (i) positional genotype at a single SNP or indel position, (ii) locus genotype at a gene locus (i.e. an allele in a homozygous genome) (Figure 2), (iii) customized genotype for a group of user-selected SNPs and indels at any positions throughout the genome (Supplementary Figure S3) and (iv) window genotype for a group of core SNPs and indels in 20-kb windows. We developed a set of web tools for searching and displaying genotypes. A genotype can be the same as or different from the reference sequence. The genotypes based on WGS data of a population are affected by two major factors: variation type and variation number, which are further influenced by many factors, such as the size of the population, the length of selected chromosomal regions, the allele frequency, missing allele frequency, sequencing depth and mapping quality. In the customized genotype page, all related parameters can be customized by users (Supplementary Figure S3). This function is useful for users to examine the type of variations in a set of target genome regions in a subpopulation.

INTEGRATION OF MULTI-OMICS DATA

In MBKbase, we used known genes controlling important agronomic traits as a bridge between genotype and phenotype. The experimentally verified functional alleles controlling phenotypic traits were manually annotated with trait class and description. In the locus genotype or custom genotype query page, all genotypes are listed in one genotype table. Samples with the same genotype are merged and assigned a genotype ID and sorted according to the number of samples sharing a genotype in descending order (Figure 2B). This provides a convenient base for revealing the relationship among germplasm, genotypes and phenotypes. The percent of ecotype or originating country of each genotype are displayed in a separate panel (Figure 2C), with the boxplots of G2P relationships generated online (Figure 2D). Users can choose which phenotype is associated for display with the genotypes based on their knowledge of known genes or variation. The phenotypes directly associated with genotypes in published GWAS projects are used for default plotting. If this type of phenotype value is missing, other phenotype data under the same germplasm name will be retrieved automatically for comparison purpose. In the genotype table, clicking any of the genotype IDs in the first column (GT_ID) leads to the genotype detail page, which contains the sample information list, the sample geographical distribution and the traits with phenotypic value (Supplementary Figure S4). If the chromosomal region of the genotype contains genes, each gene's mRNA expression profile is also shown in the expression panel (Figure 2E).

ADVANCED GENOTYPE-BASED FUNCTIONS

We provide several advanced data analytical or mining tools that are based on genotypes. They are described as follows.

Germplasm search

MBKbase provides advanced search methods for germplasm by genotype or by sequence. In the former case, a user can set up logical queries using a different combination of positional genotypes to identify the germplasm containing the genotypes. For example, a user can set up the following logical query: (Chr1:11663:CCA and Chr1:26254:T) and (Chr2:30096330:A or Chr6:1767006:G) (Figure 3A), to find and display all germplasms passing the filter. Clicking the filtered germplasm number will display the germplasm information and their geographical distribution in a result table and Google Maps. A known allele can also be used to search for germplasm that contains the allele. In the latter case, a user can submit a short DNA sequence as a query and find out which germplasms contain the same DNA sequence. For example, 142 germplasms were found that contain the same sequence as BD295334.1 (Figure 3B).

Figure 3.

Advanced functions based on genotype. (A) Advanced germplasm query by genotype (GT), the options in the drop-down list of genotype source include position GT, Locus GT and Verified GT. Users can set up complex logical queries by selecting different combinations of variants. In this example, 177 samples passed the filter with the same genotype. (B) Advanced germplasm query by sequence. Users can submit a genome sequence and identify the germplasms containing the sequence. (C) Comparison of two germplasm samples (A and B) based on Locus GT. X-axis: window position along the chromosome; Y-axis: locus number in the window, positive for sample A, negative for sample B. A red line represents the number of identical genotypes between A and B. A green line represents the number of specific genotypes of sample A. A teal line represents the number of specific genotypes of sample B. A purple line represents the number of missing or low frequency genotypes (containing sample number <10). (D) Identification of the known alleles in a WGS sample. This can be used for WGS samples either stored in the database or submitted by users through the web interface. (E) Tools for viewing the effect of variation. Clicking on any variation in a reference row of genotype table, for example, clicking ‘T’ (at position 30,315,214 bp) in the example will result in the sequence view panel automatically jumping to the corresponding position. As shown, this mutation results in the substitution of amino acids (K→M).

Germplasm comparison

A user can compare two samples in MBKbase to show the genetic variations between them at the chromosome level (Figure 3C) to find out how similar they are. Furthermore, global comparisons between three samples are also available for comparing breeding parents and offspring to identify the chromosomal segments of the offspring from each of its parents. In the genotype home page of MBKbase, clicking the ‘sample compare’ link will open the variant genotype comparison home page and clicking the ‘pedigree compare’ link will enter the pedigree genotype comparison home page.

Analyzing user submitted WGS data

A user can submit WGS data of germplasm samples to MBKbase for genotype identification. After submission of WGS data, they will be processed in real time and the resulting known alleles identified in the samples are displayed in the sample detail page (Figure 3D). MBKbase also supports offline batch uploading of private raw data of WGS samples and phenotypes, and the private data can be combined with published project data for analysis and visualization. Users can decide when to share the private data.

Variant annotation

We developed an annotation tool for adding experimentally verified functional annotations to a sequence variant such as an SNP or an indel. In the custom genotyping page, for any gene, its structure, reference protein sequence and genotype protein sequence are displayed (Figure 3E). Users can select the gene model by clicking an mRNA ID in the gene structure panel, and the corresponding protein sequence will be automatically generated and displayed in the sequence panel. In the same way, users can select a genotype by clicking a genotype row in the genotype table, and the corresponding DNA and protein sequence will be displayed. Note that the protein sequence of the genotype (haplotype) comes from the resulting sequence with all the variants in the query region, including SNPs and indels. Users can click on a variant in the genotype table, which pops up a variant annotation page. If this variant has been annotated, the variant font is marked in red, as shown at position 30 315 214 bp (Figure 3E). This provides a convenient way to manually update functional annotations as more and more genes are cloned.

CONCLUSION AND FUTURE DIRECTIONS

Here, we report MBKbase, specifically the rice sub-database, for integrating germplasm information and multi-omics data to support functional genomics and molecular breeding in rice. We did not describe all features available in the database here. Users can go to tutorials on the MBKbase website (www.mbkbase.org/rice/help) for more information. Starting with the rice sub-database, we have completed the development of the functional framework and database structure of MBKbase. We have also developed a set of pipelines for genotype construction, making it possible to achieve online integration analysis and visualizations of genotype and phenotype. As population genomic and phenotypic data become available in other crops, they will be incorporated into MBKbase as independent sub-databases. Our next released crop is soybean, which will be available in 3 months. The database will be continually updated by adding more reference genomes (such as those from (42)) with an expanded pan-genome, as well as the increasing publicly available WGS data (such as from (43)) and high-throughput phenomic data. We will also add more annotations of experimentally verified alleles and trait ontology terms to improve the annotation of phenotypes; more connections to external databases such as Gramene Pathways; and more web tools to support molecular design breeding based on known gene networks and large-scale G2P associations and prediction models for major agronomic traits. Click here for additional data file.

40 in total

1. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors: Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal: Genome Res Date: 2010-07-19 Impact factor: 9.043

2. Resequencing 302 wild and cultivated accessions identifies genes related to domestication and improvement in soybean.

Authors: Zhengkui Zhou; Yu Jiang; Zheng Wang; Zhiheng Gou; Jun Lyu; Weiyu Li; Yanjun Yu; Liping Shu; Yingjun Zhao; Yanming Ma; Chao Fang; Yanting Shen; Tengfei Liu; Congcong Li; Qing Li; Mian Wu; Min Wang; Yunshuai Wu; Yang Dong; Wenting Wan; Xiao Wang; Zhaoli Ding; Yuedong Gao; Hui Xiang; Baoge Zhu; Suk-Ha Lee; Wen Wang; Zhixi Tian
Journal: Nat Biotechnol Date: 2015-02-02 Impact factor: 54.908

Review 3. Molecular genetic dissection of quantitative trait loci regulating rice grain size.

Authors: Jianru Zuo; Jiayang Li
Journal: Annu Rev Genet Date: 2014-08-18 Impact factor: 16.830

4. Pan-genome analysis highlights the extent of genomic variation in cultivated and wild rice.

Authors: Qiang Zhao; Qi Feng; Hengyun Lu; Yan Li; Ahong Wang; Qilin Tian; Qilin Zhan; Yiqi Lu; Lei Zhang; Tao Huang; Yongchun Wang; Danlin Fan; Yan Zhao; Ziqun Wang; Congcong Zhou; Jiaying Chen; Chuanrang Zhu; Wenjun Li; Qijun Weng; Qun Xu; Zi-Xuan Wang; Xinghua Wei; Bin Han; Xuehui Huang
Journal: Nat Genet Date: 2018-01-15 Impact factor: 38.330

Review 5. Genomic Selection in the Era of Next Generation Sequencing for Complex Traits in Plant Breeding.

Authors: Javaid A Bhat; Sajad Ali; Romesh K Salgotra; Zahoor A Mir; Sutapa Dutta; Vasudha Jadon; Anshika Tyagi; Muntazir Mushtaq; Neelu Jain; Pradeep K Singh; Gyanendra P Singh; K V Prabhu
Journal: Front Genet Date: 2016-12-27 Impact factor: 4.599

6. EURISCO: The European search catalogue for plant genetic resources.

Authors: Stephan Weise; Markus Oppermann; Lorenzo Maggioni; Theo van Hintum; Helmut Knüpffer
Journal: Nucleic Acids Res Date: 2016-08-31 Impact factor: 16.971

7. Rice SNP-seek database update: new SNPs, indels, and queries.

Authors: Locedie Mansueto; Roven Rommel Fuentes; Frances Nikki Borja; Jeffery Detras; Juan Miguel Abriol-Santos; Dmytro Chebotarov; Millicent Sanciangco; Kevin Palis; Dario Copetti; Alexandre Poliakov; Inna Dubchak; Victor Solovyev; Rod A Wing; Ruaraidh Sackville Hamilton; Ramil Mauleon; Kenneth L McNally; Nickolai Alexandrov
Journal: Nucleic Acids Res Date: 2016-11-29 Impact factor: 16.971

8. Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors: Heng Li; Richard Durbin
Journal: Bioinformatics Date: 2009-05-18 Impact factor: 6.937

9. Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data.

Authors: Yoshihiro Kawahara; Melissa de la Bastide; John P Hamilton; Hiroyuki Kanamori; W Richard McCombie; Shu Ouyang; David C Schwartz; Tsuyoshi Tanaka; Jianzhong Wu; Shiguo Zhou; Kevin L Childs; Rebecca M Davidson; Haining Lin; Lina Quesada-Ocampo; Brieanne Vaillancourt; Hiroaki Sakai; Sung Shin Lee; Jungsok Kim; Hisataka Numa; Takeshi Itoh; C Robin Buell; Takashi Matsumoto
Journal: Rice (N Y) Date: 2013-02-06 Impact factor: 4.783

10. A chromosome-scale assembly of the sorghum genome using nanopore sequencing and optical mapping.

Authors: Stéphane Deschamps; Yun Zhang; Victor Llaca; Liang Ye; Abhijit Sanyal; Matthew King; Gregory May; Haining Lin
Journal: Nat Commun Date: 2018-11-19 Impact factor: 14.919

13 in total

Review 1. Advances in Multi-Omics Approaches for Molecular Breeding of Black Rot Resistance in Brassica oleracea L.

Authors: Ranjan K Shaw; Yusen Shen; Jiansheng Wang; Xiaoguang Sheng; Zhenqing Zhao; Huifang Yu; Honghui Gu
Journal: Front Plant Sci Date: 2021-12-06 Impact factor: 5.753

2. GinkgoDB: an ecological genome database for the living fossil, Ginkgo biloba.

Authors: Kai-Jie Gu; Chen-Feng Lin; Jun-Jie Wu; Yun-Peng Zhao
Journal: Database (Oxford) Date: 2022-06-27 Impact factor: 4.462

3. Genome-Wide Identification and Evolution Analysis of the Gibberellin Oxidase Gene Family in Six Gramineae Crops.

Authors: Chenhao Zhang; Xin Nie; Weilong Kong; Xiaoxiao Deng; Tong Sun; Xuhui Liu; Yangsheng Li
Journal: Genes (Basel) Date: 2022-05-12 Impact factor: 4.141

4. Xa7, a new executor R gene that confers durable and broad-spectrum resistance to bacterial blight disease in rice.

Authors: Xifeng Chen; Pengcheng Liu; Le Mei; Xiaoling He; Long Chen; Hui Liu; Shurong Shen; Zhandong Ji; Xixi Zheng; Yuchen Zhang; Zhenyu Gao; Dali Zeng; Qian Qian; Bojun Ma
Journal: Plant Commun Date: 2021-01-09

5. Whole-Genome Sequencing of the NARO World Rice Core Collection (WRC) as the Basis for Diversity and Association Studies.

Authors: N Tanaka; M Shenton; Y Kawahara; M Kumagai; H Sakai; H Kanamori; J Yonemaru; S Fukuoka; K Sugimoto; M Ishimoto; J Wu; K Ebana
Journal: Plant Cell Physiol Date: 2020-05-01 Impact factor: 4.927

6. Genome-Wide Association Study Dissects Resistance Loci against Bacterial Blight in a Diverse Rice Panel from the 3000 Rice Genomes Project.

Authors: Jialing Lu; Chunchao Wang; Dan Zeng; Jianmin Li; Xiaorong Shi; Yingyao Shi; Yongli Zhou
Journal: Rice (N Y) Date: 2021-02-27 Impact factor: 4.783

10. Genome-wide survey of the dehydrin genes in bread wheat (Triticum aestivum L.) and its relatives: identification, evolution and expression profiling under various abiotic stresses.

Authors: Yongchao Hao; Ming Hao; Yingjie Cui; Lingrang Kong; Hongwei Wang
Journal: BMC Genomics Date: 2022-01-23 Impact factor: 3.969