Literature DB >> 31336017

Towards a deeper haplotype mining of complex traits in rice with RFGB v2.0.

Chun-Chao Wang1, Hong Yu2, Ji Huang3, Wen-Sheng Wang1, Muhiuddin Faruquee1, Fan Zhang1, Xiu-Qin Zhao1, Bin-Ying Fu1, Kai Chen4, Hong-Liang Zhang5, Shuai-Shuai Tai6, Chaochun Wei7, Kenneth L McNally8, Nickolai Alexandrov8, Xiu-Ying Gao3, Jiayang Li2,9, Zhi-Kang Li1,4, Jian-Long Xu1,4, Tian-Qing Zheng1.   

Abstract

Entities:  

Keywords:  deep haplotype mining; phenotype-haplotype real-time assciation; rice (Oryza sativa L.); webservice

Mesh:

Year:  2019        PMID: 31336017      PMCID: PMC6920129          DOI: 10.1111/pbi.13215

Source DB:  PubMed          Journal:  Plant Biotechnol J        ISSN: 1467-7644            Impact factor:   9.803


× No keyword cloud information.
Rice (Oryza sativa L.) not only provides insurance covering global food security but also works as a model for plant research. Currently, with overwhelmingly accumulated sequencing data, various databases were constructed for different target users, including the Genome Variation Map (Song et al., 2018), RiceVarMap (Zhao et al., 2015), SNP-Seek database (Alexandrov et al., 2015), RPAN (Sun et al., 2017) and MBK V1 (Institute of Genetics and Developmental Biology, C.A.S., 2018). However, none of them is designed to meet increasing demands of correlation mining between huge sets of phenotypic and genotypic data of re-sequenced genome. Online tool for deeper mining of favourable allele/haplotypes is in urgent need. In 2015, prototype of Rice Functional and Genomic Breeding (RFGB) was developed for breeding application (Zheng et al., 2015) based on SNP & InDel data from the 3000 rice genome (3K-RG) project (The-3K-rice-genomes-project, 2014). In order to bridge huge gaps between phenotypic and genotypic data sets of sequenced genome, our newly designed web service named RFGB v2.0 is now formally online (http://www.rmbreeding.cn/Index/). RFGB v2.0 contains five major modules, including Phenotype, Haplotype, SNP & InDel, Germplasm and Restore Sequence. The phenotypic data for 3K accession were embedded in the Phenotype module. Currently, data of 12 traits are already publicly accessible. To improve the adaptivity, a function of real-time analysis with user uploaded phenotypic data is provided. In the analysis function embedded in the Haplotype module, non-synonym SNPs were considered. The functions of allelic frequency (MAF) screening and sample definition are also available. In the SNP & InDel module, information of a gene query is accessible by searching with either locus ID, chromosome region or even key words. In the Germplasm module, grouping of the 3K germ-plasms has been updated according to our recent report (Wang et al., 2018). The germplasm data set was re-organized to improve data efficiency. Additionally, we have corrected calculation errors in the trial version of the Restore Sequence module with full consideration of InDel. More details about the RFGB v2.0 construction and utility methodologies are accessible through our online materials (http://www.rmbreeding.cn/Index/manual). Four typical user cases picked from our web tests were presented here: 1) exploring favourable donors, 2) shortlisting candidate genes, 3) mining favourable haplotypes and 4) seeking variations and restore sequences. A typical pre-breeding/forward genetics work begins with phenotyping. As shown in Figure 1a, we adopted our published data set of zinc concentration in milled grains (Zn) (Zhang et al., 2018) with normal distribution as a sample case. Interestingly, by uploading data to the Phenotype module, user found that Zn, the custom trait (uploaded data), varied between different groups of the 3K-RG with the box-plot viewer. Three groups of the 3K-RG germplasms were found to present top Zn values, which are Geng/japonica (GJ), Aus and Basmati (Bas) groups. For breeding schemes working on Geng/japonica cultivar development, germplasm from GJ group with higher Zn values would be recommended as breeding donors. On the contrary, for a Xian/indica breeding scheme, donors from Aus and/or Bas group would be favourable. Further details for the above donors are accessible using the Germplasm module.
Figure 1

Four user cases working on exploring favourable donors, shortlisting candidate genes, mining favourable haplotypes and seeking target sequence/variations using RFGBv2.0. (a) User analysed phenotypic distribution by uploading zinc concentration in milled grains (Zn) data using the Phenotypic module. With the information of elite group for Zn, more details of favourable germplasms for different breeding schemes were accessible using the Germplasm module. (b) With aid of the Haplotype module, a list of 22 differentially expressed genes (DEGs) based on the transcriptomic and phosphoproteomic analyses between near-isogenic line (NIL) and its recurrent parent (RP) were significantly reduced by 31.8%. (c) Based on GWAS mapping results of tiller number (TN) in a set of germplasms, some candidate regions were submitted to the Haplotype module for confirmation by haplotype analysis. (d) A chromosome region controlling leaf rolling trait (LRI), qRl4-2 was scanned for SNP & InDel variations using the SNP & InDel module. With the Restore Sequence module, restore sequence information was available for design of primers to confirm the LRI candidate region.

Four user cases working on exploring favourable donors, shortlisting candidate genes, mining favourable haplotypes and seeking target sequence/variations using RFGBv2.0. (a) User analysed phenotypic distribution by uploading zinc concentration in milled grains (Zn) data using the Phenotypic module. With the information of elite group for Zn, more details of favourable germplasms for different breeding schemes were accessible using the Germplasm module. (b) With aid of the Haplotype module, a list of 22 differentially expressed genes (DEGs) based on the transcriptomic and phosphoproteomic analyses between near-isogenic line (NIL) and its recurrent parent (RP) were significantly reduced by 31.8%. (c) Based on GWAS mapping results of tiller number (TN) in a set of germplasms, some candidate regions were submitted to the Haplotype module for confirmation by haplotype analysis. (d) A chromosome region controlling leaf rolling trait (LRI), qRl4-2 was scanned for SNP & InDel variations using the SNP & InDel module. With the Restore Sequence module, restore sequence information was available for design of primers to confirm the LRI candidate region. As shown in Figure 1b, a route for shortlisting candidate genes with RFGB v2.0 was supported by a user case working on grain length (GL) dissection with four steps: 1) construct near-isogenic lines (NIL) for the target locus; 2) set up multiple omics analysis for the NIL and recurrent parent (RP) to get a list of differentially expressed genes (DEGs); 3) use the Haplotype module for haplotype–phenotype association analysis; and 4) shortlist the DEG list based on comparisons of the target trait differences between haplotypes. In the GL case, transcriptomic and phosphoproteomic analyses were carried out for NIL and RP, which were found to be significantly different in GL. A total of 22 DEGs were found. Then, we carried out haplotype analysis with the GL data set embedded in RFGB v2.0, which has more than 2,000 valid GL data points at present. With the parameter setting of MAF = non-filtered, the candidate gene list was shortened by dropping 31.8% genes off the list which were found to be non-related GL with intuitive supporting evidences from the Haplotype module. As shown in Figure 1c, a route for mining favourable haplotypes with RFGB v2.0 includes four major steps: 1) trait evaluation for the germplasms from the 3K-RG; 2) definition of candidate regions by GWAS/linkage mapping; 3) shortlist candidate genes with multiple evidences; and 4) explore favourable haplotypes of candidate genes for target traits. In this case, by GWAS mapping, we found a loci qSV3e, which was harbouring a known gene Os03 g0856700 coding Gibberellin 20 oxidase 1 affecting both tiller number (TN) and plant height (PH) at the seedling stage under a paddy direct seeding rice (PDSR) system. Since relatively higher TN would be favourable trait for rice breeders, we carried our haplotype analysis for Os03 g0856700 with TN data uploaded using the Haplotype module. We found that Hap 7 of Os03 g0856700 contributed more to TN than the other haplotypes and could be adopted as candidate elite haplotypes for rice molecular breeding under PDSR system. A donor list of 16 3K-RG germplasms carrying Hap 7 were also accessible in the Haplotype module. In addition to the above explorations, a route for variation mining with RFGB v2.0 includes the following three steps (Figure 1d): 1) define a target region for targeting trait with partial accessions from the 3K-RG, 2) access more variations within the target region with the SNP & InDel module and 3) access the restore sequences harbouring the targeting variations using the Restore Sequence module for further wet-lab confirmation. In our user case, a target region, qRl4-2, was defined with GWAS with no more than 1000 accessions for a targeting trait, leaf rolling index (LRI). With the JBrowse engine embedded in the SNP & InDel module, a deeper mining of variations within qRl4-2 in more than 2000 additional germplasm was feasible. Restore sequences of elite germplasms were then downloaded according to the sub-region harbouring the genome variations according to genome browsing with the SNP & InDel module. Further wet-lab work would be carried out based on these results. RFGB v2.0 has offered a unique view on the relationships between two big data sets (phenotypic and genotypic data) of sequenced genome, especially a real-time analysis for the phenotype–haplotype associations. With different combinations of the modules and functions embedded in RFGB v2.0, users may feel free to have a deeper view of targeting complex trait. This may inspire more ideas on deeper mining of complex traits with sequenced genome data. Finally, in order set up an open platform to the public, two functions are now available. One is ‘Seed request’ (http://www.rmbreeding.cn/public/request_germplasm) which helps users to access the 3K-RG seeds more easily for their own phenotyping works. The other is ‘contribute your data to RFGB’ (http://www.rmbreeding.cn/Phenotype/contribute_phenotype) which helps users upload their own phenotypic data to RFGB. With respect to intellectual property, at present, only the published data are fully open for download.
  7 in total

1.  RiceVarMap: a comprehensive database of rice genomic variations.

Authors:  Hu Zhao; Wen Yao; Yidan Ouyang; Wanneng Yang; Gongwei Wang; Xingming Lian; Yongzhong Xing; Lingling Chen; Weibo Xie
Journal:  Nucleic Acids Res       Date:  2014-10-01       Impact factor: 16.971

2.  SNP-Seek database of SNPs derived from 3000 rice genomes.

Authors:  Nickolai Alexandrov; Shuaishuai Tai; Wensheng Wang; Locedie Mansueto; Kevin Palis; Roven Rommel Fuentes; Victor Jun Ulat; Dmytro Chebotarov; Gengyun Zhang; Zhikang Li; Ramil Mauleon; Ruaraidh Sackville Hamilton; Kenneth L McNally
Journal:  Nucleic Acids Res       Date:  2014-11-27       Impact factor: 16.971

3.  Joint Exploration of Favorable Haplotypes for Mineral Concentrations in Milled Grains of Rice (Oryza sativa L.).

Authors:  Guo-Min Zhang; Tian-Qing Zheng; Zhuo Chen; Yong-Li Wang; Ying Wang; Yu-Min Shi; Chun-Chao Wang; Li-Yan Zhang; Jun-Tao Ma; Ling-Wei Deng; Wan Li; Tian-Tian Xu; Cheng-Zhi Liang; Jian-Long Xu; Zhi-Kang Li
Journal:  Front Plant Sci       Date:  2018-04-12       Impact factor: 5.753

4.  The 3,000 rice genomes project.

Authors: 
Journal:  Gigascience       Date:  2014-05-28       Impact factor: 6.524

5.  RPAN: rice pan-genome browser for ∼3000 rice genomes.

Authors:  Chen Sun; Zhiqiang Hu; Tianqing Zheng; Kuangchen Lu; Yue Zhao; Wensheng Wang; Jianxin Shi; Chunchao Wang; Jinyuan Lu; Dabing Zhang; Zhikang Li; Chaochun Wei
Journal:  Nucleic Acids Res       Date:  2016-12-10       Impact factor: 16.971

6.  Genome Variation Map: a data repository of genome variations in BIG Data Center.

Authors:  Shuhui Song; Dongmei Tian; Cuiping Li; Bixia Tang; Lili Dong; Jingfa Xiao; Yiming Bao; Wenming Zhao; Hang He; Zhang Zhang
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

7.  Genomic variation in 3,010 diverse accessions of Asian cultivated rice.

Authors:  Wensheng Wang; Ramil Mauleon; Zhiqiang Hu; Dmytro Chebotarov; Shuaishuai Tai; Zhichao Wu; Min Li; Tianqing Zheng; Roven Rommel Fuentes; Fan Zhang; Locedie Mansueto; Dario Copetti; Millicent Sanciangco; Kevin Christian Palis; Jianlong Xu; Chen Sun; Binying Fu; Hongliang Zhang; Yongming Gao; Xiuqin Zhao; Fei Shen; Xiao Cui; Hong Yu; Zichao Li; Miaolin Chen; Jeffrey Detras; Yongli Zhou; Xinyuan Zhang; Yue Zhao; Dave Kudrna; Chunchao Wang; Rui Li; Ben Jia; Jinyuan Lu; Xianchang He; Zhaotong Dong; Jiabao Xu; Yanhong Li; Miao Wang; Jianxin Shi; Jing Li; Dabing Zhang; Seunghee Lee; Wushu Hu; Alexander Poliakov; Inna Dubchak; Victor Jun Ulat; Frances Nikki Borja; John Robert Mendoza; Jauhar Ali; Jing Li; Qiang Gao; Yongchao Niu; Zhen Yue; Ma Elizabeth B Naredo; Jayson Talag; Xueqiang Wang; Jinjie Li; Xiaodong Fang; Ye Yin; Jean-Christophe Glaszmann; Jianwei Zhang; Jiayang Li; Ruaraidh Sackville Hamilton; Rod A Wing; Jue Ruan; Gengyun Zhang; Chaochun Wei; Nickolai Alexandrov; Kenneth L McNally; Zhikang Li; Hei Leung
Journal:  Nature       Date:  2018-04-25       Impact factor: 49.962

  7 in total
  17 in total

1.  OsWRKY115 on qCT7 links to cold tolerance in rice.

Authors:  Hualong Liu; Luomiao Yang; Shanbin Xu; Ming-Jie Lyu; Jingguo Wang; Huan Wang; Hongliang Zheng; Wei Xin; Jun Liu; Detang Zou
Journal:  Theor Appl Genet       Date:  2022-05-27       Impact factor: 5.574

2.  Long-read sequencing of 111 rice genomes reveals significantly larger pan-genomes.

Authors:  Fan Zhang; Hongzhang Xue; Xiaorui Dong; Min Li; Xiaoming Zheng; Zhikang Li; Jianlong Xu; Wensheng Wang; Chaochun Wei
Journal:  Genome Res       Date:  2022-04-08       Impact factor: 9.438

3.  OsPDCD5 negatively regulates plant architecture and grain yield in rice.

Authors:  Shiqing Dong; Xianxin Dong; Xiaokang Han; Fan Zhang; Yu Zhu; Xiaoyun Xin; Ying Wang; Yuanyi Hu; Dingyang Yuan; Jianping Wang; Zhou Huang; Fuan Niu; Zejun Hu; Peiwen Yan; Liming Cao; Haohua He; Junru Fu; Yeyun Xin; Yanning Tan; Bigang Mao; Bingran Zhao; Jinshui Yang; Longping Yuan; Xiaojin Luo
Journal:  Proc Natl Acad Sci U S A       Date:  2021-07-20       Impact factor: 11.205

4.  Identification and candidate gene screening of qCIR9.1, a novel QTL associated with anther culturability in rice (Oryza sativa L.).

Authors:  Cuihong Huang; Jian Zhang; Danhua Zhou; Yuting Huang; Ling Su; Guili Yang; Wenlong Luo; Zhiqiang Chen; Hui Wang; Tao Guo
Journal:  Theor Appl Genet       Date:  2021-03-13       Impact factor: 5.574

5.  Leaf Mutant 7 Encoding Heat Shock Protein OsHSP40 Regulates Leaf Size in Rice.

Authors:  Fuhua Wang; Zhengbin Tang; Ya Wang; Jing Fu; Wenbo Yang; Shengxuan Wang; Yuetao Wang; Tao Bai; Zhibo Huang; Haiqing Yin; Zhoufei Wang
Journal:  Int J Mol Sci       Date:  2022-04-18       Impact factor: 6.208

6.  Natural variation in OsGASR7 regulates grain length in rice.

Authors:  Zhengbin Tang; Xiuying Gao; Xiangyun Zhan; Nengyan Fang; Ruqin Wang; Chengfang Zhan; Jiaqi Zhang; Guang Cai; Jinping Cheng; Yongmei Bao; Hongsheng Zhang; Ji Huang
Journal:  Plant Biotechnol J       Date:  2020-07-06       Impact factor: 9.803

7.  Interpopulation differences of retroduplication variations (RDVs) in rice retrogenes and their phenotypic correlations.

Authors:  Haiyue Zeng; Xingyu Chen; Hongbo Li; Jun Zhang; Zhaoyuan Wei; Yi Wang
Journal:  Comput Struct Biotechnol J       Date:  2021-01-05       Impact factor: 7.271

8.  Genomics Analyses Reveal Unique Classification, Population Structure and Novel Allele of Neo-Tetraploid Rice.

Authors:  Hang Yu; Qihang Li; Yudi Li; Huijing Yang; Zijun Lu; Jinwen Wu; Zemin Zhang; Muhammad Qasim Shahid; Xiangdong Liu
Journal:  Rice (N Y)       Date:  2021-02-06       Impact factor: 4.783

Review 9.  Genomic resources in plant breeding for sustainable agriculture.

Authors:  Mahendar Thudi; Ramesh Palakurthi; James C Schnable; Annapurna Chitikineni; Susanne Dreisigacker; Emma Mace; Rakesh K Srivastava; C Tara Satyavathi; Damaris Odeny; Vijay K Tiwari; Hon-Ming Lam; Yan Bin Hong; Vikas K Singh; Guowei Li; Yunbi Xu; Xiaoping Chen; Sanjay Kaila; Henry Nguyen; Sobhana Sivasankar; Scott A Jackson; Timothy J Close; Wan Shubo; Rajeev K Varshney
Journal:  J Plant Physiol       Date:  2020-12-17       Impact factor: 3.549

10.  Genome-Wide Association Study Dissects Resistance Loci against Bacterial Blight in a Diverse Rice Panel from the 3000 Rice Genomes Project.

Authors:  Jialing Lu; Chunchao Wang; Dan Zeng; Jianmin Li; Xiaorong Shi; Yingyao Shi; Yongli Zhou
Journal:  Rice (N Y)       Date:  2021-02-27       Impact factor: 4.783

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.