Literature DB >> 35218297

Plant Public RNA-seq Database: a comprehensive online database for expression analysis of ~45 000 plant public RNA-Seq libraries.

Yiming Yu1,2,3,4, Hong Zhang2,3,4, Yanping Long2,3,4, Yi Shu2,3,4, Jixian Zhai2,3,4.   

Abstract

Entities:  

Keywords:  zzm321990Glycine maxzzm321990; zzm321990Gossypium hirsutumzzm321990; zzm321990Oryza sativazzm321990; zzm321990Triticum aestivumzzm321990; zzm321990Zea mayszzm321990; RNA-seq; database; transcriptome

Mesh:

Year:  2022        PMID: 35218297      PMCID: PMC9055819          DOI: 10.1111/pbi.13798

Source DB:  PubMed          Journal:  Plant Biotechnol J        ISSN: 1467-7644            Impact factor:   13.263


× No keyword cloud information.
Dear Editor, High‐throughput RNA‐sequencing (RNA‐seq) has become the most popular technology for profiling gene expression in the last decade due to its low cost and high coverage. As a result, the number of RNA‐seq libraries from the plant community has been increasing exponentially in recent years (Figure 1a). For major crops, such as maize, rice, soybean, wheat and cotton, the plant community has collected a total of ~45 000 libraries by 2021 (Figure 1b). Although currently there are several RNA‐seq databases for plants, for example, CoNekT with 750 rice and 574 maize RNA‐seq libraries (https://conekt.sbs.ntu.edu.sg/). However, these existing databases only host the already processed data from each study separately, and therefore, the expression values cannot be directly compared among projects, because they were derived from different bioinformatic pipelines and often mapped to different versions of the reference genomes. To take full advantage of the big data of RNA‐seq libraries, an effort to integrate all publicly available libraries via a uniformed processing pipeline and curate them into an easy‐to‐use searchable database is urgently needed. To address this challenge, here we present a comprehensive web‐based platform, Plant Public RNA‐seq Database (PPRD, http://ipf.sustech.edu.cn/pub/plantrna/). PPRD consists of a large number of RNA‐seq libraries of maize (19 664), rice (11 726), soybean (4085), wheat (5816) and cotton (3483) from Gene Expression Omnibus (GEO), Sequence Read Archive (SRA), European Nucleotide Archive (ENA) and DNA Data Bank of Japan (DDBJ) databases (Figure 1b). These RNA‐seq data are manually curated to highlight different mutants, tissues, developmental stages and abiotic or biotic stresses. Besides showing expression patterns from different tissues and developmental stages (Figure 1c–e), we also annotated the mutant‐related groups and treatment‐associated groups in our maize, rice, soybean, wheat and cotton database, respectively (Figure 1f,g). To reduce the quantification biases derived from differing bioinformatic processes, we processed the data of each species with a unified pipeline and the most up‐to‐date reference genomes (more details on the ‘Tutorials’ page and Appendix S1). Moreover, the database also provided hyperlinks to check the expression level of the homologous genes in other plants and supported a built‐in online Integrative Genomics Viewer (IGV) (Figure 1h; Robinson et al., 2017).
Figure 1

Overview of Plant Public RNA‐Seq Database. (a) The number of Oryza sativa, Zea mays, Glycine max, Triticum aestivum and Gossypium hirsutum sequenced bases per year from 2010 to 2020. Bar indicates the bases deposited per year (GB). Line indicates the total number of bases (GB). GB, giga base pairs. (b) The basic summary of RNA‐seq libraries. ‘Mutant‐related groups’ and ‘treatment‐related groups’ denote the number of groups used to analyse the differential expression. (c–e) The tissue‐specific expression of some marker genes. The left panel shows the endosperm‐specific expression of ZmESR1 in maize (c), the middle panel shows the endosperm‐specific expression of Wx in rice (d), and the right panel displays the root‐specific expression of GmTIP4;1 in soybean (e). (f) The expression level of OsLecRK3 (LOC_Os04g12580) among top10 biotic stresses in rice. (g) Down‐regulated expression of OsLecRK3 (LOC_Os04g12580) among top10 treatment groups in rice. (h) The overview of IGV. The mapped reads of OsLecRK3 show decreased abundance in drought stress‐related samples.

Overview of Plant Public RNA‐Seq Database. (a) The number of Oryza sativa, Zea mays, Glycine max, Triticum aestivum and Gossypium hirsutum sequenced bases per year from 2010 to 2020. Bar indicates the bases deposited per year (GB). Line indicates the total number of bases (GB). GB, giga base pairs. (b) The basic summary of RNA‐seq libraries. ‘Mutant‐related groups’ and ‘treatment‐related groups’ denote the number of groups used to analyse the differential expression. (c–e) The tissue‐specific expression of some marker genes. The left panel shows the endosperm‐specific expression of ZmESR1 in maize (c), the middle panel shows the endosperm‐specific expression of Wx in rice (d), and the right panel displays the root‐specific expression of GmTIP4;1 in soybean (e). (f) The expression level of OsLecRK3 (LOC_Os04g12580) among top10 biotic stresses in rice. (g) Down‐regulated expression of OsLecRK3 (LOC_Os04g12580) among top10 treatment groups in rice. (h) The overview of IGV. The mapped reads of OsLecRK3 show decreased abundance in drought stress‐related samples. In general, PPRD supports searches by gene ID, library ID, BioProject IDs, keywords or any combination of these terms in selected libraries. After querying the above terms, the results in tables and diagrams will be returned. Here, we take the query results of a key regulator of plant small RNA biogenesis, OsDCL3a (LOC_Os01g68120) (Wei et al., 2014), to illustrate the database. After entering ‘LOC_Os01g68120’ in a ‘Google‐like’ search box, the ‘Information’ page will return the basic information of this gene. PPRD also provides hyperlinks for easy access to more information about the corresponding gene in the species‐related websites, such as MaizeGDB for maize (Portwood et al., 2019), RGAP for rice (Kawahara et al., 2013) and SoyBase for soybean (Brown et al., 2021). On the ‘Data Table’ page, detailed information could be displayed in a table, and various ‘Filter’ options are designed to allow users to select specific libraries. The ‘Data Plot’ page shows the results of expression comparison in multiple interactive diagrams, including expression levels among different tissues, developmental stages, abiotic and biotic stresses and up‐regulated or down‐regulated expression in mutant‐related or treatment‐related samples. The ‘CoExpression’ page provides a list of genes co‐expressed with the searched one, and the ‘IGV Online’ page is flexible for visualizing the mapping landscape of the local genomic region in selected libraries. In addition, the ‘Share’ function was supported to facilitate showing the results with others. Here, we used the tissue‐specific expressed genes to validate the results. The expression levels of these genes are consistent with previous studies, such as endosperm‐specific expression of gene ZmESR1 (Zm00001d027820) in maize (Opsahl‐Ferstad et al., 1997), endosperm‐specific expression of gene Wx (LOC_Os06g04200) in rice (Sano, 1984) and root‐specific expression of gene GmTIP4;1 (Glyma.06G084600) in soybean (Song et al., 2016) (Figure 1c–e). Plant Public RNA‐seq Database also supports users to perform data mining from the large‐scale database efficiently. The brown planthopper (BPH) is the most destructive pest that has a massive impact on rice production by the transformations of viruses, and OsLecRK3 (LOC_Os04g12580) is a crucial gene that confers resistance to the BPH (Liu et al., 2015). As expected, OsLecRK3 showed higher expression in some viruses‐related libraries (Figure 1f). To our surprise, OsLecRK3 is down‐regulated in many drought‐related libraries, suggesting that OsLecRK3 plays a crucial role in drought resistance (Figure 1g). In addition, the mapping details of this gene can be visualized using the built‐in IGV browser (Figure 1h). This example showed the exciting power of big data in providing novel insights and quickly developing robust, testable hypotheses with no experimental cost. In summary, PPRD is a convenient, web‐accessible, user‐friendly RNA‐seq database that allows users to quickly scan the gene expression from maize, rice, soybean, wheat or cotton public RNA‐seq libraries and returns the multiple forms of results in tables and diagrams, showing the expression levels in various tissues, developmental stages, abiotic stresses, biotic stresses, as well as the differential expression in different mutants and treatments. Our previous Arabidopsis RNA‐seq database (ARS) has been updated recently, and the number of libraries has been increased from 20 068 to 28 164 (Zhang et al., 2020). We also plan to continue updating PPRD regularly by including new libraries and new plant species in the future. We believe PPRD will help make the transcriptome big data more available and accessible for our plant community members.

Conflicts of interest

The authors declare no conflicts of interest.

Author contributions

H.Z., Y.Y., Y.L. and Y.S. analysed the data, H.Z. and Y.Y. processed the data and built the database and website, and J.Z oversaw the study. Y.Y., H.Z. and J.Z. wrote the manuscript. Appendix S1 Supplementary Methods. Click here for additional data file.
  11 in total

1.  ZmEsr, a novel endosperm-specific gene expressed in a restricted region around the maize embryo.

Authors:  H G Opsahl-Ferstad; E Le Deunff; C Dumas; P M Rogowsky
Journal:  Plant J       Date:  1997-07       Impact factor: 6.417

2.  A gene cluster encoding lectin receptor kinases confers broad-spectrum and durable insect resistance in rice.

Authors:  Yuqiang Liu; Han Wu; Hong Chen; Yanling Liu; Jun He; Haiyan Kang; Zhiguang Sun; Gen Pan; Qi Wang; Jinlong Hu; Feng Zhou; Kunneng Zhou; Xiaoming Zheng; Yulong Ren; Liangming Chen; Yihua Wang; Zhigang Zhao; Qibing Lin; Fuqing Wu; Xin Zhang; Xiuping Guo; Xianian Cheng; Ling Jiang; Chuanyin Wu; Haiyang Wang; Jianmin Wan
Journal:  Nat Biotechnol       Date:  2014-12-08       Impact factor: 54.908

3.  A new decade and new data at SoyBase, the USDA-ARS soybean genetics and genomics database.

Authors:  Anne V Brown; Shawn I Conners; Wei Huang; Andrew P Wilkey; David Grant; Nathan T Weeks; Steven B Cannon; Michelle A Graham; Rex T Nelson
Journal:  Nucleic Acids Res       Date:  2020-12-02       Impact factor: 16.971

4.  A Comprehensive Online Database for Exploring ∼20,000 Public Arabidopsis RNA-Seq Libraries.

Authors:  Hong Zhang; Fei Zhang; Yiming Yu; Li Feng; Jinbu Jia; Bo Liu; Bosheng Li; Hongwei Guo; Jixian Zhai
Journal:  Mol Plant       Date:  2020-08-05       Impact factor: 13.164

5.  Variant Review with the Integrative Genomics Viewer.

Authors:  James T Robinson; Helga Thorvaldsdóttir; Aaron M Wenger; Ahmet Zehir; Jill P Mesirov
Journal:  Cancer Res       Date:  2017-11-01       Impact factor: 12.701

6.  Dicer-like 3 produces transposable element-associated 24-nt siRNAs that control agricultural traits in rice.

Authors:  Liya Wei; Lianfeng Gu; Xianwei Song; Xiekui Cui; Zhike Lu; Ming Zhou; Lulu Wang; Fengyi Hu; Jixian Zhai; Blake C Meyers; Xiaofeng Cao
Journal:  Proc Natl Acad Sci U S A       Date:  2014-02-19       Impact factor: 11.205

7.  Soybean TIP Gene Family Analysis and Characterization of GmTIP1;5 and GmTIP2;5 Water Transport Activity.

Authors:  Li Song; Na Nguyen; Rupesh K Deshmukh; Gunvant B Patil; Silvas J Prince; Babu Valliyodan; Raymond Mutava; Sharon M Pike; Walter Gassmann; Henry T Nguyen
Journal:  Front Plant Sci       Date:  2016-10-21       Impact factor: 5.753

8.  MaizeGDB 2018: the maize multi-genome genetics and genomics database.

Authors:  John L Portwood; Margaret R Woodhouse; Ethalinda K Cannon; Jack M Gardiner; Lisa C Harper; Mary L Schaeffer; Jesse R Walsh; Taner Z Sen; Kyoung Tak Cho; David A Schott; Bremen L Braun; Miranda Dietze; Brittney Dunfee; Christine G Elsik; Nancy Manchanda; Ed Coe; Marty Sachs; Philip Stinard; Josh Tolbert; Shane Zimmerman; Carson M Andorf
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

9.  Plant Public RNA-seq Database: a comprehensive online database for expression analysis of ~45 000 plant public RNA-Seq libraries.

Authors:  Yiming Yu; Hong Zhang; Yanping Long; Yi Shu; Jixian Zhai
Journal:  Plant Biotechnol J       Date:  2022-03-06       Impact factor: 13.263

10.  Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data.

Authors:  Yoshihiro Kawahara; Melissa de la Bastide; John P Hamilton; Hiroyuki Kanamori; W Richard McCombie; Shu Ouyang; David C Schwartz; Tsuyoshi Tanaka; Jianzhong Wu; Shiguo Zhou; Kevin L Childs; Rebecca M Davidson; Haining Lin; Lina Quesada-Ocampo; Brieanne Vaillancourt; Hiroaki Sakai; Sung Shin Lee; Jungsok Kim; Hisataka Numa; Takeshi Itoh; C Robin Buell; Takashi Matsumoto
Journal:  Rice (N Y)       Date:  2013-02-06       Impact factor: 4.783

View more
  7 in total

1.  Genome-Wide Analysis of Type-III Polyketide Synthases in Wheat and Possible Roles in Wheat Sheath-Blight Resistance.

Authors:  Xingxia Geng; Yihua Chen; Shufa Zhang; Zhen Gao; Shuhui Liu; Qunhui Yang; Jun Wu; Xinhong Chen
Journal:  Int J Mol Sci       Date:  2022-06-28       Impact factor: 6.208

2.  Deciphering Haplotypic Variation and Gene Expression Dynamics Associated with Nutritional and Cooking Quality in Rice.

Authors:  Nitika Rana; Surbhi Kumawat; Virender Kumar; Ruchi Bansal; Rushil Mandlik; Pallavi Dhiman; Gunvant B Patil; Rupesh Deshmukh; Tilak Raj Sharma; Humira Sonah
Journal:  Cells       Date:  2022-03-28       Impact factor: 6.600

3.  Plant Public RNA-seq Database: a comprehensive online database for expression analysis of ~45 000 plant public RNA-Seq libraries.

Authors:  Yiming Yu; Hong Zhang; Yanping Long; Yi Shu; Jixian Zhai
Journal:  Plant Biotechnol J       Date:  2022-03-06       Impact factor: 13.263

4.  Genome-Wide Association Study Reveals a Genetic Mechanism of Salt Tolerance Germinability in Rice (Oryza sativa L.).

Authors:  Caijing Li; Changsheng Lu; Baoli Zou; Mengmeng Yang; Guangliang Wu; Peng Wang; Qin Cheng; Yanning Wang; Qi Zhong; Shiying Huang; Tao Huang; Haohua He; Jianmin Bian
Journal:  Front Plant Sci       Date:  2022-07-15       Impact factor: 6.627

5.  GhWRKY33 Interacts with GhTIFY10A to Synergistically Modulate Both Ageing and JA-Mediated Leaf Senescence in Arabidopsis.

Authors:  Songguo Wu; Huimin Zhang; Ruling Wang; Guimei Chang; Yifen Jing; Zhifang Li; Ligang Chen
Journal:  Cells       Date:  2022-07-29       Impact factor: 7.666

6.  Characteristics of members of IGT family genes in controlling rice root system architecture and tiller development.

Authors:  Jianping Zhao; Lihui Jiang; Hanrui Bai; Yuliang Dai; Kuixiu Li; Saijie Li; Xiaoran Wang; Lixia Wu; Qijing Fu; Yanfen Yang; Qian Dong; Si Yu; Meixian Wang; Haiyan Liu; Ziai Peng; Haiyan Zhu; Xiaoyan Zhang; Xie He; Yan Lei; Yan Liang; Liwei Guo; Hongji Zhang; Decai Yu; Yixiang Liu; Huichuan Huang; Changning Liu; Sheng Peng; Yunlong Du
Journal:  Front Plant Sci       Date:  2022-08-26       Impact factor: 6.627

7.  Lateral transfers lead to the birth of momilactone biosynthetic gene clusters in grass.

Authors:  Dongya Wu; Yiyu Hu; Shota Akashi; Hideaki Nojiri; Longbiao Guo; Chu-Yu Ye; Qian-Hao Zhu; Kazunori Okada; Longjiang Fan
Journal:  Plant J       Date:  2022-07-18       Impact factor: 7.091

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.