Literature DB >> 23161682

PRGdb 2.0: towards a community-based database model for the analysis of R-genes in plants.

Walter Sanseverino1, Antonio Hermoso, Raffaella D'Alessandro, Anna Vlasova, Giuseppe Andolfo, Luigi Frusciante, Ernesto Lowy, Guglielmo Roma, Maria Raffaella Ercolano.   

Abstract

The Plant Resistance Genes database (PRGdb; http://prgdb.org) is a comprehensive resource on resistance genes (R-genes), a major class of genes in plant genomes that convey disease resistance against pathogens. Initiated in 2009, the database has grown more than 6-fold to recently include annotation derived from recent plant genome sequencing projects. Release 2.0 currently hosts useful biological information on a set of 112 known and 104 310 putative R-genes present in 233 plant species and conferring resistance to 122 different pathogens. Moreover, the website has been completely redesigned with the implementation of Semantic MediaWiki technologies, which makes our repository freely accessed and easily edited by any scientists. To this purpose, we encourage plant biologist experts to join our annotation effort and share their knowledge on resistance-gene biology with the rest of the scientific community.

Entities:  

Mesh:

Year:  2012        PMID: 23161682      PMCID: PMC3531111          DOI: 10.1093/nar/gks1183

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Plants protect themselves from disease by activating a broad array of defense responses that ultimately inhibit growth and spread of invading pathogens. The principal immune mechanism against pathogens in plants is mediated by resistance (R) gene (1). However, the essential components of this defense system remain still unknown even if it has been deeply investigated. High-throughput genomic experiments and plant genome sequences available in public databases are offering unprecedented opportunities to identify novel R-genes, to explore their function and their diversification process, to discover new resistance capacity, and ultimately to elucidate their mechanisms of interaction between pathogens and their plant hosts. With the intention to facilitate research on this agriculturally important gene family, we launched in 2009 the Plant Disease Resistance Gene database, a comprehensive repository of R-genes across hundreds of plant species (2). PRGdb version 1.0 comprised a total of 16 844 gene entries, of which 73 were known and manual-curated R-genes (e.g. the ‘reference’ data set), 6308 were putative R-genes retrieved from NCBI Genbank, and 10 463 were putative R-genes computationally predicted from NCBI UniGene transcriptomic data using the in-house developed bioinformatics pipeline DRAGO (Disease Resistance Analysis and Gene Orthology) (2). In the last few years, several plant genome-sequencing projects have progressed rapidly. For instance, tomato (3), potato (4) and melon (5) genomes were recently completed, and thus providing the opportunity to discover additional R-genes. Therefore, the huge amount of sequence data not yet analysed for this gene family required a substantial update of our database for addressing the community requests. In this manuscript, we present an update of the plant resistance gene database, PRGdb. In the release 2.0, our database has been expanded more than 6-fold to include useful biological information about a total of 104 459 R-genes (of which ∼90 000 newly annotated) from 233 plant species. Of these entries, 112 are manual-curated R-genes described in the literature to confer resistance to 122 different pathogens. All remaining genes were predicted by our DRAGO bioinformatics pipeline from sequences available in public transcriptomic and genomic databases (NCBI Genbank, NCBI UniGene, Phytozome) (6), as well as from gene models annotated in recent plant genome sequence projects, such as potato (4), tomato (3) and melon (5). Last but not least, we completely revised our web interface that is now based on Semantic MediaWiki technologies, which makes our repository freely accessible and easily editable by the scientific community. With this new wiki design, PRGdb aims to provide a forum for scientists working in this research field. Therefore, we do invite plant biology experts to join our efforts and help us on making PRGdb a more collaborative and comprehensive resource through their valuable contributions.

DATA UPDATE

To support the scientific community working on plant disease resistance area, we updated the PRGdb knowledge-base. In this release 2.0, the number of genes of previous version has grown more than 6-fold, new sections have been created and the domain analysis and the sequence classification of the candidate R-genes have been improved. To date, PRGdb 2.0 stores 112 ‘reference’ R-genes (40 more than the previous version), and 122 pathogen and 233 plant species (45 more than the previous version). The analysis with the DRAGO bioinformatics pipeline on 2012 NCBI ‘nr’ and UniGene databases yielded a total of 34 558 putative R-genes (20 000 more than the previous version). In addition to these traditional annotated categories, we included a new dataset predicted by DRAGO from genomic sequences available at Phytozome v8 (6), thus incrementing our repository of 68 509 putative R-genes from 31 plant sequenced genomes. Each putative R-gene sequence has been annotated with DNA and amino acid sequences, information about species and predicted protein domains. Noteworthy large-scale data released from PRGdb are listed in Table 1. Finally, another unique and important set of R-genes has been annotated by our team during the analysis of plant genome species recently sequenced and not yet present in the Phytozome database, such as tomato, potato and melon. For these species, a complete analysis of their R-genes panorama has been accomplished and a dedicated section of the database has been built to store, visualize, and consult these unique results (see wiki ‘metaspecies’ section). Table 2 shows a comparison of the R-gene classes identified in these agriculturally important species. Moreover, in this update, we include new identified R-protein domains, thus reaching a total of nine well-known domain types: the MLO (PFAM PF03094) (7), the RPW8 (8), and the GNK2 domain extracted from the antifungal protein Ginkobilobin2-1 (Interpro IPR002902) (9), and the already present CC (Coiled Coil), TIR (Toll-Interleukin like Region), NBS (Nucleotide Binding site), LRR (Leucine-Rich Region), KIN (Kinase) and Other (all other domains which have been described as conferring resistance through different molecular mechanisms) (2). The aforementioned new domain types helped to better classify the new candidate R-genes (Table 2). All data mentioned have been organized in new database sections as well-established repository for plant scientists. In particular, single gene pages are now available to directly visualize gene features allowing comparison of R-gene complements across entire plant lineages.
Table 1.

List of main entries present in PRG database

EntriesNo.
Plants233
Pathogens122
Domain types9
Gene classes16
Manually curated R-Genes112
Putative R-Genes, collected from NCBI Protein9639
Putative R-Genes, predicted from NCBI UniGene24 918
Putative R-Genes, predicted from Pythozome68 498
R-genes from plant genome sequencing projects1143
Total number of plant genes104 310
Avirulence genes23
Diseases120
Table 2.

Putative R-genes retrieved in each sequenced species divided for class

SpeciesCNCNLMlo-likeNNLOtherRLKRLK-GNK2RLPRPW8-NLTTNTNLUnknownTotal
Aquilegia coerulea 191452389690052282110010101691
Arabidopsis lyrata 14115101150703819833712057711568
Arabidopsis lyrata subsp. lyrata 336234140001514217025629962
Arabidopsis thaliana 6574840125230465390580191801268331956066
Areca catechu 3329531126481271837511228047132175826
Brachypodium distachyon 489615571022700100240505
Brassica rapa 317121654010272306326011911211887
Capsella rubella 1866164725121632425430567851379
Chlamydomonas reinhardtii 30025000000000165193
Citrus clementina 18340241011761152844911508413962648
Citrus sinensis 31203421312240355947733307619163230
Cucumis melo 020002000000105
Cucumis sativus 12159316015201219247010010471740
Ectocarpus siliculosus 00215001000000118136
Eucalyptus grandis 2423929104314271706931111030023324326
Eutrema halophilum 155015512470402462270727391288
Glycine max 371326019028108141803878022930685035
Gossypium arboreum 7170205675003623211008271425
Linum usitatissimum 14242582440329346136011412271945
Malus x domestica 3231742996631120715184115029928115002
Medicago truncatula 1598291053570358297495029116643016
Oryza barthii 01215711000019195190337211176
Oryza sativa 149117915331162439120015482800
Panicum virgatum 37332241155320264478030022933880
Pennisetum glaucum 17212742281224239140198441269
Phaseolus vulgaris 1811429692270367306218011412552222
Physcomitrella patens 7621893810022512067701166
Populus trichocarpa 14188351668094656540763032219464156
Prunus persica 81322269138003027311901319811804
Ricinus communis 1510316661612136240010212241867
Selaginella moellendorffii 012079406712700007691013
Setaria italica 2219416732043045314020011432016
Solanum lycopersicum 2268941104450103407547
Solanum tuberosum 07547102350804288397
Sorghum bicolor 51316943203045298150012182018
Vitis aestivalis 14102236198082526111302912991934
Vitis vinifera 171643396300033235511004212682321
Volvox carteri 1041500000000087107
Zea mays 241065912510464463515420023303382
List of main entries present in PRG database Putative R-genes retrieved in each sequenced species divided for class

NEW FEATURES

Several scientific databases allow a direct interaction of users for sharing experimental and theoretical information. Most of them have a wiki structure and can contain specific or more generic data. Examples are Gene Wiki (10,11), WikiPathways (12), SubtiWiki (13), EcoliWiki (14), Tetrahymena genome database Wiki (15), Reactome (16) or SNPedia (17). Like these, PRGdb interface has been completely rewritten in a wiki system because in this way, it can be easier to get a larger scientific community be involved in the accumulation of data. We have used MediaWiki, the open-source software behind Wikipedia, to build this second version of the PRGdb with new features, a friendly interface, and new tools. Mapping our existing database structure into the wiki has been possible thanks to a set of Semantic MediaWiki extensions. This allows users to access and list different wiki pages according to multiple and flexible criteria and, at the same time, it offers a linking point for third parties to existing data via RDF triples attached to every page (18). By using Semantic technologies, PRGdb becomes another node of the linked data web (http://linkeddata.org/) On the other hand, it retrieves and links to resources such as Bio2RDF (19) (e.g. for retrieving additional species images) and potentially to other semantic empowered sites. Other international plant databases use different models to interact with users. Among them, TAIR (20), SOLGenomics Network (21), Cucurbit Genomics Database (http://www.icugi.org) and Plant Organelles Database (22) are based on specific submission forms to contact database developers or systems based on information provided by emails; SOLGenomics Network has a forum section allowing users to interact as well. In our case, we take advantage of default wiki features such as change tracking and other social-like components, such as user pages, which show users activity and can be filled by plant disease resistance researchers. Moreover, apart from plain wiki editing for documentation purposes, several specific contribution entry points are also provided, so that registered users can suggest corrections or additions, such as related bibliography or even new reference genes. A group of reviewers, experts in the plant disease resistance field, will have the necessary permission to approve these contributions and, from that moment, these will be easily available and also linkable for third parties. In this way, PRGdb becomes a community database for plant scientists who could in turn contribute to this public resource collaboratively. According with our final intent, wiki pages have been designed in a user-friendly version, enriched with interactive images and pages that summarize species with known R-genes (Figure 1). In the wiki pages, the user can find internal links connecting different sections of the website and improving accessibility to sequence, resistance gene data and related published information. New sections about plant and pathogens have been added, allowing users to quickly retrieve information from their species of interest. Some interactive images have also been included to facilitate navigation in the web site and to summarize results coming from data collection. Moreover, wiki pages are also intended to provide technical details about the pipelines adopted to predict R-genes and information coming from international scientific literature built collaboratively. In particular, notable scientist from all around the world that showed interest in our initiative will be asked to be in PRGdb review board. Reviewers will responsible for reviewing scientific accuracy of information. Members of primary specialty will be asked to review content areas that coincide directly with their expertise. Information in this section not only can be directly added by users, but can also be directly linked to other scientific databases. A variety of links to other database relevant to the topic are included in the initial menu.
Figure 1.

An overview of the wiki interface of the PRGdb version 2.0. (A) Home page; (B) BLAST results; (C) Clickable image of species annotated with R-genes; (D) Examples of metaspecies page, and two examples of wiki search results; (E) Reference R-gene wiki page; (F) Contributed wiki page.

An overview of the wiki interface of the PRGdb version 2.0. (A) Home page; (B) BLAST results; (C) Clickable image of species annotated with R-genes; (D) Examples of metaspecies page, and two examples of wiki search results; (E) Reference R-gene wiki page; (F) Contributed wiki page. Moreover, we have developed a new interface for BLAST search (23,24), implemented directly in the wiki system, so that results links are directly connected to gene wiki pages. All the sequences stored in the wiki are also provided as downloadable files, which are updated periodically from approved user wiki contributors.

FUTURE PERSPECTIVES

Newly sequenced plant genomes will be monitored from public resources, such as NCBI, Phytozome or any other specialized resource. Releases are expected to take place twice per year by integrating new data from our reference database plus the accumulated user contributions up to that moment. At the technical level, we will continue the employment of pipelines and algorithms to automate the management and classification of the sequence data and to better organize the content of the database. Connecting the various resources with phenotypic, genomic and molecular information will become a crucial task for mining results.

FUNDING

Italian Ministry of Education, University and Research (GenHort Project). Funding for open access charge: Department of Soil, Plant, Environment and Animal Production Sciences, University of Naples ‘Federico II’, via Università 100, 80055, Portici, Italy. Spanish Ministry of Economy and Competitiveness funded infrastructure technician support. Ref.: PTA2010-4446-I. Conflict of interest statement. None declared.
  24 in total

1.  Topology, subcellular localization, and sequence diversity of the Mlo family in plants.

Authors:  A Devoto; P Piffanelli; I Nilsson; E Wallin; R Panstruga; G von Heijne; P Schulze-Lefert
Journal:  J Biol Chem       Date:  1999-12-03       Impact factor: 5.157

2.  Broad-spectrum mildew resistance in Arabidopsis thaliana mediated by RPW8.

Authors:  S Xiao; S Ellwood; O Calis; E Patrick; T Li; M Coleman; J G Turner
Journal:  Science       Date:  2001-01-05       Impact factor: 47.728

Review 3.  The plant immune system.

Authors:  Jonathan D G Jones; Jeffery L Dangl
Journal:  Nature       Date:  2006-11-16       Impact factor: 49.962

4.  Bio2RDF: towards a mashup to build bioinformatics knowledge systems.

Authors:  François Belleau; Marc-Alexandre Nolin; Nicole Tourigny; Philippe Rigault; Jean Morissette
Journal:  J Biomed Inform       Date:  2008-03-21       Impact factor: 6.317

5.  Genome sequence and analysis of the tuber crop potato.

Authors:  Xun Xu; Shengkai Pan; Shifeng Cheng; Bo Zhang; Desheng Mu; Peixiang Ni; Gengyun Zhang; Shuang Yang; Ruiqiang Li; Jun Wang; Gisella Orjeda; Frank Guzman; Michael Torres; Roberto Lozano; Olga Ponce; Diana Martinez; Germán De la Cruz; S K Chakrabarti; Virupaksh U Patil; Konstantin G Skryabin; Boris B Kuznetsov; Nikolai V Ravin; Tatjana V Kolganova; Alexey V Beletsky; Andrei V Mardanov; Alex Di Genova; Daniel M Bolser; David M A Martin; Guangcun Li; Yu Yang; Hanhui Kuang; Qun Hu; Xingyao Xiong; Gerard J Bishop; Boris Sagredo; Nilo Mejía; Wlodzimierz Zagorski; Robert Gromadka; Jan Gawor; Pawel Szczesny; Sanwen Huang; Zhonghua Zhang; Chunbo Liang; Jun He; Ying Li; Ying He; Jianfei Xu; Youjun Zhang; Binyan Xie; Yongchen Du; Dongyu Qu; Merideth Bonierbale; Marc Ghislain; Maria del Rosario Herrera; Giovanni Giuliano; Marco Pietrella; Gaetano Perrotta; Paolo Facella; Kimberly O'Brien; Sergio E Feingold; Leandro E Barreiro; Gabriela A Massa; Luis Diambra; Brett R Whitty; Brieanne Vaillancourt; Haining Lin; Alicia N Massa; Michael Geoffroy; Steven Lundback; Dean DellaPenna; C Robin Buell; Sanjeev Kumar Sharma; David F Marshall; Robbie Waugh; Glenn J Bryan; Marialaura Destefanis; Istvan Nagy; Dan Milbourne; Susan J Thomson; Mark Fiers; Jeanne M E Jacobs; Kåre L Nielsen; Mads Sønderkær; Marina Iovene; Giovana A Torres; Jiming Jiang; Richard E Veilleux; Christian W B Bachem; Jan de Boer; Theo Borm; Bjorn Kloosterman; Herman van Eck; Erwin Datema; Bas te Lintel Hekkert; Aska Goverse; Roeland C H J van Ham; Richard G F Visser
Journal:  Nature       Date:  2011-07-10       Impact factor: 49.962

6.  The Sol Genomics Network (solgenomics.net): growing tomatoes using Perl.

Authors:  Aureliano Bombarely; Naama Menda; Isaak Y Tecle; Robert M Buels; Susan Strickler; Thomas Fischer-York; Anuradha Pujar; Jonathan Leto; Joseph Gosselin; Lukas A Mueller
Journal:  Nucleic Acids Res       Date:  2010-10-08       Impact factor: 16.971

7.  The Gene Wiki in 2011: community intelligence applied to human gene annotation.

Authors:  Benjamin M Good; Erik L Clarke; Luca de Alfaro; Andrew I Su
Journal:  Nucleic Acids Res       Date:  2011-11-10       Impact factor: 16.971

8.  The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools.

Authors:  Philippe Lamesch; Tanya Z Berardini; Donghui Li; David Swarbreck; Christopher Wilks; Rajkumar Sasidharan; Robert Muller; Kate Dreher; Debbie L Alexander; Margarita Garcia-Hernandez; Athikkattuvalasu S Karthikeyan; Cynthia H Lee; William D Nelson; Larry Ploetz; Shanker Singh; April Wensel; Eva Huala
Journal:  Nucleic Acids Res       Date:  2011-12-02       Impact factor: 16.971

9.  SubtiWiki--a comprehensive community resource for the model organism Bacillus subtilis.

Authors:  Ulrike Mäder; Arne G Schmeisky; Lope A Flórez; Jörg Stülke
Journal:  Nucleic Acids Res       Date:  2011-11-16       Impact factor: 16.971

10.  Reactome: a knowledge base of biologic pathways and processes.

Authors:  Imre Vastrik; Peter D'Eustachio; Esther Schmidt; Geeta Joshi-Tope; Gopal Gopinath; David Croft; Bernard de Bono; Marc Gillespie; Bijay Jassal; Suzanna Lewis; Lisa Matthews; Guanming Wu; Ewan Birney; Lincoln Stein
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

View more
  35 in total

1.  A high-quality carrot genome assembly provides new insights into carotenoid accumulation and asterid genome evolution.

Authors:  Massimo Iorizzo; Shelby Ellison; Douglas Senalik; Peng Zeng; Pimchanok Satapoomin; Jiaying Huang; Megan Bowman; Marina Iovene; Walter Sanseverino; Pablo Cavagnaro; Mehtap Yildiz; Alicja Macko-Podgórni; Emilia Moranska; Ewa Grzebelus; Dariusz Grzebelus; Hamid Ashrafi; Zhijun Zheng; Shifeng Cheng; David Spooner; Allen Van Deynze; Philipp Simon
Journal:  Nat Genet       Date:  2016-05-09       Impact factor: 38.330

2.  Transcriptome characterization and expression profiling in chestnut cultivars resistant or susceptible to the gall wasp Dryocosmus kuriphilus.

Authors:  Alberto Acquadro; Daniela Torello Marinoni; Chiara Sartor; Francesca Dini; Matteo Macchio; Roberto Botta
Journal:  Mol Genet Genomics       Date:  2019-09-10       Impact factor: 3.291

3.  Genome-wide identification and characterization of NB-ARC resistant genes in wheat (Triticum aestivum L.) and their expression during leaf rust infection.

Authors:  Saket Chandra; Andaleeb Z Kazmi; Zainab Ahmed; Gargi Roychowdhury; Veena Kumari; Manish Kumar; Kunal Mukhopadhyay
Journal:  Plant Cell Rep       Date:  2017-04-11       Impact factor: 4.570

4.  Genome-wide analysis of the gene families of resistance gene analogues in cotton and their response to Verticillium wilt.

Authors:  Jie-Yin Chen; Jin-Qun Huang; Nan-Yang Li; Xue-Feng Ma; Jin-Long Wang; Chuan Liu; Yong-Feng Liu; Yong Liang; Yu-Ming Bao; Xiao-Feng Dai
Journal:  BMC Plant Biol       Date:  2015-06-19       Impact factor: 4.215

5.  A 4-gigabase physical map unlocks the structure and evolution of the complex genome of Aegilops tauschii, the wheat D-genome progenitor.

Authors:  Ming-Cheng Luo; Yong Q Gu; Frank M You; Karin R Deal; Yaqin Ma; Yuqin Hu; Naxin Huo; Yi Wang; Jirui Wang; Shiyong Chen; Chad M Jorgensen; Yong Zhang; Patrick E McGuire; Shiran Pasternak; Joshua C Stein; Doreen Ware; Melissa Kramer; W Richard McCombie; Shahryar F Kianian; Mihaela M Martis; Klaus F X Mayer; Sunish K Sehgal; Wanlong Li; Bikram S Gill; Michael W Bevan; Hana Simková; Jaroslav Dolezel; Song Weining; Gerard R Lazo; Olin D Anderson; Jan Dvorak
Journal:  Proc Natl Acad Sci U S A       Date:  2013-04-22       Impact factor: 11.205

6.  A novel workflow correlating RNA-seq data to Phythophthora infestans resistance levels in wild Solanum species and potato clones.

Authors:  Itziar Frades; Kibrom B Abreha; Estelle Proux-Wéra; Åsa Lankinen; Erik Andreasson; Erik Alexandersson
Journal:  Front Plant Sci       Date:  2015-09-17       Impact factor: 5.753

7.  Genetic analysis of Verticillium wilt resistance in a backcross inbred line population and a meta-analysis of quantitative trait loci for disease resistance in cotton.

Authors:  Jinfa Zhang; Jiwen Yu; Wenfeng Pei; Xingli Li; Joseph Said; Mingzhou Song; Soum Sanogo
Journal:  BMC Genomics       Date:  2015-08-05       Impact factor: 3.969

Review 8.  Evolution and Conservation of Plant NLR Functions.

Authors:  Florence Jacob; Saskia Vernaldi; Takaki Maekawa
Journal:  Front Immunol       Date:  2013-09-25       Impact factor: 7.561

Review 9.  Silicon era of carbon-based life: application of genomics and bioinformatics in crop stress research.

Authors:  Man-Wah Li; Xinpeng Qi; Meng Ni; Hon-Ming Lam
Journal:  Int J Mol Sci       Date:  2013-05-29       Impact factor: 5.923

10.  Analyses of hypomethylated oil palm gene space.

Authors:  Eng-Ti L Low; Rozana Rosli; Nagappan Jayanthi; Ab Halim Mohd-Amin; Norazah Azizi; Kuang-Lim Chan; Nauman J Maqbool; Paul Maclean; Rudi Brauning; Alan McCulloch; Roger Moraga; Meilina Ong-Abdullah; Rajinder Singh
Journal:  PLoS One       Date:  2014-01-30       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.