Literature DB >> 34716373

PlantPathMarks (PPMdb): an interactive hub for pathways-based markers in plant genomes.

Morad M Mokhtar1, Achraf El Allali2, Mohamed-Elamir F Hegazy3, Mohamed A M Atia4.   

Abstract

ABSTRACT: Over the past decade, the problem of finding an efficient gene-targeting marker set or signature for plant trait characterization has remained challenging. Many databases focusing on pathway mining have been released with one major deficiency, as they lack to develop marker sets that target only genes controlling a specific pathway or certain biological process. Herein, we present the PlantPathMarks database (PPMdb) as a comprehensive, web-based, user-friendly, and interactive hub for pathway-based markers in plant genomes. Based on our newly developed pathway gene set mining approach, two novel pathway-based marker systems called pathway gene-targeted markers (PGTMs) and pathway microsatellite-targeted markers (PMTMs) were developed as a novel class of annotation-based markers. In the PPMdb database, 2,690,742 pathway-based markers reflecting 9,894 marker panels were developed across 82 plant genomes. The markers include 691,555 PGTMs and 1,999,187 PMTMs. Across these genomes, 165,378 enzyme-coding genes were mapped against 126 KEGG reference pathway maps. PPMdb is furnished with three interactive visualization tools (Map Browse, JBrowse and Species Comparison) to visualize, map, and compare the developed markers over their KEGG reference pathway maps. All the stored marker panels can be freely downloaded. PPMdb promises to create a radical shift in the paradigm of the area of molecular marker research. The use of PPMdb as a mega-tool represents an impediment for non-bioinformatician plant scientists and breeders. PPMdb is freely available at http://ppmdb.easyomics.org .
© 2021. The Author(s).

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34716373      PMCID: PMC8556342          DOI: 10.1038/s41598-021-00504-2

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Today, an enormous amount of released biodata collections stimulates the development of computerized applications worldwide[1]. Inspiration drives scientists to develop intelligent approaches to extract hidden knowledge behind these massive amounts of data. Biodata mining attempts to find novel, reliable, useful, and meaningful insights from these vast amounts of data. Over the last decades, the scope of biodata mining has expanded from genome-mining to phenome-mining approaches[2]. Its applications outside of basic research have become extensive to study plants’ genetic diversity and improve economic crop breeding programs. Generally, plants produce various chemical compounds involving nutritional or medicinal benefits[3]. Our understanding of plant natural products’ biological pathways is still insufficient; however, scientists assume that genomic and metabolomic information can provide clues about unidentified enzymes and reactions involved in a particular biosynthesis process[4]. Plants have undergone complicated evolutionary events that have resulted naturally or are due to human-made plant breeding experiments and finally lead to polyploidy or genome duplication[3]. This duplication drives the availability of genes either as homologous or paralogous. Due to the phylogenetic relationship of homologous genes, they continue to preserve their core functions. Gene products are often mapped to many pathways revealing the same catalyst properties under different reaction conditions or showing the same reaction but in distinct subcellular locations[5]. Such an association provides a mechanism to study genes’ roles; their expression may be regulated spatially or temporally due to evolutionary implications on plants, gene products, gene functions, and, finally, developmental stages[1]. A biological pathway is broadly defined as a series of actions among cell molecules that lead to a particular product or cellular change. It can also trigger novel molecule assembly, turn genes on and off, or drive a cell to move. Any biological pathway comprises a cluster of genes that exhibit identical or complementary biological functions[6]. Nowadays, pathway analysis is a flourishing research area in systems biology. It relies on extracting knowledge from raw data generated from high-throughput sequencing technologies by building a model that describes and summarizes underlying biological processes. These high-throughput technologies mostly provide a list of differentially expressed genes (DEGs) between a control and a case of interest. Unfortunately, generated DEG data are usually removed from their biological context, causing the resultant genes to require further validation against biological pathways to prove their biological significance[7]. Notably, each of these genes may be a part of many pathways, which usually start with the expression of a gene of a particular ligand and is terminated with an alteration in the concentration of a specific signaling micro-molecule[8]. The dilemma of plant trait/metabolite characterization and improvement based on single or few genes targeting marker selections remains challenging, and many approaches have been developed to tackle these issues[9]. For decades, massive studies have aimed to determine a few differentially expressed marker genes participating in certain natural products’ biosynthesis processes or conferring a particular phenotype[10-12]. Nevertheless, these methods may not be accurate and may lack in capturing an in-depth snapshot of biological processes. Therefore, studies focusing on gene sets belonging to a particular pathway to investigate and explain phenotypic changes have gained increased popularity. These methods are more acceptable, as they assess the significance of a group of predefined genes with shared biological functions to explain specific phenotypes[13-15]. In contrast to traditional gene-based methods, pathway-based methods combined with biological knowledge can help gain a better understanding of functional insights into phenotypic differences. Pathway-mining approaches attempt to rectify such drawbacks by utilizing the available biological knowledge base about the structures and operations of biological pathways accompanying computational methods[5]. Therefore, instead of focusing on specific gene-based marker development, our proposed pathway gene set mining (PGM) approach is critically more appropriate and promises to create a radical shift in the paradigm of molecular markers as a research area. The dramatically increasing number of sequenced plant genomes is supposed to reveal more basic architectural principles of biosynthetic pathways for generating chemical diversity of natural products. Based on released knowledge on interactions between genes and their phenotypic reflections, many publicly available databases have been developed for storing this pathway mining information[16]. After numerous plant genomes are fully sequenced, scientists often search for an efficient scheme that can analyze their data and answer certain biological questions[3]. Common types of analyses include pathway enrichment analysis and the analysis of gene ontology and gene–gene interactions. Over the last decade, many powerful resources for studying metabolic pathways in plants, such as KEGG[17], Gramene[18], Plant Reactome[19], MapMan[20], MetaCyc[21], Plant Metabolic Network[22], and BioCyc[23], have been developed. These databases try their best to integrate the most available information when they were initially launched. Their construction process includes manual curation and computational efforts, regular screening of newly released publications, and updating of interactors. Despite extraordinary efforts, no single database is fully equipped with all the preferred information and may not accommodate all the necessary bioinformatics tools. Notably, all these resources have the significant limitation of missing genome-scale marker sets that target only the genes responsible for a specific pathway or that are involved in/control a particular biological process at the level. Gene sets involved in a particular pathway often work together in an orchestral pattern to reflect a specific biological function. Herein, we proposed a novel PGM pipeline/approach to develop a specific marker set for each pathway at the genome-scale level in plant genomes. Microsatellites, also known as simple sequence repeats (SSRs), are a set of one to six nucleotides that are repeated tandemly and classified either into perfect, imperfect, or composite repeats[24,25]. Due to their unique characteristics (hypervariability, multi-allelic nature, high reproducibility, Mendelian inheritance, and overall high abundance in plant genomes), as well as their essential roles in many biological functions (modulating transcription factor binding, tuning knobs of gene expression levels, organizing chromatin, nucleosome positioning, acting as recombination hot spots, and facilitating unusual structural conformations), microsatellites have become one of the most preferred choices among all genetic markers because they afford a molecular basis for plant genome evolution and demonstrate fast acclimatization to many abiotic stresses[26]. Therefore, we proposed a mining schema of microsatellites within gene sets involved in all plant genome pathways as a robust and functional approach to develop a novel marker system called pathway microsatellite-targeted markers (PMTMs). Ultimately, this study aims to develop a comprehensive hub for two novel pathway-based marker systems coined pathway gene-targeted markers (PGTMs) and pathway microsatellite-targeted markers (PMTMs) in plant genomes. Furthermore, the developed markers were mapped against reference KEGG pathway maps interactively. The use of these powerful portal represents an impediment for non-bioinformatician plant scientists and breeders.

Materials and methods

Data collection

To build a comprehensive pathway-based marker database in plants, we retrieved complete genome sequences and annotations of 82 various plant species, including 72 eudicots and 10 monocots. The sequences and annotations were downloaded as GenBank files from NCBI’s FTP site (ftp://ftp.ncbi.nlm.nih.gov/). Furthermore, we downloaded all KEGG gene annotations available on the KEGG database of these plant genomes (approximately 2,296,000 genes) in addition to all KEGG pathway reference maps (https://www.genome.jp/kegg/).

Data analysis and database construction

To construct the PlantPathMarks (PPMdb) core database, we integrated two powerful tools called MIcroSAtellite identification (MISA)[27] and primer3[28] into our in-house-developed “Perl and Shell” scripts to strap all standalone bioinformatics analysis steps in one main pipeline called the PGM pipeline. This pipeline was built to develop two novel marker systems called PGTM and PMTM. This pipeline was implemented to construct our PPMdb sub-databases through seven main steps: (a) splitting the retrieved genomic data into sequence and annotation files, (b) classifying genomic sequence into genic or intergenic, (c) sub-classifying genes into enzyme coding or non-enzyme coding, (d) mining microsatellites on enzyme-coding genes only (this step exclusive to PMTM), (e) designing primers and classifying markers into PMTM or PGTM, (f) mapping all designed primers against KEGG reference maps[17] besides linking associated information (sequence and annotation) to all pathway-mapped markers, and (g) data transforming into the PPMdb database format and implementing all datasets into the PPMdb web interface (Fig. 1).
Figure 1

The workflow of pathway gene-set mining (PGM) pipeline.

The workflow of pathway gene-set mining (PGM) pipeline. For effective marker development, we used a straightforward strategy to boost data analysis and marker generation processes. Before we began, we established in-house “Perl and Shell” scripts to convert all retrieved genome sequence and annotation data and unify input and output files used for the development of the PPMdb database. Initially, the data retrieved from the 82 plant genomes were dissected to classify each genome into genic and intergenic according to the genome’s annotation information. This classification step was achieved with the aid of the gene-finding format (GFF) file for each genome. The gene ID list for each plant was further searched against the KEGG database (https://www.genome.jp/kegg/ genes.html) to sub-classify genes into enzyme coding or non-enzyme coding. Thereafter, perfect and compound microsatellite motifs were identified using the MISA tool[27] according to the following parameters: mono- (≥ 9), di- (≥ 4), tri- (≥ 3), tetra- (≥ 3), penta- (≥ 2), and hexa-nucleotide (≥ 2). The parameter was set as ≥ 2 repeats interrupted by ≤ 20 bp for the characterization of compound microsatellite motifs[29]. PMTM and PGTM primers were then designed for all enzyme-coding genes for all plant genomes using the Primer3 software[28] according to the following criteria: a) optimal primer length of 20 bp; b) optimal melting temperature of 55 °C; c) product size range of 90–500 bp for PMTM primers and 100–1000 bp for PGTM primers; and d) a 50% G/C content. Then, e) a unique primer ID was assigned. All developed PMTM and PGTM primers were mapped against the KEGG pathway reference maps (https://www.genome.jp/kegg/pathway.html) by using in-house “Perl and Shell” scripts to integrate and localize our developed PMTM and PGTM primers and build a map set of marker-enriched pathways for each plant. Each map contains a pathway ID, pathway image, IDs of mapped enzymes (highlighted), PMTM/PGTM markers associated with mapped enzymes, annotation information of each marker, and other information related to this primer (Tm, GC%, length, etc.). All developed maps were visualized in an attractive user form using Jquery ImageMapster Plugin[30]. The JBrowse tool[31] was also integrated into our PPMdb database as a powerful web application for genome analysis and visualization. Herein, it was used to map and browse the identified microsatellite motifs and developed PMTMs and PGTMs and their linked information for each organism. All generated data of PMTMs, PGTMs, and maps were further processed to build a standard information platform for all marker types before integrating them into the PPMdb SQL database. With the aid of the in-house “Perl and Shell” scripts, we batch processed all developed markers, maps, and any associated data and converted them into a consistent format. Additionally, we uploaded these scripts to online open-source housing website (GitHub) to make it available for all users at the following link (https://github.com/MoradMMokhtar/PlantPathMarks-Scripts.git). The PPMdb database was developed with aid of the LPPM (Linux + Perl + PHP + MySQL) web application platform; finally, JavaScript, CSS, and HTML languages were used to design a user-friendly interactive web interface.

Features and utilities

Database interface

PPMdb presents an interactive, user-friendly portal well equipped with many features to enable users to search and download PMTMs and PGTMs across 82 plant genomes. PPMdb provides users with a navigation bar designed to help access the PPMdb database sections and tools in a responsive and convenient way. The PPMdb data can be straightforwardly browsed and retrieved via nine interactive pages: Home (PPMdb Quick-Access), Database Search, Map Browse, JBrowse, Statistics, Data Resources, Bulk Download, Species Comparisons and Manual. Under these pages, plants are taxonomically grouped or sorted alphabetically to improve the convenience of exploration and selection. The Home page introduces PPMdb as a comprehensive database along with its PGM pipeline by describing the PPMdb database sections, in addition to providing users with a “PPMdb Quick-Access; Analyzed Genomes in PPMdb Database Section” option for all analyzed plant genomes. In the “PPMdb Quick-Access” section, plant genomes analyzed in the PPMdb database are categorized according to plant type/class (dicot plants or monocot plants). Under each group, interactively, the plants are sorted alphabetically to enable straightforward access and searches within each plant genome independently through “Map Browse” or “JBrowse” quick-access links (Fig. 2).
Figure 2

Screenshot of the PPMdb homepage (A) Database description and (B) PPMdb Quick-Access section.

Screenshot of the PPMdb homepage (A) Database description and (B) PPMdb Quick-Access section. The Database Search page provides users with a searching utility within two main sections: (a) the Search section, which enables users to obtain results by selecting the following types of interest keywords: pathway name (e.g., Glycolysis), marker type (e.g., PGTM), and organism name (e.g., Arabidopsis lyrata), from the available drop-down menus to effortlessly access and retrieve all marker and map data stored in the PPMdb database. (b) The KEGG Pathway Maps Overview section, which provides users the necessary information for each pathway, such as pathway ID (hyperlinked to KEGG ref. pathways), pathway class (e.g., metabolism), pathway sub-class (e.g., carbohydrate metabolism), and pathway map. The search results of this section are designed in a user-friendly style containing essential descriptive information for each marker (e.g., enzyme ID, marker type, repeat type and sequence [exclusive to PMTM], primer sequence, primer annealing temperature, primer position within CDS, product size, JBrowse view link, NCBI gene accession number and CDS sequence, and gene information) (Fig. 3).
Figure 3

Screenshot of the PPMdb “Database Search” page layout.

Screenshot of the PPMdb “Database Search” page layout. In addition, general statistic tables are presented to summarize the total number of designed primers, number of PGTM primers, number of PMTM primers, and number of failed designed PMTM primers within the selected pathway of a particular plant. All search results can be downloaded freely for each plant straightforwardly as a CSV file (Fig. 3). The Map Browse page offers users with searching utility in two styles inside the same page: (a) selecting the plant of interest from the page-side plant list for a direct access of pathways and markers, (b) selecting the plant of interest from drop-down menus to directly access and obtain all markers and maps. In both styles, the search results are presented in a convenient, visualization-supported, and well-dissected manner involving essential information for each pathway, such as pathway ID (hyperlinked to KEGG ref. pathways), pathway class (e.g., metabolism), pathway sub-class (e.g., carbohydrate metabolism), pathway map browse (hyperlinked to interactive pathway map localizing all developed PMTM and PGTM markers on the pathway map), pathway description, and pathway map (Fig. 4).
Figure 4

Screenshot of the PPMdb database “Map Browse” section including page layout and Pathway markers mapping layout.

Screenshot of the PPMdb database “Map Browse” section including page layout and Pathway markers mapping layout. The “JBrowse” page enables the users to visualize and map all identified microsatellite motifs besides all developed PMTM and PGTM markers against the Refseq CDS for each plant genome by selecting the plant of interest from the page-side plant list. The mapped microsatellite motifs and developed PMTM/PGTM markers were linked to its essential information (Fig. 5).
Figure 5

Screenshot of the PPMdb database “JBrowse” page.

Screenshot of the PPMdb database “JBrowse” page. The ‘Statistics’ page was designed to provide users a primary indication of the taxonomy ID, number of mapped pathways, number of mapped enzymes, number of designed primers, number of PGTM primers, number of PMTM primers, and number of failed designed PMTM primers for each plant species stored in the PPMdb database. The Data Resources page provides users with hyperlinks of all types/formats of the data utilized to build the PPMdb database. The page includes hyperlinks of taxonomy ID, Genbank file, Fasta file, GFF file, and KEGG gene annotation for each plant species analyzed within the PPMdb. The Bulk Download page enables users to download all developed primers and their secondary information under a specific pathway in the organism of interest through the selection of organism name (e.g., Arabidopsis lyrata), pathway name (e.g., Glycolysis/Gluconeogenesis), and marker type (e.g., PGTM), from the drop-down menus straightforwardly in a convenient manner. The Species Comparisons page offers users a powerful utility to compare two or three organisms in a particular pathway. The comparison results are provided under three main sections: general statistics, marker statistics, and distribution of the various SSR classes. The general statistics section layouts the numbers of mapped pathways, mapped enzymes, total designed primers, number of designed PGTM and PMTM primers under a particular pathway between the organisms of interest. Furthermore, the page offers a simple comparison of the distribution of various SSR classes within the pathway of interest. All comparisons are offered in a simple and effective visualization style.

Statistics and discussion

PPMdb database statistics

As of January 2021, the PPMdb was launched and consisted of 2.7 million pathway-based markers distributed over 9894 marker panels developed across 82 plant genomes. Across these genomes, 165,378 enzyme-coding genes were mapped against 126 KEGG reference pathway maps. The SSR mining of all enzyme-coding genes identified 3,471,782 SSR motifs, including 2,844,501 perfect motifs and 627,281 compound motifs across the 82 plant genomes. Moreover, a total of 691,555 PGTMs and 1,999,187 PMTMs were developed (Fig. 6). This massive number of developed pathway-based markers have been mapped and supplemented with all essential information to offer users a modern version of markers called “pathway gene set markers”. All developed molecular markers for the 82 plant species are saved in separate backend tables for each plant. These datasets are searchable and can be downloaded conveniently via the PPMdb website. The statistical records of all analyzed plant genomes within the PPMdb, including the number of mapped pathways and designed primers, are summarized in Table 1.
Figure 6

Statistical layout of the 82 analyzed plant genomes. The middle part shows the plants’ scientific names (each plant is assigned to a unique color). The black-colored numbers refer to the number of mapped enzymes within each plant genome, where the blue-colored numbers refer to the number of developed PGTMs, and the purple-colored numbers refer to the number of developed PMTMs.

Table 1

Summarize the number of mapped pathways and the number of designed primers of each plant genome.

Plant NameNo. of mapped pathwaysNo. of designed primersPlant NameNo. of mapped pathwaysNo. of designed primers
Arabidopsis lyrata12127,138Lupinus angustifolius12136,373
Arabidopsis thaliana12028,598Malus domestica12134,730
Arachis duranensis12128,876Manihot esculent12129,493
Arachis ipaensis12129,122Medicago truncatula12031,198
Asparagus officinalis12023,804Momordica charantia12019,537
Beta vulgaris12021,396Nelumbo nucifera12024,488
Brachypodium distachyon12129,339Nicotiana attenuata12127,056
Brassica napus12182,544Nicotiana sylvestris12026,301
Brassica oleracea12141,415Nicotiana tabacum12048,326
Brassica rapa12141,458Nicotiana tomentosiformis12026,408
Cajanus cajan12129,112Olea europaea12130,611
Camelina sativa12171,883Oryza brachyantha12127,551
Capsella rubella12126,457Oryza sativa Japonica12138,757
Capsicum annuum12126,524Papaver somniferum12046,150
Carica papaya12017,993Phalaenopsis equestris11920,473
Chenopodium quinoa12039,628Phaseolus vulgaris12127,001
Cicer arietinum12125,279Populus euphratica12133,292
Citrus clementina12122,067Populus trichocarpa12176,911
Citrus sinensis12123,316Prunus avium12124,449
Cucumis melo12050,235Prunus mume12123,764
cucumis sativus12022,641Prunus persica12123,604
Cucurbita maxima12031,349PyrusXbretschneideri12133,747
Cucurbita moschata12031,857Quercus suber12147,644
Cucurbita pepo12032,658Raphanus sativus12147,881
Daucus carota12128,750Ricinus communis12121,616
Dendrobium catenatum12022,022Rosa chinensis12126,410
Durio zibethinus12137,592Sesamum indicum12125,735
Elaeis guineensis12031,528Setaria italica12132,295
Eucalyptus grandis12129,917Solanum lycopersicum12026,382
Eutrema salsugineum12126,003Solanum pennellii12126,602
Glycine max12146,715Solanum tuberosum12027,323
Glycine soja12147,199Sorghum bicolor12133,047
Gossypium arboreum12133,110Spinacia oleracea12021,937
Gossypium hirsutum12153,945Tarenaya hassleriana12130,052
Gossypium raimondii12133,246Theobroma cacao12122,544
Helianthus annuus12137,275Vigna angularis12127,066
Hevea brasiliensis12133,813Vigna radiata12126,690
Ipomoea nil12130,569Vigna unguiculata12128,687
Jatropha curcas11866,782Vitis vinifera12125,183
Juglans regia12029,028Zea mays12135,395
Lactuca sativa12129,366Ziziphus jujuba12126,484
Statistical layout of the 82 analyzed plant genomes. The middle part shows the plants’ scientific names (each plant is assigned to a unique color). The black-colored numbers refer to the number of mapped enzymes within each plant genome, where the blue-colored numbers refer to the number of developed PGTMs, and the purple-colored numbers refer to the number of developed PMTMs. Summarize the number of mapped pathways and the number of designed primers of each plant genome.

Insights

Advances in plant omics promise to transform the molecular markers research area, in which the main challenge will not be the development of novel markers rather than the optimum selection and validation of a group of useful functional markers from the big collection of candidates[32]. For many decades, genetic diversity and molecular breeding studies have involved a few pre-specified candidate markers/genes. This knowledge-based approach was found to run a high risk of missing critical genes related to interest traits. Genes or gene families involved in a biological pathway are often switched on or off together to reflect a particular biological function or elucidate specific phenotypes[1]. In plants, complex economic traits such as yield, resistance to a particular disease, production of secondary metabolites, etc.,… have thus driven the scientists' needs for new ‘systems’-based approaches that can illuminate the molecular mechanisms underlying specific trait(s) rather than the effect of distinct genes[3]. Based on this deep understanding, this study's scientific vision provides a platform for the development of pathways-based markers toward designing future studies that aim to disentangle the causal biological pathways and their phenotypic reflections. As a part of this vision, microsatellites have been utilized as one of the most common sources of genetic markers, which have served as a keystone for massive genetic studies due to their robust and unique features. Microsatellites, which demonstrate their efficiency in broad applications, such as diversity studies, genome mapping, molecular breeding, and molecular phylogeny[33,34], were successfully implemented in our developed PGM pipeline to develop the PMTM system as a novel class of pathway-targeted markers. In silico microsatellite mining on a plant genome scale is expected to advance our understanding and elucidate the functional impact of microsatellites within biological pathways and, consequently, in the context of systems biology[35]. This study also aimed to develop and map pathway-based genetic marker panels that support and boost molecular breeding programs, genetic diversity, and genetic characterization studies. From this point of view, the value of our developed marker panels will need to be assessed in the context of availability and mining of biological pathway information to generate knowledge that is more actionable rather than more complex.

Conclusion

In summary, we present PPMdb as a comprehensive database for pathway-based markers in plant genomes. To our knowledge, PPMdb is the first portal providing unique pathway-targeted marker panels not presented in any previous database. PPMdb classifies developed marker panels based on their biological functions. This advantage allows researchers to deeply focus on the functional roles of utilized markers and may explain many phenotypic variations in the future. Our developed PPMdb is substantially different from similar plant genetic marker databases. The PPMdb will regularly be updated by integrating any newly released plant genomes. Furthermore, the PPMdb graphical user interface and functionality will always be enhanced and continuously supported with new tools and technologies. Overall, we believe that the PPMdb hub will serve as a starting point or cornerstone for pathway-targeted marker research. In addition, we believe that PPMdb will catch great attention from a wide range of plant scientists in different disciplines, including genetic diversity, species characterization, population genetics, genome mapping, and targeted trait improvement.
  30 in total

1.  Assignment of allelic configuration in polyploids using the MAC-PR (microsatellite DNA allele counting-peak ratios) method.

Authors:  G D Esselink; H Nybom; B Vosman
Journal:  Theor Appl Genet       Date:  2004-04-14       Impact factor: 5.699

Review 2.  Pathway analysis of complex diseases for GWAS, extending to consider rare variants, multi-omics and interactions.

Authors:  Patrick Y P Kao; Kim Hung Leung; Lawrence W C Chan; Shea Ping Yip; Maurice K H Yap
Journal:  Biochim Biophys Acta Gen Subj       Date:  2016-11-23       Impact factor: 3.770

3.  Creation of a genome-wide metabolic pathway database for Populus trichocarpa using a new approach for reconstruction and curation of metabolic pathways for plants.

Authors:  Peifen Zhang; Kate Dreher; A Karthikeyan; Anjo Chi; Anuradha Pujar; Ron Caspi; Peter Karp; Vanessa Kirkup; Mario Latendresse; Cynthia Lee; Lukas A Mueller; Robert Muller; Seung Yon Rhee
Journal:  Plant Physiol       Date:  2010-06-03       Impact factor: 8.340

4.  Centrality-based pathway enrichment: a systematic approach for finding significant pathways dominated by key genes.

Authors:  Zuguang Gu; Jialin Liu; Kunming Cao; Junfeng Zhang; Jin Wang
Journal:  BMC Syst Biol       Date:  2012-06-06

Review 5.  Ten years of pathway analysis: current approaches and outstanding challenges.

Authors:  Purvesh Khatri; Marina Sirota; Atul J Butte
Journal:  PLoS Comput Biol       Date:  2012-02-23       Impact factor: 4.475

6.  Mining pathway associations for disease-related pathway activity analysis based on gene expression and methylation data.

Authors:  Hyeonjeong Lee; Miyoung Shin
Journal:  BioData Min       Date:  2017-02-01       Impact factor: 2.522

7.  Plant Reactome: a resource for plant pathways and comparative analysis.

Authors:  Sushma Naithani; Justin Preece; Peter D'Eustachio; Parul Gupta; Vindhya Amarasinghe; Palitha D Dharmawardhana; Guanming Wu; Antonio Fabregat; Justin L Elser; Joel Weiser; Maria Keays; Alfonso Munoz-Pomer Fuentes; Robert Petryszak; Lincoln D Stein; Doreen Ware; Pankaj Jaiswal
Journal:  Nucleic Acids Res       Date:  2016-10-30       Impact factor: 16.971

Review 8.  Advances in Omics Approaches for Abiotic Stress Tolerance in Tomato.

Authors:  Juhi Chaudhary; Praveen Khatri; Pankaj Singla; Surbhi Kumawat; Anu Kumari; Vinaykumar R; Amit Vikram; Salesh Kumar Jindal; Hemant Kardile; Rahul Kumar; Humira Sonah; Rupesh Deshmukh
Journal:  Biology (Basel)       Date:  2019-11-25

9.  The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases.

Authors:  Ron Caspi; Tomer Altman; Richard Billington; Kate Dreher; Hartmut Foerster; Carol A Fulcher; Timothy A Holland; Ingrid M Keseler; Anamika Kothari; Aya Kubo; Markus Krummenacker; Mario Latendresse; Lukas A Mueller; Quang Ong; Suzanne Paley; Pallavi Subhraveti; Daniel S Weaver; Deepika Weerasinghe; Peifen Zhang; Peter D Karp
Journal:  Nucleic Acids Res       Date:  2013-11-12       Impact factor: 16.971

10.  A genome scale metabolic network for rice and accompanying analysis of tryptophan, auxin and serotonin biosynthesis regulation under biotic stress.

Authors:  Palitha Dharmawardhana; Liya Ren; Vindhya Amarasinghe; Marcela Monaco; Jim Thomason; Dean Ravenscroft; Susan McCouch; Doreen Ware; Pankaj Jaiswal
Journal:  Rice (N Y)       Date:  2013-05-29       Impact factor: 4.783

View more
  1 in total

1.  PltRNAdb: Plant transfer RNA database.

Authors:  Morad M Mokhtar; Achraf El Allali
Journal:  PLoS One       Date:  2022-05-23       Impact factor: 3.752

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.