Literature DB >> 30395277

PlantPAN3.0: a new and updated resource for reconstructing transcriptional regulatory networks from ChIP-seq experiments in plants.

Chi-Nga Chow1, Tzong-Yi Lee2, Yu-Cheng Hung3, Guan-Zhen Li3, Kuan-Chieh Tseng4, Ya-Hsin Liu4, Po-Li Kuo3, Han-Qin Zheng3, Wen-Chi Chang1,3,4.   

Abstract

The Plant Promoter Analysis Navigator (PlantPAN; http://PlantPAN.itps.ncku.edu.tw/) is an effective resource for predicting regulatory elements and reconstructing transcriptional regulatory networks for plant genes. In this release (PlantPAN 3.0), 17 230 TFs were collected from 78 plant species. To explore regulatory landscapes, genomic locations of TFBSs have been captured from 662 public ChIP-seq samples using standard data processing. A total of 1 233 999 regulatory linkages were identified from 99 regulatory factors (TFs, histones and other DNA-binding proteins) and their target genes across seven species. Additionally, this new version added 2449 matrices extracted from ChIP-seq peaks for cis-regulatory element prediction. In addition to integrated ChIP-seq data, four major improvements were provided for more comprehensive information of TF binding events, including (i) 1107 experimentally verified TF matrices from the literature, (ii) gene regulation network comparison between two species, (iii) 3D structures of TFs and TF-DNA complexes and (iv) condition-specific co-expression networks of TFs and their target genes extended to four species. The PlantPAN 3.0 can not only be efficiently used to investigate critical cis- and trans-regulatory elements in plant promoters, but also to reconstruct high-confidence relationships among TF-targets under specific conditions.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30395277      PMCID: PMC6323957          DOI: 10.1093/nar/gky1081

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Regulatory factors contain transcription factors (TFs), modified histones, and other DNA-binding proteins that affect gene transcriptional activity and chromatin remodeling due to interactions with their target DNA sequences. Recognizing transcription factor binding sites (TFBSs), and actual regulatory regions of modified histones has been one of the most important problems in functional genomics. In the last few decades, high-throughput techniques, such as chromatin immunoprecipitation sequencing (ChIP-seq), DNase sequencing (DNase-seq) and DNA affinity purification sequencing (DAP-seq), have been carried out to reveal the genomic binding landscapes of regulatory factors (1–3). Specifically, these techniques can help scientists reveal the regulations occurring in a specific biological process or under condition of interest. Although high-throughput sequencing data have grown exponentially in the public domain, the diverse data processed methods and file formats cause low utility of these big data. Thus, there is a strong need to set up an integrative resource to interpret complicated transcriptional regulatory networks from various datasets. To date, several databases have been developed to explore the occupancy landscapes of regulatory factors, such as, Cistrome DB (4), ReMap (5), Factorbook (6), GTRD (7), PlantTFDB (8), ChIPBase (9) and Expresso (10). However, most of such resources only support mammalian and Drosophila research, and a limited number were designed for plants. Among them, PlantTFDB, ChIPBase and Expresso focus on a restricted number of ChIP-seq data, and only support Arabidopsis thaliana (8–10). Furthermore, these systems do not provide customized sequence analysis capabilities, depending on their experimental DNA binding matrices. In this update version (PlantPAN 3.0), the genomic regulatory sites of each genes will be supplied from experimental sequencing data. Additionally, the experimental matrices will be utilized in plant promoter sequence analysis. TFs regulate cell processes by binding a specific DNA motif on promoter regions and affecting downstream gene expression. However, the recognized mechanisms between TF and DNA remain unclear. To address this question, high-resolution crystal structures of DNA-binding domains have been generated to interpret protein–DNA recognition (11,12). In addition, the tertiary structures refine the functional features of proteins, such as dimerization, protein-protein interaction sites, and transcriptional activation sites in an effector domain (13). The Protein Data Bank (PDB) serves as a searchable platform for the 3D structures of proteins, nucleic acids, and multiprotein complexes (14,15). Besides tertiary structures, several primary and secondary structural features have also been used to infer protein functions. For example, variants located in pivotal amino acids or domains might confer deleterious effects on protein functions (16), and functional domains might suggest evolutionary relationships in protein families and DNA-binding preferences (13,17). The ePlant provides useful function to identify DNA binding domains and non-synonymous SNPs in 3D structures (18). Therefore, it is worth integrating these features in public TF repositories to interpret the DNA-binding structure and biological function of TFs. PlantPAN is aimed toward reconstructing transcriptional regulatory networks and providing actual cis-regulatory elements for plant genes. This study reports the first introduction of genomic binding events from 421 Chip-seq datasets for 99 regulatory factors across seven plant species into the PlantPAN system. In addition to TFBSs, ChIP-seq occupancy of histone modifications and DNA-binding proteins, such as the chromatin remodeling complex protein, have been added to illustrate the transcriptional activity of genomic landscapes. In newly constructed PCBase resource, users can not only access the ChIP-seq results of interest via four functions: Gene Search, Protein Search, Genome Browse, and Promoter Analysis, but also download the processed result files for further analysis. With the incorporation of gene annotation and the promoter sequence in seven plant species, PlantPAN 3.0 offers an efficient platform on which to identify important regulatory elements (binding sites of regulatory factors, CpG islands, tandem repeats, and conserved regions) of user queries. In addition, by adding protein sequence annotation and structural basis features, PlantPAN 3.0 is expected to help users explore the critical interaction motif among TFs and binding DNA molecules. An overview and several handpicked results of PlantPAN 3.0 are shown in Supplementary Figure S1.

DATA CONTENT AND WEB INTERFACE

ChIP-seq data collection

The plant ChIP-seq data were collected systematically from Gene Expression Omnibus (GEO) and Sequence Read Archive (SRA) (19,20). Only the dataset containing a pair of a ChIP-seq experimental sample and an input sample for a regulatory factor were collected. The datasets were manually checked using the following three criteria: (i) methods for the ChIP-seq experiments, (ii) available raw data (FASTQ or SRA formats) and (iii) comprehensive description of the experimental purpose for each ChIP-seq sample. Datasets lacking any of the above information were filtered out. In total, 421 datasets (662 samples) were used to reveal regulatory relationships for 99 regulatory factors across seven species including A. thaliana, Oryza sativa, Zea mays, Glycine max, Solanum lycopersicum, Gossypium hirsutum and A. lyrata (Table 1).
Table 1.

The ChIP-seq data statistics from seven species

SpeciesRegulatory factorsDatasetssamplesBinding sitesBinding relationships
Arabidopsis thaliana 823555353 456 486966 251
Oryza sativa 11320191480
Zea mays 6357131 43613 623
Glycine max 42038798 340163 698
Solanum lycopersicum 112127466
Gossypium hirsutum 124117 76844 830
Arabidopsis lyrata 479167 01444 051
Total994216624 574 3371 233 999
The ChIP-seq data statistics from seven species

ChIP-seq data processing and motif discovery

All datasets were systematically processed according to the following procedures: Raw data in SRA (.sra) format were converted to FASTQ (.fastq) by using SRA toolkit (version 2.8.2-1, http://www.ncbi.nlm.nih.gov/Traces/sra). The FASTX-Toolkit (version 0.0.13, http://hannonlab.cshl.edu/fastx_toolkit/) and cutadapt (version 1.16) (21) were used to remove low quality reads (reads ≧Q30 and reads up to 30 bp) and adapters from single-end and paired-end datasets, respectively. Bowtie (version 1.2.2) was used to align reads to the reference genome (22). The reference genomes used in this study are shown in Supplementary Table S1. SAMtools (version 1.4) was used to sort and cut the reads labelled ‘reads unmapped’, ‘not primary alignment’ and ‘reads are PCR or optical duplicates’ as FLAGs in Bowtie results (23). The duplicate reads were discarded using Picard (version 2.18.5, http://broadinstitute.github.io/picard/). For peak calling, SPP (version 1.14) in AQUAS pipeline and MACS2 (version 2.1.0) programs were applied to identify the protein binding sites for single-end and paired-end datasets, respectively (24–26). Consequently, a total of 4 574 337 protein binding sites were obtained (Table 1). De novo motif discovery of each protein was performed using MEME-ChIP in MEME SUITE (version 4.12.0) (27). The top three position-specific probability matrices from two motif discovery algorithms, MEME and DREME, were then used to scan the binding sites on any input promoter sequences. Totally, 2449 PWMs (position weight matrices) and 2459 motif logos were obtained.

Regulatory linkage construction

The genome locations of regulatory factor binding sites were required to construct the regulatory relationship between a regulatory factor and a target gene. For target gene recognition, all genomic binding sites from the ChIP-seq datasets were overlapped with the potential regulatory regions of all genes on their chromosome coordinates. The potential regulatory protein-coding gene regions were defined as promoter (upstream 2000 bp from transcription start sites (TSS)), 5’UTR, exon, intron, 3’UTR and downstream regions (downstream 350 bp from transcription stop sites for genes without 3’UTR). In three species (A. thaliana, Z. mays and A. lyrata), the regulatory relationships among regulatory factor and non-coding RNAs (ncRNAs) were also considered. Finally, 1,233,999 regulatory linkages were identified among the target genes (including protein coding genes and ncRNAs) and 99 regulatory factors.

Integrative information of regulatory factors

The regulatory factors collected from ChIP-seq datasets can be divided into TFs, histones, and other DNA-binding proteins. Proteins not found in PlantPAN 2.0, PlantTFDB and HistoneDB 2.0 were classified into other DNA-binding proteins (8,28,29). The detailed information about the regulatory factors was retrieved from the UniProt database (30). The additional histone modification descriptions were extracted from HistoneDB 2.0 (28).

Structure-based and protein sequence-based annotation of TFs

The secondary and tertiary structures of protein functions are significant features. Specifically, the local sequence-specific binding structures may contribute to the binding specificity of TFs. Thus, in this release, the PlantPAN provides protein–DNA complex tertiary structures and protein sequence-based annotation. The protein tertiary structures were retrieved from the PDB through the ID mapping files from UniProt (14,15,30). Due to the limited amount of structural data in PDB, the tertiary structures of homologous proteins were also collected to facilitate making a homology model and to compare structural models. The homologs of TFs in other species were congregated from the HomoloGene Database (https://www.ncbi.nlm.nih.gov/homologene) and InParanoid, (31). The JSmol applet was applied to visualize the tertiary structures. ProtVista was used to provide a graphical representation of protein sequence annotation (such as functional domains, secondary structures, post-translational modification and variants, etc.) (32). In PlantPAN 3.0, users can access these advanced functional features via the TF/TFBS Search function.

Prediction of TFBSs in promoter sequences

Since genomic binding sites are good clues to capture the appearance of conserved TF binding variations, PWMs created from high-throughput experiments have become vital for TFBS prediction. By incorporating 1100 PWMs from three studies and 2449 PWMs from ChIP-seq datasets, a total of 4703 PWMs for 17 230 regulatory factors were collected in PlantPAN 3.0 (1,3,33). The putative TFBS predictions in a given promoter sequence was implemented using the Match™ program (34). The procedure for creating the cut-off profiles is described in our previous paper (29).

New PCBase function for identifying protein–DNA regulatory relationships derived from ChIP-seq experiments

To facilitate user access to ChIP-seq data, a new portal ChIP-seq search, called PCBase was developed in PlantPAN 3.0. Two main functions, ‘Gene Search’ and ‘Protein Search’, were designed for the most frequent queries regarding transcriptional regulation. To investigate the transcriptional regulatory networks and histone modification of target genes, PlantPAN 3.0 provides six types of potential regulatory regions (i.e. upstream, 5’UTR, exon/ncRNA, intron, 3’UTR, and downstream). In the ‘Gene Search’ mode of PCBase, users can input their genes of interest to explore which regulatory factors are located on the potential regulatory regions. Moreover, users can choose a regulatory factor and a serious of datasets to obtain graphical diagrams of genomic binding locations by using the D3.js JavaScript library (https://d3js.org). In the ‘Protein Search’, users can search their datasets of interest by browsing proteins for a specific species. The result page provides (i) detailed information from each dataset, including name and type of regulatory factor, tissue, and experimental treatment, (ii) motif logos generated from MEME or DREME, (iii) Target Browse (iv) Binding Proportion, (v) Peak Browse and (vi) a downloadable table of the processed results. The ‘Protein Search’ output interfaces are summarized in Figure 1. The Target Browse function allows users to retrieve all target genes through filter options such as dataset ID, replicate, chromosome, and potential regulatory region (Figure 1B). To characterize the preferred potential regulatory region of regulatory factors, the Binding Proportion function shows the distribution of all binding events relying on six types of potential regulatory regions (Figure 1C). Furthermore, Peak Browse helps users restrict their genomic regions of interest, and each binding sites with gene structures can be visualized in the JBrowse genome browser (Figure 1E) (35). To assist users with conducting further analysis, PlantPAN 3.0 compiles the locations and sequences of peak-calling results into BED and FASTA formats, respectively (Figure 1F).
Figure 1.

The output interfaces of ‘Protein Search’ in PCBase in PlantPAN 3.0. (A) After users select a species and a regulatory factor (marked in red boxes), detailed information for the selected dataset (right) is displayed. The result page also provides (B) a searchable table for Target Browse, where user can click ‘Visualize’ to identify the location of a binding site, (C) binding proportion, (D) motif logos, (E) Peak Browse for a dataset and (F) tables to download processed files and link with external databases.

The output interfaces of ‘Protein Search’ in PCBase in PlantPAN 3.0. (A) After users select a species and a regulatory factor (marked in red boxes), detailed information for the selected dataset (right) is displayed. The result page also provides (B) a searchable table for Target Browse, where user can click ‘Visualize’ to identify the location of a binding site, (C) binding proportion, (D) motif logos, (E) Peak Browse for a dataset and (F) tables to download processed files and link with external databases.

Enhancement of the existing functions

For a cis-regulatory element search in PlantPAN, the input can be either a transcript locus or a group of genes. Four new plant species, G. max, S. lycopersicum, G. hirsutum and A. lyrata, were added (Supplementary Table S1) (36–39). Gene annotations for A. thaliana and Z. mays were updated to Araport11 and AGV.4 version (37,40). For Z. mays, the gene IDs of AGV.3 were converted to AGV.4 by using geneIDhistory files obtained from Gramene (41). Furthermore, the expression profiles of 58 samples were imported from SoyBase to illustrate co-expression networks among TFs and target genes in G. max (Supplementary Table S2) (42).

APPLICATION AND DISCUSSION

Extended usage of ChIP-data in TFBS predictions

Determining how to reduce false positive in TFBS predictions remains a difficult task in bioinformatic methods. Additionally, inferring promoter activity under different conditions or for different plant tissues is a major challenge to the study of the transcriptional regulation of genes. To handle these problems, CpG islands, tandem repeats, conserved regions, and co-expression profiles were used to identify the actual TFs/TFBSs in PlantPAN 2.0 (29). In this release, PlantPAN 3.0 provides a significant improvement in terms of elucidating the transcriptional regulation of genes by integrating experimental binding sites for TFs and other DNA-binding proteins, as well as histone modification marks from ChIP-seq data. In the proposed PlantPAN 3.0, the predicted and experimental TFBSs were shown in Jbrowse, which enables a straightforward comparison of different regulatory tracks on any genomic regions. Here, a case study of SUPPRESSOR OF OVEREXPRESSION OF CONSTANS1 (SOC1) is given below to demonstrate applications of PlantPAN 3.0. SOC1 is known to regulate flowering time in A. thaliana. A previous study suggests that APETALA1 (AP1) and SEPALLATA 3 (SEP3) may play an important role as regulators to modify the gene expression of SOC1 and change chromatin accessibility (43). Based on the analysis in PCBase function, the results show that AP1 and SEP3 occupied the promoter and intron of SOC1 in inflorescences (Figure 2), which is consistent with pervious findings. Interestingly, the H3K4me3 was found to locate in adjacent TFBSs and TSS regions of SOC1. This may imply that the activation of SOC1 gene expression is related to co-occupancy of TFs and H3K4me3 in the early developmental stage of the flower. In addition, based on the results of the TFBS prediction via PWM similarities, six binding sites of 11bps core motifs (CCAAAAA[AT]GGA) for SEP3 were found to overlap with the peak signals in the ChIP-seq experiments. On the other hand, several TFs that have been studied to regulate SOC1, such as Flowering Locus C (FLC) and AGAMOUS‐like 15 (AGL15), also can be observed in the upstream regions of SOC1 (44,45). This case reveals the comparability of the ChIP-seq dataset results and the predicted TFBSs in PlantPAN 3.0. By integrating multiple features of promoters, PlantPAN 3.0 is expected to help users reconstruct complex transcriptional regulatory genes networks and to decrease the false positives in TFBS predictions.
Figure 2.

The binding sites for SEP3, AP1, H3K4me3, FLC, and AGL15 across SOC1 gene (AT2G45660) in the Jbrowse viewer. The upstream region (+500 to -2000; Chr2:18813047–18810548) of SOC1 is highlighted with a yellow background. The experimental binding sites from ChIP-seq are shown in the first three tracks. The last three lines are the predicted TFBSs via PWM patterns.

The binding sites for SEP3, AP1, H3K4me3, FLC, and AGL15 across SOC1 gene (AT2G45660) in the Jbrowse viewer. The upstream region (+500 to -2000; Chr2:18813047–18810548) of SOC1 is highlighted with a yellow background. The experimental binding sites from ChIP-seq are shown in the first three tracks. The last three lines are the predicted TFBSs via PWM patterns.

Construction of transcription regulatory networks across different species

Isolation of homologs of an already-known gene has been widely used in tracking evolutionary conservation in gene regulations and characterizing necessary changes during genetic divergence (46,47). To assist users in comparing the gene regulation between homologous genes, PlantPAN 3.0 offers an effective ‘Cross Species’ function for detecting TFBSs in conserved regions of promoters. Furthermore, a transcriptional regulatory network in a model plant can be easily referred to understand the mechanism in several important crops, G. max, S. lycopersicum, G. hirsutum and Z. mays. In several plant species, circadian rhythms have been reported to be essential in the regulation of plant growth (48,49). For example, circadian-regulated MYB-like transcription factor, REVEILLE 1 (RVE1, AT5G17300) is able to regulate hypocotyl growth by increasing IAA concentrations through activation of the auxin biosynthesis‐related genes YUCCA8 (YUC8, AT4G28720) (50). Based on these findings, the cross species analysis between A. thaliana and G. max was performed to illustrate the usage of PlantPAN 3.0. In ‘Cross Species’ function, eleven conserved regions were identified in promoters of YUC8 and its homolog (Glyma.03G169600) (Supplementary Figure S2). As expected, the TFBS prediction results show that YUC8 promoter harbors RVE1 binding sites within seventh conserved regions (Figure 3A). In G. max, the predicted binding sites of four homeodomain-like superfamily proteins (Glyma.18G044200, Glyma.14G210600, Glyma.13G15230 and Glyma.02G241000) were found in the corresponding seventh conserved regions of Glyma.03G169600 (Figure 3A). Expectedly, there are homologous relationships among RVE1 and four homeodomain-like superfamily proteins of G. max (Figure 3B). The transcriptional regulatory network of YUC8 and Glyma.03G169600 displays similar transcriptional regulations between A. thaliana and G. max (Figure 3B). These results might imply that the circadian regulation of auxin biosynthesis is functionally conserved in G. max through auxin biosynthesis‐related genes and homeodomain-like superfamily proteins. Accordingly, other A. thaliana homeodomain-like superfamily proteins, such as LATE ELONGATED HYPOCOTYL (LHY, AT1G01060) and EARLY PHYTOCHROME RESPONSIVE 1 (EPR1, AT1G18330) also show the homologous relationships with G. max TFs. These TFs might be the new candidates involved in the regulation of circadian rhythms in both A. thaliana and G. max. This case reveals that PlantPAN 3.0 can helps users compare both similarities and differences in transcriptional regulatory networks between homologous genes across different plant species.
Figure 3.

Conserved transcription regulation between YUC8 and its homolog (Glyma.03G169600). (A) The partial TFBS prediction results in the seventh conserved regions, where RVE1 and four homeodomain-like superfamily proteins were marked in red. (B) The transcriptional regulatory network of YUC8 (pink node) and Glyma.03G169600 (light yellow node). Red and yellow nodes represent predicted TFs from A. thaliana and G. max, respectively. Green line were used to link homologous TFs.

Conserved transcription regulation between YUC8 and its homolog (Glyma.03G169600). (A) The partial TFBS prediction results in the seventh conserved regions, where RVE1 and four homeodomain-like superfamily proteins were marked in red. (B) The transcriptional regulatory network of YUC8 (pink node) and Glyma.03G169600 (light yellow node). Red and yellow nodes represent predicted TFs from A. thaliana and G. max, respectively. Green line were used to link homologous TFs.

Significance and utility of PlantPAN3.0

The utility and comparisons of PlantPAN 3.0 with other similar resources are illustrated in Table 2.
Table 2.

A comparison of PlantPAN 3.0 with previous version and similar resources

PlantPAN 3.0PlantPAN 2.0aPlantTFDB 4.0bChIPBase v2.0cExpressodReMap 2018ePCSDf
Number of species in this databases 787616510113
Number of TFs 17 23016 960320 3702620NAg46
Number of TF matrices 4 7031143674NAg (∼6 200)h000
Number of plant species in ChIP-seq datasets 7021103
Number of regulatory factors in ChIP-seq datasets 9901429 (1414)h200 (485)h110
Number of ChIP-seq samples 6620NANANANANA
Number of ChIP-seq datasets 42102654 i (10 216)h200 (2 829)h303
Annotation of Target genes YesYesNoYesYesNoYes
Histon/Nuclesome Binding regions YesNoYesYesNoYesYes
Genome Browse for binding regions YesNoYesYes iNoYesYes
Uniform ChIP-seq data processing YesNoYesNoNoYesYes
Download whole genomic binding peaks (bed/bigwig files) YesNoNoNoNoYesYes
Comprehensive curation of TF information (i.e. functional domain, response conditions, target genes, activator or repressor, and sequence logos of binding motifs) Yes (increase secondary and 3D structures, PTM, and variants)YesYesNoNoNoNo
Co-expression profiles of TFs and their target genes YesYesNoYesYesNoNo
Cis-regulatory element prediction YesYesYesNoNoNoNo

aPlantPAN 2.0: http://PlantPAN2.itps.ncku.edu.tw/ (29).

bPlantTFDB 4.0: http://planttfdb.cbi.pku.edu.cn/ (8).

cChIPBase v2.0: http://rna.sysu.edu.cn/chipbase/ (9).

dExpresso: http://bioinformatics.cs.vt.edu/expresso/ (10).

eReMap 2018: http://remap.cisreg.eu (5).

fPCSD: http://systemsbiology.cau.edu.cn/chromstates (51).

gEach TF/matrices can be accessed from the database separately. However, the total number of TFs/matrices cannot be calculated via the resource.

hThe number of data for plant species is shown without brackets, whereas the total number of data for plant and non-plant species is indicated within brackets.

iRecently, this function/resource has not been available on the website (http://rna.sysu.edu.cn/chipbase/).

A comparison of PlantPAN 3.0 with previous version and similar resources aPlantPAN 2.0: http://PlantPAN2.itps.ncku.edu.tw/ (29). bPlantTFDB 4.0: http://planttfdb.cbi.pku.edu.cn/ (8). cChIPBase v2.0: http://rna.sysu.edu.cn/chipbase/ (9). dExpresso: http://bioinformatics.cs.vt.edu/expresso/ (10). eReMap 2018: http://remap.cisreg.eu (5). fPCSD: http://systemsbiology.cau.edu.cn/chromstates (51). gEach TF/matrices can be accessed from the database separately. However, the total number of TFs/matrices cannot be calculated via the resource. hThe number of data for plant species is shown without brackets, whereas the total number of data for plant and non-plant species is indicated within brackets. iRecently, this function/resource has not been available on the website (http://rna.sysu.edu.cn/chipbase/). Based on the rapid accumulation of ChIP-seq data, several public web-based resources were developed. For example, ReMap currently increased their ChIP-seq collection and provided an exhaustive review of regulatory maps (5). Unfortunately, ReMap only supports humans. For plant species, there are several databases devoted to collecting plant ChIP-seq data and identifying their gene regulation mechanisms. For example, Expresso was created to identify the regulatory relationships among 20 A. thaliana TFs and their target genes from public ChIP-seq data (10). PlantTFDB 4.0 is a useful TF repository for green plants, providing various information leading to an understanding of the functions of TFs as well as genome-wide regulatory maps for 14 TFs (8). However, their collection in ChIP-seq experiments has only included for A. thaliana, and there is a lacks of annotation of target genes. Although ChIPBase v2.0 provided A. thaliana ChIP-seq data and co-expression profiles for TFs and target genes, the ChIP-seq data were not processed consistently (9). Moreover, recently, the function for many species has not been available on the website (http://rna.sysu.edu.cn/chipbase/). Furthermore, PCSD is dedicated to recognizing the chromatin states of A. thaliana, O. sativa and Z. mays based on both public and in-house epigenomic data (51). However, this resource did not provide the specific condition or the tissue where the regulation or chromatin state was observed. Compared among these resources and PlantPAN 3.0, the distinctive advantages of PlantPAN 3.0 are listed as follows: (i) PlantPAN 3.0 collected comprehensive public ChIP-seq datasets, which cover 421 datasets for 99 regulatory factors across seven plant species. (ii) The collected ChIP-seq datasets were processed systematically, and all analysis results are downloadable. (iii) The detailed information for each dataset and functional annotation of both TFs and target genes are available. (iv) The most complete plant PWMs are provided for analyzing TFBSs in a promoter or a set of promoters. Additionally, users can graphically compare predicted TFBSs with the experimental binding sites of regulatory factors and other important regulatory elements (CpG islands, tandem repeats, and conserved regions). (v) PlantPAN 3.0 also can be used to elucidate similarities among the expression profiles of TFs and target genes under various environmental stresses, hormone treatments, or developmental stages across four species. In summary, PlantPAN 3.0 facilitates an understanding of complicated transcriptional regulatory networks in plants.

DATA AVAILABILITY

The PlantPAN 3.0 is available via a web interface and is freely to all interested user, at http://PlantPAN.itps.ncku.edu.tw/. Click here for additional data file.
  50 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository.

Authors:  Ron Edgar; Michael Domrachev; Alex E Lash
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

3.  MATCH: A tool for searching transcription factor binding sites in DNA sequences.

Authors:  A E Kel; E Gössling; I Reuter; E Cheremushkin; O V Kel-Margoulis; E Wingender
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

Review 4.  NAC transcription factors: structurally distinct, functionally diverse.

Authors:  Addie Nina Olsen; Heidi A Ernst; Leila Lo Leggio; Karen Skriver
Journal:  Trends Plant Sci       Date:  2005-02       Impact factor: 18.313

5.  Soybean WRKY-type transcription factor genes, GmWRKY13, GmWRKY21, and GmWRKY54, confer differential tolerance to abiotic stresses in transgenic Arabidopsis plants.

Authors:  Qi-Yun Zhou; Ai-Guo Tian; Hong-Feng Zou; Zong-Ming Xie; Gang Lei; Jian Huang; Chun-Mei Wang; Hui-Wen Wang; Jin-Song Zhang; Shou-Yi Chen
Journal:  Plant Biotechnol J       Date:  2008-03-31       Impact factor: 9.803

6.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.

Authors:  Ben Langmead; Cole Trapnell; Mihai Pop; Steven L Salzberg
Journal:  Genome Biol       Date:  2009-03-04       Impact factor: 13.583

7.  Barley grain with adhering hulls is controlled by an ERF family transcription factor gene regulating a lipid biosynthesis pathway.

Authors:  Shin Taketa; Satoko Amano; Yasuhiro Tsujino; Tomohiko Sato; Daisuke Saisho; Katsuyuki Kakeda; Mika Nomura; Toshisada Suzuki; Takashi Matsumoto; Kazuhiro Sato; Hiroyuki Kanamori; Shinji Kawasaki; Kazuyoshi Takeda
Journal:  Proc Natl Acad Sci U S A       Date:  2008-03-03       Impact factor: 11.205

8.  The circadian clock regulates auxin signaling and responses in Arabidopsis.

Authors:  Michael F Covington; Stacey L Harmer
Journal:  PLoS Biol       Date:  2007-08       Impact factor: 8.029

9.  Model-based analysis of ChIP-Seq (MACS).

Authors:  Yong Zhang; Tao Liu; Clifford A Meyer; Jérôme Eeckhoute; David S Johnson; Bradley E Bernstein; Chad Nusbaum; Richard M Myers; Myles Brown; Wei Li; X Shirley Liu
Journal:  Genome Biol       Date:  2008-09-17       Impact factor: 13.583

10.  Design and analysis of ChIP-seq experiments for DNA-binding proteins.

Authors:  Peter V Kharchenko; Michael Y Tolstorukov; Peter J Park
Journal:  Nat Biotechnol       Date:  2008-11-16       Impact factor: 54.908

View more
  79 in total

1.  Prediction of condition-specific regulatory genes using machine learning.

Authors:  Qi Song; Jiyoung Lee; Shamima Akter; Matthew Rogers; Ruth Grene; Song Li
Journal:  Nucleic Acids Res       Date:  2020-06-19       Impact factor: 16.971

2.  Genome-wide analysis of general phenylpropanoid and monolignol-specific metabolism genes in sugarcane.

Authors:  Douglas Jardim-Messeder; Thais Felix-Cordeiro; Lucia Barzilai; Ygor de Souza-Vieira; Vanessa Galhego; Gabriel Afonso Bastos; Gabriela Valente-Almeida; Yuri Ricardo Andrade Aiube; Allana Faria-Reis; Régis Lopes Corrêa; Gilberto Sachetto-Martins
Journal:  Funct Integr Genomics       Date:  2021-01-06       Impact factor: 3.410

3.  OXR2 Increases Plant Defense against a Hemibiotrophic Pathogen via the Salicylic Acid Pathway.

Authors:  Regina Mencia; Gabriel Céccoli; Georgina Fabro; Pablo Torti; Francisco Colombatti; Jutta Ludwig-Müller; Maria Elena Alvarez; Elina Welchen
Journal:  Plant Physiol       Date:  2020-07-29       Impact factor: 8.340

4.  Allelic Variation of MYB10 Is the Major Force Controlling Natural Variation in Skin and Flesh Color in Strawberry (Fragaria spp.) Fruit.

Authors:  Cristina Castillejo; Veronika Waurich; Henning Wagner; Rubén Ramos; Nicolás Oiza; Pilar Muñoz; Juan C Triviño; Julie Caruana; Zhongchi Liu; Nicolás Cobo; Michael A Hardigan; Steven J Knapp; José G Vallarino; Sonia Osorio; Carmen Martín-Pizarro; David Posé; Tuomas Toivainen; Timo Hytönen; Youngjae Oh; Christopher R Barbey; Vance M Whitaker; Seonghee Lee; Klaus Olbricht; José F Sánchez-Sevilla; Iraida Amaya
Journal:  Plant Cell       Date:  2020-09-30       Impact factor: 11.277

5.  Nucleo-plastidic PAP8/pTAC6 couples chloroplast formation with photomorphogenesis.

Authors:  Monique Liebers; François-Xavier Gillet; Abir Israel; Kevin Pounot; Louise Chambon; Maha Chieb; Fabien Chevalier; Rémi Ruedas; Adrien Favier; Pierre Gans; Elisabetta Boeri Erba; David Cobessi; Thomas Pfannschmidt; Robert Blanvillain
Journal:  EMBO J       Date:  2020-10-01       Impact factor: 11.598

6.  The Acetate Pathway Supports Flavonoid and Lipid Biosynthesis in Arabidopsis.

Authors:  Leonardo Perez de Souza; Karolina Garbowicz; Yariv Brotman; Takayuki Tohge; Alisdair R Fernie
Journal:  Plant Physiol       Date:  2019-11-12       Impact factor: 8.340

7.  Insights obtained using different modules of the cotton uceA1.7 promoter.

Authors:  Marcos Fernando Basso; Isabela Tristan Lourenço-Tessutti; Carlos Busanello; Clidia Eduarda Moreira Pinto; Elínea de Oliveira Freitas; Thuanne Pires Ribeiro; Janice de Almeida Engler; Antonio Costa de Oliveira; Carolina Vianna Morgante; Marcio Alves-Ferreira; Maria Fatima Grossi-de-Sa
Journal:  Planta       Date:  2020-01-31       Impact factor: 4.116

8.  Molecular characterization of SlATG18f in response to Tomato leaf curl New Delhi virus infection in tomato and development of a CAPS marker for leaf curl disease tolerance.

Authors:  Ashish Prasad; Gunaseelen Hari-Gowthem; Mehanathan Muthamilarasan; Zakir Hussain; Pawan Kumar Yadav; Sandhya Tripathi; Manoj Prasad
Journal:  Theor Appl Genet       Date:  2021-02-07       Impact factor: 5.699

9.  Genome-wide analysis of fluoride exporter genes in plants.

Authors:  Samridhi Agarwal; Preetom Regon; Mehzabin Rehman; Bhaben Tanti; Sanjib Kumar Panda
Journal:  3 Biotech       Date:  2021-02-12       Impact factor: 2.406

10.  Combined effects of a glycine-rich RNA-binding protein and a NAC transcription factor extend grain fill duration and improve malt barley agronomic performance.

Authors:  Burcu Alptekin; Dylan Mangel; Duke Pauli; Tom Blake; Jennifer Lachowiec; Traci Hoogland; Andreas Fischer; Jamie Sherman
Journal:  Theor Appl Genet       Date:  2020-10-21       Impact factor: 5.699

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.