| Literature DB >> 32296450 |
Liya Wang1, Zhenyuan Lu1, Melissa delaBastide1, Peter Van Buren1, Xiaofei Wang1, Cornel Ghiban1, Michael Regulski1, Jorg Drenkow1, Xiaosa Xu1, Carlos Ortiz-Ramirez2, Cristina F Marco1, Sara Goodwin1, Alexander Dobin1, Kenneth D Birnbaum2, David P Jackson1, Robert A Martienssen1, William R McCombie1, David A Micklos1, Michael C Schatz1,3, Doreen H Ware1,4, Thomas R Gingeras1.
Abstract
MaizeCODE is a project aimed at identifying and analyzing functional elements in the maize genome. In its initial phase, MaizeCODE assayed up to five tissues from four maize strains (B73, NC350, W22, TIL11) by RNA-Seq, Chip-Seq, RAMPAGE, and small RNA sequencing. To facilitate reproducible science and provide both human and machine access to the MaizeCODE data, we enhanced SciApps, a cloud-based portal, for analysis and distribution of both raw data and analysis results. Based on the SciApps workflow platform, we generated new components to support the complete cycle of MaizeCODE data management. These include publicly accessible scientific workflows for the reproducible and shareable analysis of various functional data, a RESTful API for batch processing and distribution of data and metadata, a searchable data page that lists each MaizeCODE experiment as a reproducible workflow, and integrated JBrowse genome browser tracks linked with workflows and metadata. The SciApps portal is a flexible platform that allows the integration of new analysis tools, workflows, and genomic data from multiple projects. Through metadata and a ready-to-compute cloud-based platform, the portal experience improves access to the MaizeCODE data and facilitates its analysis.Entities:
Keywords: ENCODE; bioinformatics; cloud computing; functional annotations; workflows
Year: 2020 PMID: 32296450 PMCID: PMC7136414 DOI: 10.3389/fpls.2020.00289
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
SciApps release 1.0 RESTful API.
| Endpoint | Method | Description |
| /job | GET | List all jobs |
| /job/new/{id} | POST | Run a new job |
| /workflow/build | POST | Build a workflow from jobs |
| /workflowJob/new | POST | Generate a workflow JSON |
| /workflow/new | POST | Save a new workflow |
| /job/{id} | GET | Return the job JSON |
| /job/{id}/delete | GET | Delete the job |
| /workflowJob/run/{id} | GET | Run a new workflow |
| /workflow/{id}/metadata | GET | Get the workflow metadata |
| /workflow/{id}/update | POST | Update the workflow with metadata etc. |
| /workflow | GET | List all workflows |
| /apps/{id} | GET | Return the application JSON |
| /workflow/{id}/delete | GET | Delete the workflow |
| /workflow/{id} | GET | Return the workflow JSON |
| /apps | GET | List all integrated apps |
FIGURE 1Web browser interface of the MaizeCODE data page. In the middle panel page, a list of workflows/experiments is presented. Above the list, several action buttons are available: “Relaunch” the analysis, “Visualize” the graphic diagram of the workflow (with URLs for the raw sequence files from the input file node), “Load” the results to the History panel, “Share” the analysis with others, and display the experimental “Metadata”. User can perform a keyword search for a specific dataset (e.g., B73 ears RNA-Seq). In the right panel, SciApps displays the history of the selected datasets; the visualization (eye) icon opens a panel where users can generate links to visualize the results in a web browser (e.g., a QC report) or genome browser (e.g., alignments or signal tracks). The left panel shows a list of modular apps that can be launched to perform a variety of downstream analyses with the loaded results.
FIGURE 2Genome browser tracks for the MaizeCODE data. JBrowse is used to hold the MaizeCODE signal tracks, which are organized in the following order: genome, tissue, replicate, and assay. Clicking on each track brings up the workflow “Relaunch” interface.
FIGURE 3Graphical workflow diagrams for differential expression analysis (top) and MethylC-seq analysis (bottom). The interactive graph demonstrates the relationships among input–output files, displays provenance of the software tools, and provides real-time job status updates of new analyses via the color of the app node (green: completed; blue: running; yellow: pending; red: failed). Blue, yellow, and red colors are not shown.
FIGURE 4MaizeCODE MCrna app for processing RNA-Seq data from two replicates. The MCrna app wraps six tools, FastQC, bbduk, MultiQC, STAR, RSEM, and StringTie, together for QC and quantification of each replicate.
FIGURE 5Using the RSEM_de app for gene-level differential expression analysis. Job histories are “Loaded” from the MaizeCODE data page (https://www.sciapps.org/data/MaizeCODE); Clicking the RSEM_de-1.3.0 from the left App panel brings up the app form in the main panel; Dragging and dropping the gene quantification result files starting with “rsem_” into the input field, then clicking the “Submit job” button to run the differential expression analysis.